Abstract
Predicting rice yield plays a significant role in preparing production plan and relevant decision-making for following year. The paper adopts ARIMA time series algorithm taking rice yield per hectare collected between 1991 and 2010 from different prefecture-level cities of Heilongjiang Province as the object of time series processing, to analyze the rice yield recorded between 1991 and 2009 of different prefecture-level cities and to build prediction model of rice yield. By using the model, the rice yield of different prefecture-level cities in 2010 is predicted. The predicted and actual space distribution of rice yield is obtained using ArcGIS software. And by conducting spatial data analysis, the space distribution diagram is mapped out. It is analyzed that the accuracy of predicted model on average attains over 95 %, featuring good prediction effect.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Heilongjiang is the national important commodity grain base; among which rice is one of the three major crops. Predicting rice production in Heilongjiang province exerts significant impact on the national macro policy. Foreign production prediction method focuses on statistical dynamics growth simulation model, meteorological yield prediction model and remote sensing technology prediction model; while domestic production prediction is based on the mathematical model; Liu Qianpu uses the space-time regression prediction model to predict the output of grain of Henan province and municipality, and Li Bingjn predicts short-term grain output using grey linear regression model; Chen Xiangfang proposes a kind of regression tree based on multivariate time series prediction model, to predict the yield of cucumber; Xu Xingmei proposes the model based on clustering analysis and scheduling algorithm to predict corn production. By removing the noise data and reducing the data dimension [7], the paper builds space-time prediction model of rice yield and the rice yield collected in 2010 of Heilongjiang province was predicted by using this model. Such aspects as yield data processing, selection of predictors, establishment of model and stability of model is noticed during the process of prediction, which has improved the accuracy of the prediction.
To sum up, it has yet reported that the comprehensive utilization of gis and time series analysis are used for space-time prediction of rice production in Heilongjiang province.
2 ARIMA Model
The ARIMA model is difference autoregressive moving average model in its full name; ARIMA (p, d, q) is called difference autoregressive moving average model, with AR being autoregression; p autoregression item, MA moving average number, d the difference time conducted when time series become reliable. The so-called ARIMA model refers to the model established by converting non-stationary time series into the stationary time series and then performing regression of the dependent variable lag value and the present value of the random error and lag values, so as to convert the nonstationary series into stationary series.
The model is generally referred to as the ARIMA (p, d, q); the model parameter–p, d, q are nonnegative integers, meaning autoregression, the order of integration, and all parts of the moving average model. Baucus ARIMA model serves as an important part of a Jenkins method of time series model.
2.1 Stationarity Test
Usually it is based on the following functions to determine sequences’ stationary.
Mean function:
Variance function:
Auto-covariance function:
Autocorrelation function:
2.2 Pure Randomness Test
Null hypothesis: delay between periods no more than m phase sequence values are independent of each other.
The test statistic:
\( Q_{LB} = n(n + 2)\sum\limits_{k - 1}^{m} {(\frac{{\hat{\rho }_{k}^{2} }}{n - k})} \sim \chi^{2} (m) \), reject the null hypothesis, and consider the sequence as a purely random sequence, can be modeled
\( Q_{LB} = n(n + 2)\sum\limits_{k - 1}^{m} {(\frac{{\hat{\rho }_{k}^{2} }}{n - k})} \sim \chi^{2} (m) \), accept the null hypothesis, consider the sequence as pure random sequence and model terminal.
In general, take m = 6, 12, 18.
2.3 Processing of Outliers
If \( X_{{{\text{t}} + 1}} \) is an outlier, we can use \( \hat{X}_{t} \) to correction \( X_{t + 1} \), \( \hat{X}_{t} = 2X_{t} - X_{t - 1} \) [8].
3 Rice Yield Prediction Model
3.1 Data Collection
This paper collected the rice yield data in municipalities of Heilongjiang Province from 1991 to 2010, taking rice yield from 1991 to 2009 as training set, taking rice yield in 2010 as test set.
3.2 Data Processing
In the case of Rice yield per unit area of Mudanjiang city, after the treatment of abnormal, getting a visual distribution map of the rice yield, as shown in Fig. 1:
Analysis of pictorial diagram, the rice yield doesn’t move smoothly enough. After a white noise inspection, we find that the sequence exists in white noise. Quadratic differential on the sequence as shown in Fig. 2:
To make the group number of autocorrelation and partial autocorrelation, Table 1 for autocorrelation function, Table 2 for the partial autocorrelation function:
Pictorial diagram can be seen that the production distribution has no obvious cyclical rice production, and the quadratic differential autocorrelation function and partial autocorrelation function can be seen that rice yields in Mudanjiang city Heilongjiang province in 1991–2008 were stationary series.
3.3 Determine the Order Number
Based on the BIC criterion
Inside, n is the number of parameters. If an order number \( n_{0}^{{\prime }} \) meet
\( BIC(n_{0}^{{\prime }} ) = \mathop { \hbox{min} }\limits_{1 \le n \le M(N)} BIC(n) Z \), \( M(N) \) is equal to [\( \sqrt N \)] or [\( \frac{N}{10} \)], \( n_{0}^{{\prime }} \) is the best order. After the calculation, p = 3, q = 1. Therefore, rice yield prediction model was ARIMA (3, 2, 1), To examine the Bic sequence analysis by SAS software, as shown in Table 3.
4 Yield Prediction
According to the rice yield prediction model, To predict rice yield in cities of Heilongjiang province in 2010. The forecast output and the actual output are shown in Table 4. The distribution maps of Prediction and actual yield in Mudanjiang city in 2010 is shown in Fig. 3.
5 Spatial Analysis of Rice Yield
By using ArcGIS software, To establish the spatial distribution map of The yield of rice in Heilongjiang Province in 2010 cities that the actual and the predicted. As shown in Figs. 4, 5.
Using the functions of GIS spatial analysis, Error analysis for the actual yield of rice and predicted values in 2010 Heilongjiang province. The spatial distribution map of error is shown in Fig. 6.
According to the above analysis, Getting the prediction error of Rice yield of municipalities in Heilongjiang province. As shown in Table 5.
It can be seen from Table 5 that Yichun has the least prediction error of 0.26 %, with Heihe having the largest prediction error of 23.86 %. By carrying out calculation, the prediction error of different cities on average is 4.5 %.
6 Conclusions
The paper establishes time prediction model of rice production in Heilongjiang province using ARIMA model and gis technology; By using the model, the prediction value of rice yield in Heilongjiang province in 2010 is obtained; and the actual production, predicted production and predict the spatial distribution of prediction error is acquired. By conducting analysis, the prediction accuracy of the model on average reaches over 95 % that can be used for rice yield prediction and that provides relevant scientific reference for government sectors’ overall planning and decision-making.
However, the prediction error of this model used for prediction of rice yield of Heihe reaches a maximum value of 23.8 %, indicating the model is not well designed taking all factors into account that further research is still required.
References
Wang, H.: Statistics and decision making, prediction of grain yield of mixture time series model based on (12), 23–25 (2013)
Liu, Q.: The grain yield of Henan province prediction. Spatio-Temporal Regression Model Based Anhui Agric. Sci. 39(28), 17631–17633 (2011). doi:10.3969/j.issn.0517-6611.2011.28.189
Li, B., Li, Q., Lu, X.: Application of gray linear regression combination model in the prediction of grain yield in Henan province. Henan Agric. Sci. 10, 44–47 (2009). doi:10.1016/j.jhin.2010.02.010
Li, J.: Prediction of grain production in Henan province, the grey system model. J. Henan Univ. Technol. (Soc. Sci. Publishing) 5(4), 1–3, 7 (2009). doi:10.3969/j.issn.1673-1751.2009.04.001
Chen, X., Chen, M., Feng, G., et al.: Computer engineering and design model for the prediction of multi variable time series regression tree yield of cucumber 33(1), 407–411 (2012). doi:10.3969/j.issn.1000-7024.2012.01.080
Xu, X., Zhou, C., Chen, G., et al.: Clustering analysis and sequential algorithm of corn production model test. J. Jilin Agric. Univ. 34(6), 688–691, 704 (2012)
Su, H., Zhu, C., Wen, C.: Model in corn fertilization forecasting. J. Jilin Agric. Univ. 32(3), 312–315 (2010)
Xiao, Z., Guo, M.: Time Series Analysis and Application of SAS. Wuhan University Press, Wuhan (2009)
Ana, S., Leonid, S., Aleksandar, D., Slovodanka, D.-K.: GinisWeb-the tool for GIS applications development based on Web. In: 10th International Electrotechnical Conference MEleCon 2000, vol. 1, pp. 331–333
ESRI Shapefile Technical Description, GIS by ESRI (1998)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 IFIP International Federation for Information Processing
About this paper
Cite this paper
Wang, G., Hu, H., Zhang, H., Chen, Y. (2016). Study on Time and Space Prediction Model About Rice Yield in Hei Longjiang Province. In: Li, D., Li, Z. (eds) Computer and Computing Technologies in Agriculture IX. CCTA 2015. IFIP Advances in Information and Communication Technology, vol 478. Springer, Cham. https://doi.org/10.1007/978-3-319-48357-3_50
Download citation
DOI: https://doi.org/10.1007/978-3-319-48357-3_50
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48356-6
Online ISBN: 978-3-319-48357-3
eBook Packages: Computer ScienceComputer Science (R0)