A novel multi-variable grey forecasting model and its application in forecasting the grain production in China

https://doi.org/10.1016/j.cie.2020.106915Get rights and content

Highlights

  • A novel multi-variable grey forecasting model, NMGM(1,N), is proposed.

  • A smooth generation method with variable weight is used to build NMGM(1,N).

  • NMGM(1,N) can weaken the influence of extreme value on model performance.

  • The novel model is successfully used to forecast the grain production in China.

Abstract

In the multi-variable grey forecasting model GM(1,N), the extreme value of the independent variable is one of the important factors that affect the simulation and prediction results of the dependent variable. In this study, a smooth generation method was used to weaken the influence of the extreme value on the performance of GM(1,N), and a novel multi-variable grey forecasting model NMGM(1,N) based on the smooth generation of independent variable sequences with variable weights was constructed. The parameters of NMGM(1,N) were estimated and proved, and the time-response expression and the final restored expression of NMGM(1,N) were deduced. Besides, the detailed modeling steps were provided, and a MATLAB program for building the new model was developed. Moreover, the evaluation criteria of the model were introduced, and numerical examples were used to test the performance of the model. Lastly, the new model was applied to forecast China’s grain production. The global mean relative percentage error of the new model was only 0.689%, in comparison with the ones obtained from the GM(1,N), GM(0,N) and OGM(1,N), which were 4.268%, 3.480% and 1.302% respectively. The findings show that the new model has the best performance, which confirms the effectiveness of the structure improvement.

Introduction

Grey forecasting models can be divided into two types: univariate and multi-variate models. In recent years, the grey forecasting model has been researched extensively and a series of important achievements have been made (Shaikh et al., 2017, Zeng et al., 2020, Zeng et al., 2020, Zhou et al., 2020, Zeng et al., 2020). The multi-variable grey forecasting model is represented by GM(1,N). The modeling object of the model is composed of a dependent variable (or System characteristic variable) and ‘N-1′ independent variables (or Related factor variable; Driver items) (Liu, Forrest, & Yang, 2012). The modeling process considers fully the influence of independent variables on the change trend of the dependent variable. GM(1,N) makes up the deficiency of single structure and limited simulation ability of single-variable grey prediction models. However, GM(1,N) has only been used as a system analysis tool for a long time, and its important prediction ability has not been widely promoted and applied. The main reason is that the model has many shortcomings in the modeling mechanism and model structure, which often leads to greater model errors than that of the GM(1,1) model in practical applications.

In order to improve the modeling performance of GM(1,N), scholars made a lot of efforts, which mainly include three aspects: expanding the model structure (Xiong, Huang, Peng, & Wu, 2020), optimizing model parameters (Dang et al., 2017, Wang and Hao, 2016, Wu and Zhang, 2018) and broadening the model’s application scope (Zeng, Shu, Yan, Shi, & He, 2019).

(a) Expanding the model structure: By Expanding the structure of GM(1,N), some structural parameters, such as the time lag term (Mao, Gao, & Xiao, 2015), were added into the GM(1,N) model, and then some new GM(1,N) models were proposed. Xie, Wu, Li, and Li (2020) introduced a multivariate nonlinear grey model based on the kernel method to improve the prediction accuracy of traffic-related emissions. Based on a Gaussian vector basis kernel function and a global polynomial kernel function combined with the characteristics of grey prediction models, a new multi-kernel GMC(1,N) model (Duan, Wang, Pang, Liu, & Zeng, 2020), which was more comprehensive and suitable for nonlinearity, was established.

(b) Optimizing model parameters: By optimizing the parameters of GM(1,N), such as the background-value coefficient (Zeng, Yan, He, & Shi, 2020) and initial-condition value (Luo & Liu, 2017), some new GM(1,N) models were established. Based on the whitening equation and recursive method of the GMC(1,N) model, a new multivariate grey prediction model RDGM(1,N) (Ma & Liu, 2016) was proposed. Based on the existing fractional multivariate grey model with convolution integral, a new fractional multivariate discrete grey model (Ma et al., 2019) was proposed.

(c) Broadening the model’s application scope: By broadening the application scope of GM(1,N), some new GM(1,N) models were proposed. The GM(1,N) model based on the virtual variable control (Ding, Dang & Xu, 2018) was established, which provides a quantitative solution to the small sample modeling problem when the system variable is affected by the virtual variable. The IEGM(1,N) model (Ding, Dang, Xu, & Wang, 2018) considering the interaction between driving factors was proposed, which provides an effective analysis tool for solving multi-variable, small-sample system modeling with interaction.

The above researches on GM(1,N) greatly enriched the theoretical system of the multi-variable grey forecasting model, expanded its application scope, and promoted its integration with practical problems. Nowadays, the upgraded GM(1,N) models are widely used in the fields of transportation, agriculture, energy, management, etc. (Ahmed et al., 2020, Wu et al., 2018, Ye et al., 2020), solving practical problems, such as engineering prediction (Guo, Liu, Wu, Gao, & Yang, 2015), simulation of anaerobic digestion system (Ren, 2018) and estimation of electronic waste (Duman, Kongar, & Gupta, 2019).

At present, all GM(1,N) models are established by directly using the sequences of independent variables as drivers. However, there exists a major drawback in the modeling process of these models. The sequences of independent variables are the basis of constructing the parameter matrix of GM(1,N), which determines the size of model parameters and the rationality of simulation and prediction results. It can be seen that the sequence of independent variables is an important factor that affects the modeling performance of GM(1,N). Currently, GM(1,N) models all use the sequences of independent variables for modeling directly. The extreme values in the sequences are likely to affect the rationality of the estimated values of model parameters. Therefore, to weaken the influence of extreme values in sequences of independent variables on the modeling results of GM(1,N), we proposed a new method of smooth generation of independent variable sequences with variable weights and constructed a new multi-variable grey forecasting model NMGM(1,N).

In order to verify the performance of the novel model and compare it with other models, we used NMGM(1,N) to forecast China’s grain production, and compared it with GM(1,N), GM(0,N) (Liu, Yang & Forrest, 2016) and OGM(1,N) (Zeng, Luo, Liu, Bai, & Li, 2016). The results showed that the global mean relative percentage error (GMRPE) of NMGM(1,N), GM(1,N), GM(0,N) and OGM(1,N) were 0.689%, 4.268%, 3.480% and 1.302% respectively. It can be seen that the performance of NMGM(1,N) was much better than that of GM(1,N) and GM(0,N). Even compared with OGM(1,N), the model with the best performance currently, its performance was also better. Similarly, the effectiveness and practicability of NMGM(1,N) model were also verified by other cases and evaluation criteria.

The rest of this paper is organized as follows. In Section 2, we introduce the OGM(1,N) model and analyze its main defect. In Section 3, we propose an improved GM(1,N) model based on the smooth generation method, then study and prove its parameter estimation, and derive its time-response expression and its final restored expression. In Section 4, we provide the detailed modeling steps and the MATLAB program for building the new model and optimizing the weight of the drivers of the model by the PSO algorithm. In Section 5, we introduce the evaluation criteria of the model, and verify the performance and stability of the model by numerical examples. In Section 6, we use the new model to forecast China's grain production. In this application, we describe the modeling and forecasting process in detail. In Section 7, we make a further overview of the improved new model and a prospect of future research work.

Section snippets

The OGM(1,N) model and its defect analysis

On the basis of analyzing the defects of the traditional GM(1,N) model in modeling mechanism, parameter estimation and model structure, the OGM(1,N) model was put forward by introducing a linear lag term and grey action quantity. The following part gives a simple description of the OGM(1,N) model and an analysis of its main shortcoming.

Definition 1

Let X10 be the system characteristic sequence (or called dependent variable sequence).

X10=x101,x102,,x10m

Let Xi(0)i=2,3,,N be the explanatory variable

The NMGM(1,N) model

The smooth generation method is a common method to weaken the extreme or abnormal values of sequences. Therefore, in this study, we used the smooth generation of independent variable sequences with variable weights to replace the traditional independent variable sequences generated by accumulated operation. Then, a novel multi-variable grey forecasting model based on the OGM(1,N) model with an improved structure was constructed, and the novel model was named the NMGM(1,N) model (Novel

Optimizing weight by PSO algorithm

The PSO (Particle Swarm Optimization) is a group intelligence algorithm for global optimization evolution designed by simulating the predation behavior of birds. The algorithm has the advantages of simple structure, few parameters and easy programming, and it has been widely used in function optimization, neural network training etc. (Navabi et al., 2020, Nouiri et al., 2018, Sadiq et al., 2020, Wang and Li, 2019). In this study, we used the PSO algorithm to optimize the drivers of the NMGM(1,N

Evaluation criteria

In order to evaluate the performance of the model, we use the mean relative percentage error (MRPE) and the absolute degree of grey incidence as the criteria. MRPE evaluates the overall performance of the model accuracy, while the absolute degree of grey incidence tests the similarity between the simulated sequence and the original sequence. In addition, in order to evaluate the stability of the model, we use the standard deviation (STD) of relative percentage error and the maximum relative

Forecasting the grain production in China

Grain production prediction is an important research topic. Effective analysis and accurate prediction of grain production is of great significance in strengthening the macro-control of grain production, promoting the adjustment of policies and ensuring food security. To test the effectiveness of NMGM(1,N), China’s grain production and its influencing factors from 2003 to 2018 were used as the experimental data, as shown in Table 4. The data were obtained from China statistical yearbook 2019.

Conclusion

In order to weaken the influence of extreme values in the sequences of independent variables on the accuracy of multi-variable forecasting models, a novel multi-variable grey model called NMGM(1,N) was proposed on the basis of the well-structured OGM(1,N) model, which adopted the smooth generation of independent variable sequences with variable weights as the driving term. The new model was applied in forecasting China’s grain production and compared with three other models, GM(1,N), GM(0,N)

CRediT authorship contribution statement

Bo Zeng: Conceptualization, Writing - review & editing. Hui Li: Writing - review & editing. Xin Ma: Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

We thank the editors and the anonymous referees for their insightful comments and suggestions to improve the paper.

This work was supported by the National Natural Science Foundation of China (71771033, 72071023), Chongqing Natural Science Foundation of China (cstc2019jcyj-msxmX0003 and cstc2019jcyj-msxmX0767) and the Key Project of Scientific and Technological Research of Chongqing Education Commission (KJZD-K202000804).

Bo Zeng was born in 1975. He received his Ph.D. from Nanjing University of Aeronautics and Astronautics of China in 2012. He is a professor at Chongqing Technology and Business University. Currently, he serves on the editorial board of the International Journal of Grey System. His research interests include modeling methods of prediction and decision-making, optimization and control, intelligent algorithms and their applications. Email: [email protected], Tel: +86-18523027911.

References (33)

Cited by (67)

View all citing articles on Scopus

Bo Zeng was born in 1975. He received his Ph.D. from Nanjing University of Aeronautics and Astronautics of China in 2012. He is a professor at Chongqing Technology and Business University. Currently, he serves on the editorial board of the International Journal of Grey System. His research interests include modeling methods of prediction and decision-making, optimization and control, intelligent algorithms and their applications. Email: [email protected], Tel: +86-18523027911.

Miss. Hui Li was born in 1995. She is a graduate student of School of Management Science and Engineering at Chongqing Technology and Business University. Her research interests are focused on Grey System Theory and Grey Forecasting Model. Email: [email protected], Tel:+1-5223067628.

Xin Ma was born in 1989, Sichuan China. He got his M.S. and Ph.D degrees in Petroleum Engineering Computing Technology from Southwest Petroleum University in 2016 and 2013, respectively; and B.S. degree in Mathematics & Applied Mathematics from Xihua University in 2010. He was a visiting scholar of KU Leuven in 2019, and is now working as lecturer in Southwest University of Science and Technology from 2016. His research interests include grey system and machine learning with applications in energy. Email: [email protected]

View full text