Short communication
Understanding cumulative sum operator in grey prediction model with integral matching

https://doi.org/10.1016/j.cnsns.2019.105076Get rights and content

Highlights

  • We investigate the mechanism of cumulative sum operator in grey prediction model by using integral transformation.

  • We provide a novel paradigm for estimating the structural parameters and the initial value simultaneously.

  • We analyze the influence of sample size and noise level on modelling performance by using large-scale simulation studies.

Abstract

Grey prediction models have been widely used in various fields and disciplines. Cumulative sum operator, also called accumulative generation operator, is an essential step in grey modelling, but until now relatively limited attention has been paid to its mechanism of action. In this paper, we introduce the integral matching to explain it. By using the integral transformation, the grey prediction model whose nature is modelling the cumulative sum series with a differential equation proves to be equivalent to that modelling the original series with a reduced differential equation. The cumulative sum operator is the discretization and approximation of the definite integral terms by using the piecewise constant integral and thus can be improved by using the piecewise linear integral. Simulation studies detail the advantages in terms of the stability and robustness to noise.

Introduction

Time series prediction has been an important topic in various applications ranging from science through technology to engineering. In the past years many prediction methods have been developed, such as exponential smoothing [1], Box-Jenkins models [2], support vector regression [3], neural network [4], etc. See [5] for a review. Each model has its own modelling assumptions and application scopes. Grey prediction model, as one of the time series analysis and prediction approaches, assumes that the original time series is partially quasi-exponential. Since the cumulation and release in many generalized energy system usually conform to the quasi-exponential law [6], research has been consistently shown that grey modelling approach is promising in various fields and disciplines.

Grey model GM(1,1), first order (the first ‘1’) ordinary differential equation with one variable (the second ‘1’), was first proposed to modelling and forecasting the small-sample quasi-exponential time series in socioeconomic system [7]. In order to improve the accuracy, a large number of studies have been conducted. These include the optimization methods and the extension models. The optimization work is mainly focused on improving the performance without changing the structure of GM(1,1) model, such as the background coefficient optimization [8], [9], [10], the initial value improvement [11], [12], [13], the introduction of rolling and moving mechanism [14], [15], [16], and the combination with other models [17], [18], [19]. The extension studies are focused on extending the application scopes by changing the forcing term of the differential equation in GM(1,1) model from a constant to a time-varying term. These include NGM(1,1) model using the linear function of time as the forcing term [20] for the quasi non-homogeneous exponential series, and GPM(1,1,N) model using the polynomial function of time as the forcing term for the time series coupled with quasi-exponential and time-varying polynomial characteristics [21], [22], as well as KRNGM(1,1) model using the kernel function as the forcing term for more complex nonlinear time series [23]. In addition, these grey prediction models have been successfully used in both traditional and emerging fields. See [24] for a review.

In the above optimized and novel models, cumulative sum operator is not only an essential step but also one of the major feature distinguishing grey prediction models from other methods. But by now it still lacks the mechanism analysis especially from a mathematical analysis perspective, although reference [6] claims that the cumulative sum operator is able to recognize the pattern hidden in the time series. Furthermore, the structural parameters and the initial value are estimated by using a two-step method in the classical models, which may introduce extra error in the second step. Therefore, it is also necessary to analyze the mechanism of cumulative sum operator from a statistical perspective and then propose a simultaneous parameter estimation method.

The remainder of this paper is organized as follows: we give a brief introduction to GM(1,1) model in Section 2, and elaborate the modelling mechanism by introducing the integral matching in Section 3; the large-scale simulation studies are conducted in Section 4; the conclusion and future work are present in Section 5.

Section snippets

Basic concepts of GM(1,1) model

Definition 1

For a time series X(t)={x(t1),x(t2),,x(tn)}, the cumulative sum series is defined as Y(t)={y(t1),y(t2),,y(tn)}, wherey(tk)=i=1khix(ti),hk={1,k=1,tktk1,k=2,3,,n.Then the coupled equation{ddty(t)=ay(t)+b,y(t1)=ξ,tt1(2a)x(tk)=a(12y(tk1)+12y(tk))+b,k=2,3,,n(2b)is referred to as GM(1,1) model [6], and equations (2a) and (2b) are called the continuous-time and discrete-time forms, respectively.

Remark 1

The discrete-time equation (2b) is the approximation of equation (2a) by using the trapezoid rulex(t

Integral matching for reduced differential equation

Lemma 1

Lettingy(t)=x(t1)+t1tx(τ)dτ,tt1,then the differential equation (2a) is equivalent to the reduced differential equationddtx(t)=αx(t),x(t1)=β,tt1where α=a and β=b1a.

Proof

It can be seen from Eq. (6) that the initial values of differential equations (2a) and (7) are both equal to y(t1)=ξ=x(t1)=β=b1a.

Let us first prove that Eq. (7) can be derived from equation (2a). By substituting Eq. (6) into the left side and right side of equation (2a), respectively, it follows thatddty(t)=ddt(x(t1)+t1tx(τ)dτ)=x

Data generation process

In this section, we analyze the performance of the above approaches in Monte Carlo simulation studies. The following data generation process is considered:x˜(t)=x(t)+ε(t)where x(t) is obtained by explicitly solving the differential Eq. (7); ε(t)N(0,σ2) is the additive Gaussian noise where the noise level is measured by the signal to noise ratio SNR=Var[x(t)]Var[ε(t)]=1σ2Var[x(t)].

The observations are sampled from x˜(t) at every time interval of h in the range of t ∈ [0, 5], which generates 5h+1

Conclusion and future work

The cumulative sum operator is an essential step in the grey modelling process, and also the main feature distinguishing grey prediction models from other methods. We explain the mechanism of cumulative sum operator in the modelling process from both mathematical analysis and parameter estimation viewpoints. The integral matching provides a novel paradigm for simultaneous estimation of structural parameter and initial value in grey prediction models. Large-scale simulation experiments are

Declaration of Competing Interest

The authors declare that there are no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The authors appreciate the anonymous referees for the insightful comments and suggestions. This work was supported by the National Natural Science Foundation of China (Grants 71671090 and 71871117), Joint Research Project of National Natural Science Foundation of China and Royal Society of UK (Grant 71811530338), Fundamental Research Funds for the Central Universities of China (Grant NP2018466), Qinglan Project for excellent youth or middle-aged academic leaders in Jiangsu Province, China.

References (30)

Cited by (45)

  • A non-linear grey Fourier model based on kernel method for seasonal traffic speed forecasting

    2024, Communications in Nonlinear Science and Numerical Simulation
View all citing articles on Scopus
View full text