Wind power forecasting based on stacking ensemble model, decomposition and intelligent optimization algorithm

doi:10.1016/j.neucom.2021.07.084

Neurocomputing

Volume 462, 28 October 2021, Pages 169-184

https://doi.org/10.1016/j.neucom.2021.07.084 Get rights and content

Abstract

Wind power forecasting has high application value in power systems. However, due to the intermittence and fluctuation of wind power, it is difficult to predict wind power effectively using a single forecasting model. Therefore, to improve the accuracy and stability of wind power forecasting, an ensemble learning model based on stacking framework is proposed in this paper. First, several decomposition techniques are used to pre-process the original wind power data and an optimal decomposition method is selected through experiments. Then, a quadratic interpolation based on state transition algorithm is proposed to optimize the parameters of the Bernstein polynomial model and the weights of the Hermite neural network (HNN) to obtain two base learners. Finally, the Spearman correlation coefficient is used to analyze the correlation of several base learners. The base learners with low correlation and strong prediction ability are selected as the first-layer forecasting model of the stacking model, and the HNN is used as the second-layer prediction model to obtain the stacking ensemble model. To verify the effectiveness of the proposed model, a large number of comprehensive experiments are carried out with wind power data from a wind farm in Xinjiang, China. Experimental results show that the proposed model has higher prediction accuracy and stability than other single forecasting models.

Introduction

With the reduction of fossil fuels and the continuous increase of electricity demand, renewable energy has become an encouraging alternative energy that provides a new research perspective for researchers. Wind power is a feasible, renewable, and sustainable energy source to overcome the problem of fossil fuel deterioration because of its low operating cost, environmental friendliness, and ability of commercial-scale energy production [1]. However, the characteristics of wind speed fluctuation, indirectness, and low energy density reduce the reliability of power system operation [2]. Wind power forecasting can improve the economy of grid dispatching and the operation safety of wind farms with renewable energy. Therefore, accurate wind power forecasting is very important for wind power integration and power system operation.

Wind power forecasting is a forecasting technology that uses meteorological observation data, wind farm environmental information, and historical wind power data to establish a forecasting model. It then uses the historical data as the input of the model to obtain future wind power information through the output of the model. With the rapid increase of innovation and technological progress in recent years, researchers have developed many wind power prediction models, which can be divided into five categories, namely, physical models, statistical models, artificial intelligence models, hybrid models, and ensemble models [3]. Although numerical weather prediction (NWP), a classical method of the physical model, can effectively predict wind power, establishing a prediction model usually requires extensive physical information of the wind farm and the surrounding environment, which makes the accurate prediction of wind power challenging [4]. Compared with physical methods, all statistical models, such as Cubature Kalman Filter [5], support vector machine (SVM) [6], least squares support vector machine (LSSVM) [7], and autoregressive integrated moving average (ARIMA) [8], use historical wind power data to establish a potential relationship with future wind power, giving them higher accuracy in short-term wind power forecasting. Nevertheless, due to the nonlinearity and randomness of wind power, it is difficult for a single statistical model to deal with time series data with complex nonlinear characteristics [9]. With the rise of artificial intelligence, the deep learning model based on neural network has received extensive attention and been successfully applied in speech recognition [10], face recognition [11], text classification [12], wind power prediction [13], and other fields. Different from the traditional statistical model, deep learning model can better learn the complex nonlinear mapping relationship between data in different fields. For example, the Recurrent Neural Networks (RNN) model not only depends on the current time value but also on the previous time step in the storage unit [14], which is widely used in speech recognition.

On the other hand, considering the limitations of a single forecasting model, many hybrid models have been proposed for wind power forecasting and other fields [15], [16]. The hybrid model mainly consists of a data preprocessing module, an optimization algorithm module, and a prediction model. The data preprocessing module mainly includes some decomposition techniques, such as complete ensemble empirical mode decomposition (CEEMD) [17], variational mode decomposition (VMD) [18], wavelet packet decomposition (WPD) [19], symplectic geometry mode decomposition (SGMD) [20], and singular spectrum analysis (SSA) [21] technique. Optimization algorithms are employed to find the optimal parameters of the forecasting model, mainly metaheuristic optimization algorithms such as particle swarm optimizer (PSO) [22], firefly algorithm (FA) [23], and grey wolf optimizer (GWO) [24]. The forecasting model is usually a statistical model or an artificial intelligence model. For example, considering the nonstationarity and chaos of wind power time series, Afshari et al. [25] proposed a new hybrid wind power forecasting method by using wavelet transform, neural network, and improved krill herd optimization algorithm. The hybrid model has higher prediction accuracy compared with other single models. Zhang et al. [26] used SSA technique to analyze the original wind power data and predict the subsequence using the SVM optimized by the cuckoo search algorithm. Wang et al. [27] proposed a hybrid wind power prediction model based on ensemble empirical mode decomposition-sample entropy and full-parameter continuous fraction optimized by primal dual state transition algorithm. The effectiveness of the proposed hybrid forecasting model is verified by a large number of comprehensive experiments. Liu et al. [28] developed a forecasting model based on improved complete ensemble empirical mode decomposition with adaptive noise (ICEEMDAN), modified multi-objective dragonfly algorithm (MMODA), and combination of multiple forecasting models. The experimental results show that the combined model is better than all the comparative models in terms of prediction accuracy and stability. Niu et al. [29] proposed a wind speed forecasting method that combines a linear model and four neural network models. They adopted the CEEMDAN technique to preprocess the original data and multi-objective grasshopper optimization algorithm (MOGOA) to optimize the combined model, which successfully overcame the limitations of a single model. Dong et al. [30] used CEEMD to decompose the original wind power data, developed a Bernstein polynomial forecasting model with mixture of Gaussians, and optimized the parameters of the model through a multi-objective state transition algorithm. The proposed hybrid model was successfully applied to wind power forecasting and obtained higher forecasting accuracy and stability compared with other comparative models.

As a branch of machine learning, ensemble technique has been proven to be an effective model and successfully applied to wind power forecasting. For example, Wang et al. [31] proposed a hybrid model based on the combination of wavelet transform, echo state network, and ensemble technique. The hybrid model applies ensemble technique to deal with common model misjudgments and data noise problems, thereby reducing the uncertainty and improving the accuracy of wind power forecasting. He et al. [32] proposed a hybrid model combining wavelet transform, deep learning, and ensemble learning and verified the effectiveness of the proposed hybrid model by simulation experiments. Wang et al. [33] proposed a hybrid method based on Bayesian model averaging (BMA) and stacking ensemble learning. In the framework of stacking, back propagation neural network (BPNN), radial basis function neural network (RBFNN) and SVM are used as the base learners for training and then BMA is used to combine the output of the three basic learners to obtain the final forecasting results. Stacking ensemble learning, a popular ensemble learning framework, provides a new approach to improve the reliability of wind power forecasting results owing to its strong generalization capabilities and reliability.

In this paper, a wind power forecasting model based on stacking ensemble learning is proposed. First, the original wind power data are preprocessed by decomposition technology, and the original data are decomposed into several subsequences. Next, a state transition algorithm based on quadratic interpolation (QISTA) is proposed, and the parameters of the Bernstein polynomial model and Hermite neural network (HNN) are optimized by QISTA. Finally, Spearman correlation coefficient is used to analyze the correlation of base learners, the optimal combination of base learners is selected as the first-layer forecasting model, and HNN is used as the second-layer forecasting model of stacking framework to summarize the output of each basic learner and obtain the final wind power forecasting results. The overall framework of the proposed forecasting model is shown in Fig. 1. In addition, the validity of the stacking ensemble model is verified on a wind farm in Xinjiang, China.

The main contributions proposed in this paper can be described as follows:

(1)
Several decomposition methods are used to pre-process the original wind power data and an optimal decomposition method is selected through experiments.
(2)
An improved state transition algorithm based on quadratic interpolation is developed to optimize the parameters of the Bernstein polynomial model and HNN.
(3)
An ensemble forecasting model based on stacking framework is built for wind power forecasting, and the effectiveness of the model is verified by a large number of comprehensive experiments.

The structure of the rest of this paper is organized as follows. In Section 2, the proposed methodology is introduced in detail, including QISTA, Bernstein polynomial, HNN, and stacking ensemble model. In Section 3, data collection and performance criteria are briefly described. In Section 4, a large number of comparative experiments are carried out to verify the effectiveness of the stacking ensemble model. Finally, the conclusion is presented in Section 5.

Section snippets

State transition algorithm based on quadratic interpolation

State transition algorithm (STA) [34], [35] is an intelligent optimization technique proposed by Zhou et al. in 2012. In general, the unified framework for generating candidate solutions in the basic STA can be stated as: $\{\begin{matrix} s_{k + 1} = A_{k} s_{k} + B_{k} u_{k} \\ y_{k + 1} = f (s_{k + 1}), \end{matrix}$ where $s_{k}, s_{k + 1} \in R^{n}$ indicate the current state and the next state, respectively, corresponding to solutions of a particular optimization problem; then, $A_{k}, B_{k} \in R^{n \times n}$ are state transformation operators; $y_{k + 1}$ is the objective function value; $u_{k}$ is a function

Data collection

To verify the performance of the proposed stacking ensemble model, 15 min wind power data from a wind farm in Xinjiang, China in 2019 are selected for experiments. Considering the different climatic characteristics of different seasons, we select 30 days of historical data from January, April, July, and October to represent the four seasons of spring, summer, autumn, and winter, respectively. The first 20 days of each month are used for training and the remaining 10 days for testing. Fig. 5

Experimental results and analysis

In this section, numerous comprehensive experiments are conducted to test the effectiveness of the proposed QISTA, decomposition techniques, and stacking ensemble model. All forecasting models are independently run 20 times in MATLAB on a desktop computer with Intel(R) Core(TM) i5-9500F CPU @3.00 GHz under Windows 10 environment. In addition, for the sake of fairness, the average results of performance criteria obtained by each forecasting model are compared.

Conclusion

In this paper, a stacking ensemble model integrating ELM, Bernstein, and HNN models is proposed for wind power forecasting, and the parameters of Bernstein and HNN models are optimized by QISTA, the decomposition technique used to pre-process the wind power historical data. The proposed stacking ensemble model can make full use of the observation ability of different prediction models on data space and structure from different angles, making it possible for different models to learn from one

CRediT authorship contribution statement

Yingchao Dong: Writing - review & editing, Conceptualization, Investigation. Hongli Zhang: Writing - review & editing, Validation. Cong Wang: Writing - review & editing, Validation. Xiaojun Zhou: Writing - review & editing, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under Grant 51767022 and Grant 51967019.

Yingchao Dong received the B. Eng. degree in Electrical Engineering from Xinjiang University, Xinjiang, China in 2017 and received MA.Sc degree in Control Science and Engineering at Taiyuan University of Technology, Shanxi, China in 2020. He is currently pursuing the PhD degree in Xinjiang University, Xinjiang, China. His research interests include modeling, optimization and wind power forecasting.

References (51)

F. Shahid et al.
A novel wavenets long short term memory paradigm for wind power prediction
Applied Energy
(2020)
P. Du et al.
A novel hybrid model for short-term wind power forecasting
Applied Soft Computing
(2019)
J. Zhao et al.
An improved multi-step forecasting model based on WRF ensembles and creative fuzzy systems for wind speed
Applied Energy
(2016)
A. Zendehboudi et al.
Application of support vector machine models for forecasting solar and wind energy resources: A review
Journal of Cleaner Production
(2018)
X. Yuan et al.
Short-term wind power prediction based on LSSVM–GSA model
Energy Conversion and Management
(2015)
C. Wu et al.
A novel hybrid system based on multi-objective optimization for wind speed forecasting
Renewable Energy
(2020)
H.M. Fayek et al.
Evaluating deep learning architectures for speech emotion recognition
Neural Networks
(2017)
G. Guo et al.
A survey on deep learning based face recognition
Computer Vision and Image Understanding
(2019)
H. Yin et al.
A cascaded deep learning wind power prediction approach based on a two-layer of mode decomposition
Energy
(2019)
Z.O. Olaofe
A 5-day wind speed & power forecasts using a layer recurrent neural network (lrnn)
Sustainable Energy Technologies and Assessments
(2014)

J. Wang et al.

An innovative hybrid model based on outlier detection and correction algorithm and heuristic intelligent optimization algorithm for daily air quality index forecasting

Journal of Environmental Management

(2020)

W. Ting et al.

Eeg feature extraction based on wavelet packet decomposition for brain computer interface

Measurement

(2008)

H. Pan et al.

Symplectic geometry mode decomposition and its application to rotating machinery compound fault diagnosis

Mechanical Systems and Signal Processing

(2019)

R. Vautard et al.

Singular-spectrum analysis: A toolkit for short, noisy chaotic signals

Physica D: Nonlinear Phenomena

(1992)

S. Mirjalili et al.

Grey wolf optimizer

Advances in Engineering Software

(2014)

X. Zhang et al.

Short-term electric load forecasting based on singular spectrum analysis and support vector machine optimized by cuckoo search algorithm

Electric Power Systems Research

(2017)

C. Wang et al.

A new chaotic time series hybrid prediction method of wind power based on EEMD-SE and full-parameters continued fraction

Energy

(2017)

Z. Liu et al.

A combined forecasting model for time series: Application to short-term wind speed forecasting

Applied Energy

(2020)

X. Niu et al.

A combined model based on data preprocessing strategy and multi-objective optimization algorithm for short-term wind speed forecasting

Applied Energy

(2019)

Y. Dong et al.

A novel hybrid model based on bernstein polynomial with mixture of Gaussians for wind power forecasting

Applied Energy

(2021)

H. Wang et al.

Echo state network based ensemble approach for wind power forecasting

Energy Conversion and Management

(2019)

H. Jiajun et al.

Ultra-short term wind prediction with wavelet transform, deep belief network and ensemble learning

Energy Conversion and Management

(2020)

G. Wang et al.

A hybrid wind power forecasting approach based on Bayesian model averaging and ensemble learning

Renewable Energy

(2020)

X. Zhou et al.

Discrete state transition algorithm for unconstrained integer optimization problems

Neurocomputing

(2016)

X. Zhou et al.

A hybrid feature selection method for production condition recognition in froth flotation with noisy labels

Minerals Engineering

(2020)

Cited by (73)

An emission predictive system for CO and NOx from gas turbine based on ensemble machine learning approach
2024, Fuel
The gas turbine in a combined cyclic power plant (CCPP) produces harmful gases like carbon monoxide (CO) and nitrogen oxide (NOx) into the atmosphere. It is evident to monitor the rate at which these gases are produced during power generation to comply with the industrial standard for emission. Therefore, a system is required to continuously monitor the emission from the CCPP gas turbine. Hence, this work aims to design a stacked ensemble machine learning (SEM) based predictive model for CO and NOx emission from a CCPP gas turbine. The neural network for regression (NNR), a generalized additive model (GAM), and the bagging of regression trees (BT) act as the base learners. A generalized regression neural network (GRNN) is used as a meta-learner for SEM. The hyperparameters of SEM are optimized using a Bayesian optimization algorithm for CO and NOX prediction. In addition to this, the performance of SEM is compared with support vector regression (SVR), decision tree (DRT), and linear regression (LIR). Simulation results demonstrate that SEM can reduce the RMSE 5.7–93.8% for NOx and 1%-41.5% for CO compared to other ML techniques. Finally, comparing the results with ML techniques existing in the literature shows the higher predictive accuracy of the proposed SEM.
Residential net load interval prediction based on stacking ensemble learning
2024, Energy
In response to the high uncertainty associated with residential net load due to the coupling of distributed photovoltaic generation and user demand, this paper proposed a novel cluster-based stacking ensemble learning model for net load interval prediction. Firstly, the k-means algorithm is employed to discover the similarity in user electricity consumption patterns. Then, a RIME optimization algorithm with local enhancement (LRIME) is developed to optimize the parameters and weights of the base learners in stacking ensemble learning. Subsequently, base learners with strong predictive capabilities and significant diversity are chosen as the first-layer predictive models, extreme learning machine (ELM) is utilized as the second-layer predictive model, ultimately resulting in the proposed stacking ensemble learning prediction model. And utilizing the bootstrap method to fit the volatility of point predictions, different prediction intervals are obtained at varying confidence levels, aiming to quantify the integrated uncertainty in photovoltaic generation and load. Testing on the open Ausgrid electricity load data in Australia provided robust validation of the proposed method's effectiveness. In comparison with other outstanding prediction models, the proposed ensemble model can effectively capture the uncertainty in integrating photovoltaic generation and user load.
An effective hybrid wind power forecasting model based on “decomposition-reconstruction-ensemble” strategy and wind resource matching
2024, Sustainable Energy, Grids and Networks
The random and fluctuating nature of wind energy brings tremendous challenges and disturbances to the security operation of wind power systems, accurate wind power prediction can effectively reduce these negative impacts. To this end, this paper proposes a hybrid wind power prediction model based on the "decomposition-reconstruction-ensemble" strategy, which consists of four main components, namely decomposition, reconstruction, prediction, and ensemble. Specifically, the original wind power series is decomposed into several sub-modes and reconstructed by frequency by the sample entropy(SE)-optimized variational modal decomposition(VMD) algorithm, subsequently, the Pearson correlation coefficients between the wind speed time series and the reconstructed components of wind power are calculated to divide the wind power series into trend and fluctuation components. Then both the two components are sequentially predicted using the temporal convolutional network(TCN) model. The final predicted value is obtained from the set of predicted results for each component. The wind power data from two wind farms in Hami, Xinjiang are adopted as examples for empirical study, and the results show that the IVMD-R-TCN model proposed in this paper performs significantly better than the benchmark model, which illustrates the predictive validity of the proposed model and is an effective tool for wind power forecasting.
A short-term wind power forecasting method based on multivariate signal decomposition and variable selection
2024, Applied Energy
Accurate and effective short-term wind power forecasting is vital for the large-scale integration of wind power generation into the power grid. However, due to the intermittence and volatility of wind resources, short-term wind power forecasting is challenging. To address the issue that the existing decomposition forecasting methods ignore the coupling relationship between wind power series and multiple meteorological series, this study proposes a short-term wind power forecasting method based on multivariate signal decomposition and variable selection. First, multivariate variational mode decomposition (MVMD) is used to perform time-frequency synchronous analysis on wind power and multidimensional meteorological series, thereby decomposing them into the same predefined number of frequency-aligned intrinsic mode functions (IMFs). Secondly, elastic net (EN) is used for supervised variable selection on all IMFs to provide a high-quality training set for the forecasting model, thereby enhancing precision and interpretability. Next, a hybrid deep neural network combining convolutional neural network (CNN), bidirectional long-short term memory (BiLSTM) neural network, and multi-head attention (MHA) mechanism is employed to model the output curve of a group of wind turbines in a wind farm. Finally, the proposed method is comprehensively evaluated through four sets of comparative experiments and multiple evaluation metrics on data gathered from the Mahuangshan first wind farm in China with four forecasting horizons: 15-min ahead, 30-min ahead, 45-min ahead, and 1-h ahead. The experimental results show that the proposed method significantly outperforms fifteen existing deep learning methods in terms of precision and stability.
Hierarchical transfer learning with applications to electricity load forecasting
2024, International Journal of Forecasting
The recent abundance of electricity consumption data available at different scales provides new opportunities and highlights the need for new techniques to leverage information present at finer scales in order to improve forecasts at wider scales. In this study, we take advantage of the similarity between this hierarchical prediction problem and transfer learning where source data are observed at a low aggregation level and target data at a global level. We develop two methods for hierarchical transfer learning based on stacking generalized additive models and random forests (GAM-RF). We also propose and compare adaptations of online aggregation of experts in a hierarchical context using quantile GAM-RF as experts. We apply these methods to two electricity load forecasting problems at the national scale by using smart meter data in the first case and regional data in the second case. For these two user cases, we compared the performance of our methods and benchmark algorithms, and investigated their behavior using variable importance analysis. Our results demonstrate that both methods can lead to significantly improved predictions.
Research on the short-term wind power prediction with dual branch multi-source fusion strategy
2024, Energy
The smooth operation of the power system imposes high requirements on wind power forecasting. Given the forecasting challenges brought by the coexistence of strong and weak disturbances in wind power data, this paper proposes a hierarchical anti-disturbance mechanism to raise the accuracy of single-point wind power prediction. In the first-level disturbance processing, we prioritize the treatment of strong disturbances in wind power data through a dual-branch multi-source fusion prediction structure. The wind power subsequences with distinct time–frequency characteristics are divided into two sets, processed by the stationary set prediction branch and the drastic set prediction branch, respectively. For the drastic set, we incorporate microenvironmental multi-source data surrounding the wind turbine as input and propose a parallel-structured attention fusion module CA (Convolution attention fusion block) to provide effective data support for the prediction module, then extract rich fusion features through deep temporal LSTM algorithm and ultimately achieve fine-grained forecasting. For weak disturbances that have not been adequately addressed in the previous step, we conduct a secondary treatment through error correction strategy. By employing the hierarchical processing approach, this paper accomplishes a comprehensive handling of disturbances with different intensities. Experimental results on four data sets show that the proposed model DEWFM (Dual-branch and error correction wind power forecast) demonstrates remarkable performance advantages in comparing with the baseline model and other advanced models, the average values of MAE reaching 1.679, revealing the effectiveness of the proposed method in enhancing the precision of short-term wind power prediction.

View all citing articles on Scopus

Hongli Zhang received the BS degree from Xinjiang Institute of Technology in 1995, MS degree from Xinjiang University, Xinjiang, China in 2001, and PhD degree from Beijing Institute of Technology, Beijing, China in 2009. He is currently a professor at Xinjiang University, Xinjiang, China. His research interests include big data analysis, machine learning, and intelligent computing.

Cong Wang received the BS and PhD degrees from Xinjiang University in 2013 and 2018, respectively. She is currently an Associate Professor at Xinjiang University, Xinjiang, China. Her research interests include machine learning, intelligent computing, and system modeling and control.

Xiaojun Zhou received his Bachelor’s degree in Automation in 2009 from Central South University, Changsha, China and received the PhD degree in Applied Mathematics in 2014 from Federation University Australia. He is currently an Associate Professor at Central South University, Changsha, China. His main interests include modeling, optimization, and control of complex system, artificial intelligence and machine learning, optimization theory and algorithms, duality theory and global optimization with applications.

View full text

Wind power forecasting based on stacking ensemble model, decomposition and intelligent optimization algorithm

Abstract

Introduction

Section snippets

State transition algorithm based on quadratic interpolation

Data collection

Experimental results and analysis

Conclusion

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Applied Energy

Applied Soft Computing

Applied Energy

Journal of Cleaner Production

Energy Conversion and Management

Renewable Energy

Neural Networks

Computer Vision and Image Understanding

Energy

Sustainable Energy Technologies and Assessments

Journal of Environmental Management

Measurement

Mechanical Systems and Signal Processing

Physica D: Nonlinear Phenomena

Advances in Engineering Software

Electric Power Systems Research

Energy

Applied Energy

Applied Energy

Applied Energy

Energy Conversion and Management

Energy Conversion and Management

Renewable Energy

Neurocomputing

Minerals Engineering