A novel decomposition ensemble model with extended extreme learning machine for crude oil price forecasting

doi:10.1016/j.engappai.2015.04.016

Engineering Applications of Artificial Intelligence

Volume 47, January 2016, Pages 110-121

https://doi.org/10.1016/j.engappai.2015.04.016

Highlights

• A novel decomposition-and-ensemble forecasting model is built for crude oil price.
• The powerful data decomposition of EEMD is utilized to simplify the complex data.
• The effective and stable AI tool of EELM is employed to ensure prediction accuracy.
• Case study compares this model with typical techniques and similar ensemble models.
• The model is shown superior in terms of high accuracy, time saving and robustness.

Abstract

Crude oil is one of the most important energy resources, and an accurate prediction of its price can effectively support rapid new production development with higher production quality and lower production cost. Accordingly, a novel decomposition-and-ensemble learning paradigm integrating ensemble empirical mode decomposition (EEMD) and extended extreme learning machine (EELM) is proposed for crude oil price forecasting, based on the principle of “decomposition and ensemble”. This learning model contributes to the literature by introducing the powerful artificial intelligence (AI) technique of EELM into the ensemble model formulation. In the proposed method, EEMD, a competitive decomposition method, is first applied to divide the original crude oil price time series into a number of relatively regular components, for simplicity. Second, EELM, a recently proposed, powerful, effective and stable forecasting tool, is implemented to predict all components independently. Finally, these predicted results are aggregated into an ensemble result as the final prediction, using the simple addition ensemble method. For illustration and verification purposes, the proposed learning paradigm is used to predict the WTI crude oil spot price. Empirical results demonstrate that the proposed ensemble learning paradigm statistically outperforms all considered benchmark models (including popular single models and similar ensemble models) in both prediction accuracy (in terms of level and directional measurement) and effectiveness (in terms of time saving and robustness), indicating that it is a promising tool for predicting complicated time series with high volatility and irregularity.

Introduction

International crude oil price prediction has become an increasingly active topic in energy analysis and economic management, because accurate forecasts can support rapid new production development with higher production quality and lower production cost. First, due to the significant role of crude oil in the global economy and society (Oman, 2003), an accurate prediction of the crude oil market is indispensable for stable and rapid economic development and hence for new production development. In particular, a sharp rise in crude oil price would result in inflation and economic recession in oil-consuming nations, and would further harm the global economy. In contrast, a rapid fall in crude oil price would hinder the economic development of oil-producing countries, and could further generate political instability and social unrest (Gholamian et al., 2005, Chen and Hsu, 2012). Therefore, an accurate prediction of crude oil price can help capture the market dynamics and design corresponding policies that avoid high price volatility and thus reduce market risk, which in turn enables a stable macroeconomic environment for rapid new production development. Second, since crude oil is one of the most important energy inputs, an accurate prediction of its price can help make appropriate production plans for new products in terms of higher quality and lower cost. In particular, a higher crude oil price raises the production cost for the same use of crude oil, and vice versa. Therefore, an accurate prediction of crude oil price can help make and revise production plans for new products and techniques when determining the various inputs, which can significantly enhance the quality and reduce the cost of new production development.
However, forecasting crude oil price has proved to be an extremely difficult task, due to interacting underlying factors such as supply and demand, competition across providers, substitution with other energy forms, technological development, domestic economy, deregulation activities, globalization and even uncertainties caused by political instabilities, wars and conflicts (Chen, 2009, He et al., 2012, Zhang et al., 2009). To address this difficulty, this paper concentrates on crude oil price forecasting, aiming to improve prediction performance in terms of both prediction accuracy and time saving.

According to the existing literature, a variety of forecasting models have been formulated for international crude oil price prediction. Generally, they fall into two main categories. The first category comprises traditional statistical and econometric techniques, such as linear regression (LinR), generalized autoregressive conditional heteroskedasticity (GARCH) family models, random walk (RW), grey model (GM) and error correction models (ECM). For example, a sophisticated econometric model was applied to predict crude oil price (Huntington, 1994). Lin (2009) predicted the international crude oil futures price via GM(1,1). Hou and Suardi (2012) implemented a nonparametric GARCH model to predict the return volatility of oil price. Mohammadi and Su (2010) proposed a novel hybrid model, coupling ARIMA and GARCH models, to estimate the conditional mean and volatility of weekly crude oil spot prices in eleven international markets. Similarly, Murat and Tokat (2009) employed RW to forecast oil price movements. Lanza et al. (2005) investigated the prices of crude oil and oil products using an ECM. In addition, the autoregressive integrated moving average (ARIMA) model, the most typical traditional time series model, has frequently been applied as a benchmark in crude oil price forecasting (e.g., He et al., 2012, Yu et al., 2008, Li et al., 2013).

However, these traditional econometric techniques may be insufficient to capture the hidden nonlinear features in crude oil price (Bao et al., 2007, Yu et al., 2007), so a new approach is needed to remedy their shortcomings. Artificial intelligence (AI) models with powerful self-learning capacities, such as artificial neural networks (ANNs), support vector machine (SVM) and other intelligent optimization algorithms, have recently become increasingly popular for crude oil price forecasting, and the empirical results of previous studies demonstrated their superiority to traditional methods. For ANN, Abdullah and Zeng (2010) introduced ANN to analyze the quantitative data of crude oil price. Kulkarni and Haidar (2009) presented a multilayer feed-forward neural network (FNN) to predict crude oil spot price. Kaboudan (2001) employed genetic programming (GP) and ANN to forecast crude oil price. As for SVM, Xie et al. (2006) implemented an SVM model for crude oil price forecasting and compared its prediction performance with ARIMA and back-propagation neural network (BPNN). Khashman and Nwulu (2011) employed SVM to predict crude oil price. Li and Ge (2013) improved the ε-support vector regression (ε-SVR) machine with dynamic error correction for crude oil price forecasting. All these studies demonstrated that AI models are superior to statistical models in modeling the nonlinear and complicated data of crude oil price.

Though AI models (e.g., ANN and SVM) are very effective relative to traditional models, they also have their own shortcomings. For example, long training time, slow convergence and local minima may be the most important disadvantages, especially for ANN. To overcome these drawbacks, a novel learning algorithm called extreme learning machine (ELM), a special case of single hidden layer feedforward networks (SLFNs), was proposed by Huang et al. (2004). It tends to provide better generalization performance and much faster learning speed than the above gradient-based learning algorithms, without the need to set stopping criteria, learning rate or learning epochs.
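For reference, the standard ELM formulation from the literature can be written as below. The notation (N training samples, L hidden nodes, activation function g, hidden-layer output matrix H, target vector T) is an assumption of this note, since the text above does not fix any symbols.

```latex
\begin{align}
f(\mathbf{x}_j) &= \sum_{i=1}^{L} \beta_i \, g(\mathbf{w}_i \cdot \mathbf{x}_j + b_i), \qquad j = 1, \dots, N \\
\mathbf{H}\boldsymbol{\beta} &= \mathbf{T}, \qquad H_{ji} = g(\mathbf{w}_i \cdot \mathbf{x}_j + b_i) \\
\hat{\boldsymbol{\beta}} &= \mathbf{H}^{\dagger}\mathbf{T}
\end{align}
```

Here the input weights w_i and biases b_i are drawn at random and kept fixed, and H† denotes the Moore-Penrose pseudoinverse of H. Only the linear system for the output weights β is solved, which is why no learning rate, epoch count or stopping criterion has to be chosen.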

According to the existing literature, little research has applied ELM to crude oil price forecasting, although ELM has been widely implemented in other forecasting cases, such as medium-term sales in fashion retail supply chains (Wong and Guo, 2010, Sun et al., 2008), electricity prices (Shrivastava and Panigrahi, 2014, Tian and Meng, 2010) and other applications (Wang and Han, 2015), and the empirical results all showed that ELM significantly outperformed its counterparts (e.g., ARIMA, SVR and ANN models) in both level and directional forecasting (Liu et al., 2012, Pati et al., 2013). Since ELM might be somewhat unstable because of its randomness (Sun et al., 2008, Rong et al., 2008, Singh and Balasundaram, 2007, Miche et al., 2010), an extended ELM (EELM) method was accordingly proposed (Sun et al., 2007), in which a given number of ELM models are run and the average of their prediction results is taken as the final result. Since EELM might be more stable and accurate than the original ELM, the EELM model is introduced here as a very promising approach for forecasting international crude oil price.
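The averaging idea behind EELM can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the sigmoid activation, the uniform weight range, the number of hidden nodes, the lagged-input construction and all names (`ELM`, `eelm_predict`, `make_lagged`) are assumptions made for illustration, and the series is synthetic.

```python
import numpy as np


class ELM:
    """Single-hidden-layer network: random hidden layer, least-squares output layer."""

    def __init__(self, n_hidden=20, seed=None):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activations of the fixed random hidden layer (assumed activation).
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        # Input weights and biases are drawn once and never trained.
        self.W = self.rng.uniform(-1.0, 1.0, size=(X.shape[1], self.n_hidden))
        self.b = self.rng.uniform(-1.0, 1.0, size=self.n_hidden)
        # Output weights from the Moore-Penrose pseudoinverse: beta = pinv(H) @ y.
        self.beta = np.linalg.pinv(self._hidden(X)) @ y
        return self

    def predict(self, X):
        return self._hidden(X) @ self.beta


def eelm_predict(X_train, y_train, X_test, n_models=10, n_hidden=20, seed=0):
    """Average the predictions of n_models independently initialised ELMs."""
    preds = [
        ELM(n_hidden, seed=seed + k).fit(X_train, y_train).predict(X_test)
        for k in range(n_models)
    ]
    return np.mean(preds, axis=0)


def make_lagged(series, n_lags):
    """Turn a 1-D series into (lag window, next value) training pairs."""
    X = np.array([series[i:i + n_lags] for i in range(len(series) - n_lags)])
    y = series[n_lags:]
    return X, y


# Synthetic series standing in for one decomposed component (not real price data).
t = np.arange(300)
series = np.sin(0.1 * t) + 0.5 * np.sin(0.03 * t)
X, y = make_lagged(series, n_lags=4)
X_train, y_train, X_test, y_test = X[:250], y[:250], X[250:], y[250:]
y_hat = eelm_predict(X_train, y_train, X_test)
rmse = float(np.sqrt(np.mean((y_hat - y_test) ** 2)))
```

Each ELM differs only in its random hidden layer, so averaging several of them reduces the run-to-run variation that a single ELM shows; with `n_models=1` the function reduces to a plain ELM.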

In addition, the “decomposition and ensemble” principle can also be considered a helpful tool for analyzing data with high complexity and irregularity (Yu et al., 2008, Wang et al., 2005). The effectiveness of “decomposition and ensemble” has already been confirmed, and a series of decomposition-and-ensemble learning paradigms have accordingly been proposed. For instance, Yu et al. (2008) proposed a novel empirical mode decomposition (EMD) based neural network ensemble learning paradigm to predict the crude oil price. Tang et al. (2011) selected ensemble EMD (EEMD) and least squares support vector regression (LSSVR) as decomposition and forecasting tools, respectively, to formulate an EEMD-based LSSVR learning paradigm for forecasting nuclear power consumption. Wang et al. (2014) integrated EMD and the Elman neural network to predict wind speed. Lu and Shao (2012) put forward an ensemble approach integrating EEMD and ELM for forecasting computer product sales. Wang et al. (2011) proposed a seasonal decomposition (SD) based LSSVR learning approach for hydropower consumption forecasting. Tang et al. (2015) built a novel decomposition ensemble model by coupling the complementary EEMD and EELM, for crude oil price forecasting. Yu et al. (2014) constructed a similar methodology based on compressed sensing (CS) as the data decomposition technique and some powerful AI forecasting tools, for crude oil price forecasting. All empirical results statistically verified that the “decomposition and ensemble” framework can significantly improve prediction performance. Therefore, this study conducts its prediction research for international crude oil price under this effective “decomposition-and-ensemble” framework.

Based on the “decomposition and ensemble” principle, this study proposes a novel “decomposition-and-ensemble” learning paradigm integrating EEMD and EELM, i.e., an EEMD-based EELM ensemble learning paradigm, to forecast the international crude oil price. In the proposed methodology, the original crude oil price time series is first divided into several relatively independent intrinsic mode functions (IMFs) and one residue by EEMD, an efficient decomposition method relative to other decomposition methods (e.g., EMD and wavelet decomposition). Second, EELM, a fast and powerful forecasting tool relative to traditional statistical techniques and other AI models (e.g., ANN and SVM techniques), is applied to predict the different IMFs and the residue independently. Finally, these predicted values are fused into an ensemble result as the final prediction by the simple addition (ADD) ensemble method, since the sum of the real values of the decomposed components is equal to the original data. The main contribution of the paper is to introduce the powerful AI technique of EELM into the decomposition-and-ensemble method formulation. Unlike other existing decomposition-and-ensemble models, this method uses the recently proposed EELM technique as the individual forecasting tool, with its unique merits of powerful prediction capability, time-saving training process and model robustness.
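The three steps (decomposition, individual forecast, ADD ensemble) can be sketched as follows. This is only a structural illustration under loud assumptions: `toy_decompose` is a moving-average placeholder and is not EEMD, `ar_forecast` is a least-squares autoregression standing in for EELM, the window sizes and lag order are arbitrary, and the price series is synthetic rather than WTI data. The point shown is that when the components sum to the original series, the final forecast is simply the sum of the component forecasts.

```python
import numpy as np


def toy_decompose(series, windows=(5, 20)):
    """Placeholder additive decomposition (NOT EEMD): successive moving-average
    residuals play the role of IMFs, and the final smooth part is the residue."""
    components, current = [], np.asarray(series, dtype=float)
    for w in windows:
        # Edge padding keeps the smoothed series the same length as the input.
        padded = np.pad(current, (w // 2, w - 1 - w // 2), mode="edge")
        smooth = np.convolve(padded, np.ones(w) / w, mode="valid")
        components.append(current - smooth)  # oscillation around the smooth part
        current = smooth
    components.append(current)  # residue (slow trend)
    return components


def ar_forecast(component, n_lags=3):
    """Stand-in individual forecaster (least-squares AR); the paper uses EELM here."""
    X = np.array([component[i:i + n_lags] for i in range(len(component) - n_lags)])
    y = component[n_lags:]
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return float(component[-n_lags:] @ coef)  # one-step-ahead forecast


def decomposition_ensemble_forecast(series):
    components = toy_decompose(series)                # step 1: decomposition
    forecasts = [ar_forecast(c) for c in components]  # step 2: individual forecasts
    return sum(forecasts)                             # step 3: ADD ensemble


rng = np.random.default_rng(0)
prices = 50 + np.cumsum(rng.normal(0, 1, 200))  # synthetic series, not WTI data
components = toy_decompose(prices)
reconstruction_error = float(np.max(np.abs(sum(components) - prices)))
next_price = decomposition_ensemble_forecast(prices)
```

In a faithful reproduction, `toy_decompose` would be replaced by an EEMD routine and `ar_forecast` by an EELM model trained on each component; the final summation step would stay the same.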

The main motivation of this study is to formulate a novel EEMD-based EELM ensemble learning paradigm that improves the performance of international crude oil price prediction in terms of prediction accuracy, time saving and robustness, and to compare its prediction performance with other popular forecasting techniques (including typical single models and similar ensemble models). The rest of this study is organized as follows. Section 2 describes the formulation process of the proposed EEMD-based EELM ensemble learning paradigm in detail. For illustration and verification purposes, the crude oil spot price of West Texas Intermediate (WTI) is used to test the effectiveness of the proposed methodology in multi-step-ahead predictions, following the experimental design in Section 3. The corresponding results and the effectiveness of the proposed method are discussed in Section 4. Finally, concluding remarks and directions for future research are given in Section 5.

Section snippets

Methodology formulation

Generally speaking, there are three main steps involved in the proposed decomposition-and-ensemble methodology, i.e., decomposition, individual forecast and ensemble forecast. In this section, the overall formulation process of the EEMD-based EELM ensemble learning paradigm is presented. First, the EEMD and EELM algorithms are briefly introduced in Section 2.1 (Ensemble empirical mode decomposition, EEMD) and Section 2.2 (Extended extreme learning machine, EELM), respectively. Then, the EEMD-based EELM ensemble

Experimental design

In order to verify the effectiveness of the proposed decomposition-and-ensemble learning paradigm, the crude oil spot price of West Texas Intermediate (WTI) is selected as the experimental sample, as described in Section 3.1. Section 3.2 gives the main evaluation criteria for prediction capability.
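The snippet above does not name the evaluation criteria. As a hedged illustration of "level" and "directional" accuracy, the sketch below uses three measures that are common in this literature: root mean squared error (RMSE), mean absolute percentage error (MAPE) and a directional statistic. Whether the paper uses exactly these definitions is an assumption, and the numbers are made up for the example.

```python
import numpy as np


def rmse(actual, predicted):
    """Level accuracy: root mean squared error."""
    return float(np.sqrt(np.mean((actual - predicted) ** 2)))


def mape(actual, predicted):
    """Level accuracy: mean absolute percentage error, as a fraction."""
    return float(np.mean(np.abs((actual - predicted) / actual)))


def directional_statistic(actual, predicted):
    """Directional accuracy: share of steps where the predicted move from the
    last observed value has the same sign as the actual move."""
    actual_move = actual[1:] - actual[:-1]
    predicted_move = predicted[1:] - actual[:-1]
    return float(np.mean(actual_move * predicted_move >= 0))


# Made-up values for illustration only (not results from the paper).
actual = np.array([50.0, 52.0, 51.0, 53.0])
predicted = np.array([50.5, 51.0, 52.5, 52.0])

level_error = rmse(actual, predicted)
percentage_error = mape(actual, predicted)
hit_rate = directional_statistic(actual, predicted)  # 2 of 3 moves predicted correctly
```

Lower RMSE and MAPE indicate better level accuracy, while a higher directional statistic indicates that the model more often predicts the correct direction of the next price move.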

Experimental results

The detailed steps of experimental study, together with the parameter specification, are first given in Section 4.1, and the corresponding results are further discussed in Section 4.2.

Conclusions

Due to the intrinsic complexity of crude oil price data, which arises from its many interacting factors, a novel decomposition-and-ensemble learning paradigm, integrating ensemble empirical mode decomposition (EEMD) and extended extreme learning machine (EELM), is proposed for crude oil price forecasting, based on the principle of “decomposition and ensemble”. The empirical study shows that the proposed ensemble learning paradigm can significantly improve prediction performance and statistically

Acknowledgments

This work is partially supported by grants from the National Science Fund for Distinguished Young Scholars (NSFC no. 71025005), the National Natural Science Foundation of China (NSFC no. 91224001 and NSFC no. 71301006), the National Program for Support of Top-Notch Young Professionals and the Fundamental Research Funds for the Central Universities in BUCT.

References (59)

S.S. Chen et al. Reverse globalization: does high oil price volatility discourage international trade? Energy Econ. (2012)
M.R. Gholamian et al. A hybrid systematic design for multiobjective market problems: a case study in crude oil markets. Eng. Appl. Artif. Intell. (2005)
K. He et al. Crude oil price analysis and forecasting using wavelet decomposed ensemble model. Energy (2012)
A. Hou et al. A nonparametric GARCH model of crude oil price return volatility. Energy Econ. (2012)
G.B. Huang et al. Extreme learning machine: theory and applications. Neurocomputing (2006)
A. Lanza et al. Modeling and forecasting cointegrated relationships among heavy oil and product prices. Energy Econ. (2005)
Z. Li et al. An information diffusion-based model of oil futures price. Energy Econ. (2013)
H. Liu et al. Forecasting models for wind speed using wavelet, wavelet packet, time series and Artificial Neural Networks. Appl. Energy (2013)
X. Liu et al. A comparative analysis of support vector machines and extreme learning machines. Neural Netw. (2012)
H. Mohammadi et al. International evidence on crude oil price dynamics: applications of ARIMA-GARCH models. Energy Econ. (2010)
A. Murat et al. Forecasting oil price movements with crack spread futures. Energy Econ. (2009)
T. Quan et al. Weighted least squares support vector machine local region method for nonlinear time series prediction. Appl. Soft Comput. (2010)
H.J. Rong et al. A fast pruned-extreme learning machine for classification problem. Neurocomputing (2008)
N.A. Shrivastava et al. A hybrid wavelet-ELM based short term price forecasting for electricity markets. Int. J. Electr. Power Energy Syst. (2014)
Z.L. Sun et al. Sales forecasting using extreme learning machine with applications in fashion retailing. Decis. Support Syst. (2008)
L. Tang et al. A novel data-characteristic-driven modeling methodology for nuclear energy consumption forecasting. Appl. Energy (2014)
L. Tang et al. A novel hybrid ensemble learning paradigm for nuclear energy consumption forecasting. Appl. Energy (2012)
J. Wang et al. Forecasting wind speed using empirical mode decomposition and Elman neural network. Appl. Soft Comput. (2014)
S. Wang et al. A novel seasonal decomposition based least squares support vector regression ensemble learning approach for hydropower consumption forecasting in China. Energy (2011)
X. Wang et al. Improved extreme learning machine for multivariate time series online sequential prediction. Eng. Appl. Artif. Intell. (2015)
W.K. Wong et al. A hybrid intelligent model for medium-term sales forecasting in fashion retail supply chains using extreme learning machine and harmony search algorithm. Int. J. Prod. Econ. (2010)
C.L. Wu et al. Data-driven models for monthly streamflow time series prediction. Eng. Appl. Artif. Intell. (2010)
G. Xie et al. Hybrid approaches based on LSSVR model for container throughput forecasting: a comparative study. Appl. Soft Comput. (2013)
L. Yu et al. Forecasting crude oil price with an EMD-based neural network ensemble learning paradigm. Energy Econ. (2008)
L. Yu et al. A neural-network-based nonlinear metamodeling approach to financial time series forecasting. Appl. Soft Comput. (2009)
L. Yu et al. A compressed sensing based AI learning paradigm for crude oil price forecasting. Energy Econ. (2014)
J. Zhang et al. Performance enhancement of ensemble empirical mode decomposition. Mech. Syst. Signal Process. (2010)
X. Zhang et al. A new approach for crude oil price analysis based on Empirical Mode Decomposition. Energy Econ. (2008)
X. Zhang et al. Estimating the impact of extreme events on crude oil price: an EMD-based event analysis method. Energy Econ. (2009)
