AdaBoost-LSTM Ensemble Learning for Financial Time Series Forecasting

Sun, Shaolong; Wei, Yunjie; Wang, Shouyang

doi:10.1007/978-3-319-93713-7_55

Shaolong Sun^20,21,
Yunjie Wei^20,22 &
Shouyang Wang^20,21,22

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10862))

Included in the following conference series:

International Conference on Computational Science

5292 Accesses
25 Citations
4 Altmetric

Abstract

A hybrid ensemble learning approach is proposed to forecast financial time series combining AdaBoost algorithm and Long Short-Term Memory (LSTM) network. Firstly, by using AdaBoost algorithm the database is trained to get the training samples. Secondly, the LSTM is utilized to forecast each training sample separately. Thirdly, AdaBoost algorithm is used to integrate the forecasting results of all the LSTM predictors to generate the ensemble results. Two major daily exchange rate datasets and two stock market index datasets are selected for model evaluation and comparison. The empirical results demonstrate that the proposed AdaBoost-LSTM ensemble learning approach outperforms some other single forecasting models and ensemble learning approaches. This suggests that the AdaBoost-LSTM ensemble learning approach is a highly promising approach for financial time series data forecasting, especially for the time series data with nonlinearity and irregularity, such as exchange rates and stock indexes.

You have full access to this open access chapter, Download conference paper PDF

Improving Financial Time Series Prediction Through Output Classification by a Neural Network Ensemble

Enhanced Stock Market Prediction Using Hybrid LSTM Ensemble

A Review of Long Short-Term Memory Approach for Time Series Analysis and Forecasting

Keywords

1 Introduction

Financial markets are affected by many factors, such as economic conditions, political events, traders’ expectations and so on. Hence, financial time series forecasting is usually regarded as one of the most challenging tasks due to the nonlinearity and irregularity. How to forecast financial time series accurately is still an open question with respect to the economic and social organization of modern society. Many common econometric and statistical models have been applied to forecast financial time series, such as autoregressive integrated moving average (ARIMA) model [1], vector auto-regression (VAR) model [2] and error correction model (ECM) [3]. However, traditional models fail to capture the nonlinearity and complexity of financial time series which lead to poor forecasting accuracy. Hence, exploring more effective forecasting methods, which possess enough learning capacity, is really necessary for forecasting financial time series. Thus, nonlinear and more complex artificial intelligence methods are introduced to forecast financial time series, such as artificial neural networks (ANNs) [4, 5], support vector regression (SVR) [6] and deep learning method [7].

The forecasting accuracy of those nonlinear artificial intelligence methods are usually better than the common econometric and statistical models, while they also suffer from many problems, such as parameter optimization and overfitting. Hence, many hybrid forecasting approaches are proposed to get better forecasting performance [8,9,10,11,12,13]. So far, the decomposition ensemble learning approach has been widely used to forecast time series in many fields, such as financial time series forecasting [14, 15], crude oil price forecasting [16], nuclear energy consumption forecasting [17], PM2.5 concentration forecasting [18], etc. According to the existing literatures, ANNs are the most common used methods both in single model forecasting and hybrid model forecasting, which demonstrates that ANNs are really suitable for time series forecasting. If the advantages of different ANNs methods are combined, a better forecasting performance can be obtained. Long short-term memory (LSTM) neural network is a kind of deep neural networks, while it possesses similar properties of recurrent neural network (RNN). Therefore, LSTM is a better choice for financial time series forecasting. In addition, the above ensemble learning approach usually chooses AdaBoost to integrate different LSTM forecasters.

In this study, an AdaBoost-based LSTM ensemble learning approach is firstly proposed to forecast financial time series, combining AdaBoost ensemble algorithm and LSTM neural network. LSTM is considered as weak forecasters and AdaBoost is utilized as an ensemble tool. The rest of this paper is organized as follows: the proposed method is briefly introduced in Sect. 2. Section 3 gives the empirical results and Sect. 4 provides the conclusions.

2 AdaBoost-LSTM Ensemble Learning Approach

Suppose there is a time series, we would like to make the m-step ahead forecasting. It is noticing that the iterative forecasting strategy is implemented in this paper, which can be expressed as:

$$ \hat{x}_{t + m} = f(x_{t} , x_{t - 1} , \ldots , x_{t - (p - 1)} ) $$

(1)

where $ \hat{x} $ is the forecasting value, x_t is the actual value in period t, and p denotes the lag orders.

In this study, the AdaBoost algorithm is introduced to integrate a set of LSTM predictors. An AdaBoost-LSTM ensemble learning approach is proposed for financial time series forecasting, and the flowchart is illustrated in Fig. 1. The proposed AdaBoost-LSTM ensemble learning approach consists of three main steps:

(1)
The sampling weights $ \left\{ {D_{n}^{t} } \right\} $ of the training samples $ \left\{ {x_{t} } \right\}^{T}_{t = 1} $ are calculated as follows:

$$ D_{n}^{t} = \frac{1}{N}, n = (1, 2, \ldots ,N;t = 1, 2, \ldots ,T) $$

(2)

where N is the number of LSTM predictors, T is the number of training samples.

(2)
The LSTM predictor F_n is trained by the training samples which are sampled according to the sampling weights $ D_{n}^{t} $.
(3)
The foresting error $ \left\{ {e_{n}^{t} } \right\} $ and ensemble weights {W_n} of the LSTM predictor F_n are calculated as follows:

$$ e_{n}^{t} = \frac{{\left| {x_{i} - \hat{x}_{i} } \right|}}{{x_{i} }} , (n = 1, 2, \ldots , N;t = 1, 2, \ldots ,T) $$
(3)

$$ w_{n} = \frac{1}{2}{ \ln }\left( {\frac{{1 - \mathop \sum \nolimits_{t = 1}^{T} e_{n}^{t} }}{{\mathop \sum \nolimits_{t = 1}^{T} e_{n}^{t} }}} \right) $$
(4)
(4)
Update the sampling weights $ D^{t}_{n + 1} $ of the training samples $ \left\{ {x_{t} } \right\}_{t = 1}^{T} $ as follows:

$$ D_{n + 1}^{t} = \frac{{D_{n}^{t} \beta_{n}^{t} }}{{\mathop \sum \nolimits_{t = 1}^{T} D_{n}^{t} \beta_{n}^{t} }} $$
(5)
where $ \beta_{n}^{t} \, = \,exp\left( {e_{n}^{t} } \right) $ is the update rate of training sample x_t.
(5)
Repeat the step 2–4 until all the LSTM predictors are obtained.
(6)
The final forecasting result is generated by integrating the forecasting results of all the LSTM predictors with ensemble weights.

3 Empirical Study

3.1 Data Description and Evaluation Criteria

The data in this research comprises of two typical stock indexes (S&P 500 index and Shanghai composite index (SHCI)) and two main exchange rates (the euro against the US dollars (EUR/USD) and the US dollars against the China yuan (USD/CNY)). The historical data are daily data, collected from the Wind Database (http://www.wind.com.cn/). The datasets are then divided into in-sample subsets and out-of-sample subsets, as illustrated in Table 1.

Table 1. In-sample and out-of-sample dataset of the stock indexes and exchange rates.

Full size table

Table 2 shows the descriptive statistics of those research data. The difference of statistics between the four series can be obviously seen from Table 2.

Table 2. The descriptive statistics of the stock indexes and exchange rates.

Full size table

In order to evaluate the forecasting performance of the proposed AdaBoost-LSTM ensemble learning approach, mean absolute percentage error (MAPE) and directional symmetry (DS) are employed to evaluate the level forecasting accuracy and directional forecasting accuracy, respectively. MAPE and DS are defined as follows:

$$ MAPE = \frac{1}{n}\sum\nolimits_{t = 1}^{n} {\left| {\frac{{Y_{t} - \hat{Y}_{t} }}{{Y_{t} }}} \right|*100\% } $$

(6)

$$ DS = \frac{1}{n - 1}\sum\nolimits_{i = 2}^{n} {d_{i} \times 100 {\% },\,d_{i} = \left\{ {\begin{array}{*{20}l} {1, (y_{i} - y_{i - 1} )(\hat{y}_{i} - y_{i - 1} ) \ge 0} \hfill \\ {0, {\text{otherwise}}} \hfill \\ \end{array} } \right.} $$

(7)

where $ \hat{y}_{i} $ is the forecasting value, y_i is the actual value, and n is the number of observation samples.

3.2 Forecasting Performance Comparison

The forecasting performances of the proposed AdaBoost-LSTM ensemble learning approach and benchmarks are discussed in this section. Tables 3, 4, 5 and 6 show the comparison results of MAPE and DS evaluation criteria, which show that the out-of-sample forecasting performance of the proposed approach is better than that of the benchmarks for all of the four financial time series and demonstrates that the proposed approach is an effective tool for financial time series forecasting.

Table 3. Forecasting performance of different models for stock indexes

Full size table

Table 4. Forecasting performance of different models for exchange rates series.

Full size table

Table 5. MAPE comparison with different ensemble forecasting approaches.

Full size table

Table 6. DS comparison with different ensemble forecasting approaches.

Full size table

As shown in Tables 3, 4, 5 and 6, the proposed approach significantly outperforms all of the benchmark models by means of level forecasting accuracy and directional forecasting accuracy for the stock indexes and exchange rates. Overall, the ensemble learning approaches outperform the single models, while individual LSTM, ELM, SVR and MLP models consistently outperform ARIMA models in terms of MAPE and DS. Moreover, the proposed AdaBoost-LSTM ensemble learning approach produces 19.44–22.33% better directional forecasts than ARIMA models, reaching up to an accuracy rate of 76.68% in out-of-sample directional forecasting for the EUR/USD.

In summary, some interesting findings can be summarized: (1) the proposed AdaBoost-LSTM outperforms all of the benchmark models in different forecasting horizons, which implies that the AdaBoost-LSTM ensemble learning approach is a powerful learning approach for financial time series forecasting in both level accuracy and directional accuracy; (2) it clearly shows that the hybrid ensemble approach with AdaBoost is much better than the one without ensemble by means of level accuracy and directional accuracy, which reveals that AdaBoost is a more effective ensemble algorithm; (3) the forecasting performance of hybrid ensemble learning approaches are significantly better than single models. The possible reason is that the ensemble can dramatically improve the forecasting performance of single models.

4 Conclusions

This paper proposes an AdaBoost-LSTM ensemble learning approach which employs AdaBoost algorithm to integrate the forecasting results of LSTM forecasts. Then, the proposed AdaBoost-LSTM ensemble learning approach is applied to forecast financial time series, including stock indexes and exchange rates. For model evaluation and model comparison, four typical financial time series data are collected to test the model performance. The empirical results show that the proposed AdaBoost-LSTM ensemble learning approach can significantly improve forecasting performance and outperform some other single forecasting models and some other ensemble learning approaches in terms of both level forecasting accuracy and directional forecasting accuracy, which demonstrates that the proposed approach is really a promising approach for financial time series forecasting. What’s more, the proposed approach can also be employed to solve other complex time series forecasting problems, such as crude oil price forecasting, wind speed forecasting, traffic flow forecasting, etc.

References

Chortareas, G., Jiang, Y., Nankervis, J.C.: Forecasting exchange rate volatility using high-frequency data: Is the euro different? Int. J. Forecast. 27(4), 1089–1107 (2011)
Article Google Scholar
Carriero, A., Kapetanios, G., Marcellino, M.: Forecasting exchange rates with a large Bayesian VAR. Int. J. Forecast. 25(2), 400–417 (2009)
Article Google Scholar
Moosa, I.A., Vaz, J.J.: Cointegration, error correction and exchange rate forecasting. J. Int. Financ. Markets Institutions Money 44, 21–34 (2016)
Article Google Scholar
Galeshchuk, S.: Neural networks performance in exchange rate prediction. Neurocomputing 172, 446–452 (2016)
Article Google Scholar
Zhang, G., Hu, M.Y.: Neural network forecasting of the British pound/US dollar exchange rate. Omega 26(4), 495–506 (1998)
Article Google Scholar
Huang, S., Chuang, P., Wu, C., Lai, H.: Chaos-based support vector regressions for exchange rate forecasting. Expert Syst. Appl. 37(12), 8590–8598 (2010)
Article Google Scholar
Shen, F., Chao, J., Zhao, J.: Forecasting exchange rate using deep belief networks and conjugate gradient method. Neurocomputing 167, 243–253 (2015)
Article Google Scholar
Chen, A., Leung, M.T.: Regression neural network for error correction in foreign exchange forecasting and trading. Comput. Oper. Res. 31(7), 1049–1068 (2004)
Article Google Scholar
Nag, A.K., Mitra, A.: Forecasting daily foreign exchange rates using genetically optimized neural networks. J. Forecast. 21(7), 501–511 (2002)
Article Google Scholar
Sermpinis, G., Stasinakis, C., Theofilatos, K., Karathanasopoulos, A.: Modeling, forecasting and trading the EUR exchange rates with hybrid rolling genetic algorithms—Support vector regression forecast combinations. Eur. J. Oper. Res. 247(3), 831–846 (2015)
Article MathSciNet Google Scholar
Sermpinis, G., Theofilatos, K., Karathanasopoulos, A., Georgopoulos, E.F., Dunis, C.: Forecasting foreign exchange rates with adaptive neural networks using radial-basis functions and particle swarm optimization. Eur. J. Oper. Res. 225(3), 528–540 (2013)
Article MathSciNet Google Scholar
Yu, L., Wang, S., Lai, K.K.: A novel nonlinear ensemble forecasting model incorporating GLAR and ANN for foreign exchange rates. Comput. Oper. Res. 32(10), 2523–2541 (2005)
Article Google Scholar
Yu, L., Wang, S., Lai, K.K.: Forecasting crude oil price with an EMD-based neural network ensemble learning paradigm. Energy Econ. 30(5), 2623–2635 (2008)
Article Google Scholar
Plakandaras, V., Papadimitriou, T., Gogas, P.: Forecasting daily and monthly exchange rates with machine learning techniques. J. Forecast. 34(7), 560–573 (2015)
Article MathSciNet Google Scholar
Yu, L., Wang, S., Lai, K.K.: A neural-network-based nonlinear metamodeling approach to financial time series forecasting. Appl. Soft Comput. 9(2), 563–574 (2009)
Article Google Scholar
Yu, L., Wang, Z., Tang, L.: A decomposition-ensemble model with data-characteristic-driven reconstruction for crude oil price forecasting. Appl. Energ. 156, 251–267 (2015)
Article Google Scholar
Tang, L., Yu, L., Wang, S., Li, J., Wang, S.: A novel hybrid ensemble learning paradigm for nuclear energy consumption forecasting. Appl. Energ. 93, 432–443 (2012)
Article Google Scholar
Niu, M., Wang, Y., Sun, S., Li, Y.: A novel hybrid decomposition-and-ensemble model based on CEEMD and GWO for short-term PM2.5 concentration forecasting. Atmos. Environ. 134, 168–180 (2016)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
Shaolong Sun, Yunjie Wei & Shouyang Wang
School of Economics and Management, University of Chinese Academy of Sciences, Beijing, 100190, China
Shaolong Sun & Shouyang Wang
Center for Forecasting Science, Chinese Academy of Sciences, Beijing, 100190, China
Yunjie Wei & Shouyang Wang

Authors

Shaolong Sun
View author publications
You can also search for this author in PubMed Google Scholar
Yunjie Wei
View author publications
You can also search for this author in PubMed Google Scholar
Shouyang Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yunjie Wei .

Editor information

Editors and Affiliations

Chinese Academy of Sciences, Beijing, China
Yong Shi
National Supercomputing Center in Wuxi, Wuxi, China
Haohuan Fu
Chinese Academy of Sciences, Beijing, China
Yingjie Tian
University of Amsterdam, Amsterdam, The Netherlands
Valeria V. Krzhizhanovskaya
University of Amsterdam, Amsterdam, The Netherlands
Michael Harold Lees
University of Tennessee, Knoxville, Tennessee, USA
Jack Dongarra
University of Amsterdam, Amsterdam, The Netherlands
Peter M. A. Sloot

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sun, S., Wei, Y., Wang, S. (2018). AdaBoost-LSTM Ensemble Learning for Financial Time Series Forecasting. In: Shi, Y., et al. Computational Science – ICCS 2018. ICCS 2018. Lecture Notes in Computer Science(), vol 10862. Springer, Cham. https://doi.org/10.1007/978-3-319-93713-7_55

Download citation

DOI: https://doi.org/10.1007/978-3-319-93713-7_55
Published: 12 June 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93712-0
Online ISBN: 978-3-319-93713-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

AdaBoost-LSTM Ensemble Learning for Financial Time Series Forecasting

Abstract

Similar content being viewed by others

Improving Financial Time Series Prediction Through Output Classification by a Neural Network Ensemble

Enhanced Stock Market Prediction Using Hybrid LSTM Ensemble

A Review of Long Short-Term Memory Approach for Time Series Analysis and Forecasting

Keywords

1 Introduction

2 AdaBoost-LSTM Ensemble Learning Approach

3 Empirical Study

3.1 Data Description and Evaluation Criteria

3.2 Forecasting Performance Comparison

4 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

AdaBoost-LSTM Ensemble Learning for Financial Time Series Forecasting

Abstract

Similar content being viewed by others

Improving Financial Time Series Prediction Through Output Classification by a Neural Network Ensemble

Enhanced Stock Market Prediction Using Hybrid LSTM Ensemble

A Review of Long Short-Term Memory Approach for Time Series Analysis and Forecasting

Keywords

1 Introduction

2 AdaBoost-LSTM Ensemble Learning Approach

3 Empirical Study

3.1 Data Description and Evaluation Criteria

3.2 Forecasting Performance Comparison

4 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation