Asian stock markets closing index forecast based on secondary decomposition, multi-factor analysis and attention-based LSTM model

doi:10.1016/j.engappai.2022.104908

Engineering Applications of Artificial Intelligence

Volume 113, August 2022, 104908

https://doi.org/10.1016/j.engappai.2022.104908 Get rights and content

Abstract

The analysis and prediction of stock markets in Asian is an important issue which can help to promote the integration and globalization of financial cooperation. However, owning to the non-stationary and complexity of the stock market fluctuation, it is challenging to predict the stock price accurately. Especially after the decomposition of the original series, how to solve the problem of pseudo information and filter the exogenous variables is often certain challenging. This paper presents a hybrid model based on secondary decomposition (SD), multi-factor analysis (MFA) and attention-based long short-term memory (ALSTM) to predict the stock market price trends of four major Asian countries. The original stock price series is preprocessed by two decomposition algorithms so as to capture further non-linear feature and better filter the noise. Multi-factor analysis is introduced as a supplement to the original data information. In the prediction stage, attention layer is added in long short-term memory model to increase the weights of effective information. Finally, four datasets about Asian stock markets and nine compared models were used to verify the performance of the proposed model. The empirical analysis results show that compared to the general long short-term memory, our proposed model can obtain higher 30% accuracy at least. The mean average percentage errors of the system were also the lowest among all models mentioned in this paper (0.612%, 0.903%, 0.606% and 0.402% respectively), which proves the effectiveness of the hybrid model.

Introduction

The stock markets, which are known as one of the most vital financial markets all over the world, have become the focus of many investors and professional researchers. Over the past three decades, with the continuous development of the economy, financial activities in Asia have increased day by day, and regional monetary and financial cooperation based on regional development has always been one of the important topics among Asia countries. From the Chiang Mai Initiative (CMI) signed in Thailand to the Chiang Mai Initiative Multilateralization (CMIM) agreement, the macroeconomic research office among Asean, China, Japan and South Korea has been established, which fully reflects the urgent need and bold practice of Asia to make up for the inadequacy of the international financial system.

The integration of Asian financial capital market is an important force to accelerate the financial cooperation in Asia. It is an irresistible trend that governments will vigorously promote the cooperation of Asian stock market after the proposal of new roadmap of Asian bond market. This strategy can promote the level of regional and global stock market integration in emerging and frontier Asian countries and expand the geographic reach of portfolio diversification strategies for more international investors participating (Mohti et al., 2019). Recently, the prediction of Asian stock markets has becoming a hot research field. Because using appropriate methods to forecast financial data can not only help investors and production enterprises to avoid risks and maximize profits, but also help the government sector to understand the operation status of national economy from the macro level and guide policymakers to formulate relevant industrial policies, thus promoting the steady and healthy economy development. In addition, because Asian stock markets are highly dynamic and fluctuating resulted from economic policies, investor sentiments, economic expectations and the political environment, it may be more realistic and practical to assume that the Asian stock market prices are nonlinear mixture data (Guegan, 2009).

In the early stage of time series research, most forecasting methods focus on traditional econometric models, which mainly includes autoregressive moving average model (ARMA) and generalized autoregressive conditional heteroscedasticity (GARCH) model (Jakubowski and Szewczak, 2021, Yang, 2018). Devi et al. (2013) predicted Nifty-Midcap by applying ARIMA model, and the results proved that ARIMA model had good flexibility in time series. Gu and Chen (2011) used five different GARCH models to predict the volatility of Shanghai composite index and Shenzhen composite index, and the GARCH (1,1) was found that it could describe the volatility of the stock index better. Although econometric models have made some achievements in the field of prediction, a large number of empirical studies also show that the linear assumption makes the models unable to comprehensively reflect the real distribution of data (Xu and Chen, 2010), which is difficult to obtain accurate prediction results on more complex dataset. Similarly, Liu and Morley (2009) also pointed out that only using the traditional econometric models for modeling would lead to very serious prediction deviation.

Apart from econometric models, artificial intelligence (AI) techniques are now widely applied in time series, which mainly includes support vector machine model (Richhariya and Tanveer, 2018, Atmaja and Akagi, 2021, Fan et al., 2016) and artificial neural network (ANN) (Yuan et al., 2021, Aras and Kocakoc, 2016, Rakesh et al., 2018). Traditional ANN has been proven to have more advantages in dealing with nonlinear data (Zhang, 2003). However, each coin has two sides. ANN is easy to fall into the problem of overfitting and local extremum, which greatly restricts its promotion (Chen and Hao, 2018). Additionally, stock price has the characteristic of autocorrelation, which means the current stock price will be affected by the historical data. Therefore, it is non-ignorable that changes in stock price do not satisfy the assumption of random walk but are well suitable for neural networks. If we only use current information to make prediction, the information carried by the early data will be lost, which makes it difficult to deal with complicated time series problems. To solve these problems, recurrent neural network (RNN) was proposed. RNN can keep memory of recent events well through fully connecting hidden neurons in networks and learning from internal feedback mechanism. But the fly in the ointment is that it also faces the problem of gradient disappearance. In order to capture both long-term and short-term information, long short-term memory (LSTM) model was established on the basis of RNN. The special gate structure is added in LSTM to filter more useful and important information from the training data, which makes it more suitable for time series problem. Ghosh et al. (2019) used the LSTM model to predict the future trend of the company’s stock price, and the final empirical results showed good accuracy. Baek and Kim (2018) used LSTM framework to predict S&P-500 index and KOSPI-200 index respectively, and the study found that LSTM could achieve better prediction performance than other comparison models. Nevertheless, the valid information cannot gain more weights to show its importance in LSTM, which needs to be further improved.

Combining AI techniques with data preprocessing methods can greatly improve the prediction accuracy, because relying solely on AI cannot consistently and stably output high-quality results in all cases. The complexity of the predictive model can be lowered by using the signal analysis methods which decompose the data and then model the sub-sequences respectively. The most typical signal decomposition method is empirical mode decomposition (EMD), which can not only transform non-stationary signal into stationary signal, but also have a good decomposition effect on nonlinear data. By decomposing the original signal into multiple intrinsic mode functions (IMFs) and trend items, the hidden information in the data can be mined well. Wang and Wang (2017) successfully applied EMD algorithm and stochastic time neural network to predict financial time series. However, there is an unavoidable problem in the application of EMD-modal aliasing. Generally speaking, it means that different IMFs contain similar features. In order to solve this, a non-recursive signal analysis method, variational mode decomposition (VMD), was created based on EMD. Bisoi et al. (2019) applied the most efficient VMD to decompose the original stock price series for seizing the vital information, which proved that it is a useful tool for feature selection. This paper also applied VMD algorithm as the first decomposition for the reason that it can integrate the decomposed signals into the real signals and separate similar frequencies. Additionally, it is of vital importance for researchers to determine the decomposition layers of VMD. Scholars need to stop the decomposition when the same features are encountered. Kullback–Leibler divergence (KLD) algorithm is adopted in this paper to find the optimal decomposition layers, and constitute an improved VMD (IVMD). Analogously, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) is another improved version of EMD and has been widely used by researchers. Cao et al. (2019) combined different decomposition methods (EMD and CEEMDAN) with LSTM to compare their effectiveness. They found that CEEMDAN can extract more effective information compared with EMD. By adding a special kind of white noise, CEEMDAN has been improved continuously, which can reduce the residual noise to nearly zero. Therefore, this paper applied this relatively progressive algorithm to make the second decomposition for the high frequency sub-sequences.

In previous studies, one-off decomposition methods were widely used in the fusion models to improve prediction performance. In their decomposition-ensemble framework, the decomposition method was used to process the original data only once, and then the sub-sequences were modeled respectively. However, it should be noted that some of the decomposed sub-sequences may not be stable and even be pseudo-signals. In that case, the direct use of sub-sequences for modeling does not improve the accuracy, but reduces the efficiency of calculation. Compared with the single model, the hybrid model has greater superiority by integrating the advantages of multiple methods (Zhang et al., 2019a, Zhang et al., 2019b). Therefore, this paper applied with secondary decomposition (SD) method including improved VMD (IVMD) and improved CEEMDAN (ICEEMDAN) to solve the problem that decomposed modes still have pseudo-signals.

The role of exogenous variables is as important as data preprocessing part in time series forecasting field. For example, technical indicators such as moving average convergence divergence (MACD) and stochastics oscillator (KDJ) can act as exogenous variables in the stock price forecasting. Some studies have shown that technical indicators can not only effectively describe stock market changes, but also provide effective information for the establishment of model, which is also named multivariate analysis (Panigrahi and Behera, 2017, Zbikowski, 2015). It includes two stages, dimensionality reduction and feature selection. In the former stage, high dimensional data will be compressed and reconstructed into lower dimensional data since high dimensional data will lower the generalization ability, increase the amount of calculations, and reduce the final prediction performance. The Principal component analysis method (PCA) is used as the most classic and famous dimensionality reduction algorithm (Zhang et al., 2018). Wang and Wang (2015) combined PCA and neural network (NN) to predict financial time series and found that the introduction of PCA can effectually improve the accuracy of a single model. However, PCA can only achieve good results on linear data. In order to avoid this problem, this study uses auto encoder (AE) machine to compress high dimensional data. Chen et al. (2018) applied the combination of a deep-learning model, AE and a restricted Boltzmann machine to predict stock index price, whose results indicated that the novel hybrid model is a good choice for the index price prediction. The power of the AE is that it can reduce the dimensionality of non-linear data and guarantee the integrity of the information to a great extent. In the later stage, the most influential variables will be selected as the input from all collected variables. Feature selection is also a kind of data compression method as the same as dimensionality reduction. It can filter irrelevant variables in order to save computing time and improve the model’s efficiency. In this study, F test and mutual information (MI) are used to screen the relevant variables.

On the basis of the above analysis, this research comes up with a decomposition ensemble model for stock market price prediction, which combines the advantages of SD, multi-factor analysis (MFA) and attention-based LSTM (ALSTM) model. Firstly, the closing price in stock market is decomposed to corresponding sub-series by the IVMD algorithm. The high frequency components obtained from IVMD are inputted to secondary decomposition which adopts ICEEMDAN as the tool. For the further improvement, multiple exogenous variables are selected as influencing factors, and AE is used to compress these factors after screening out significant influencing factors. In the ensemble phase, the ALSTM model is utilized to predict the closing price data.

The innovations and contributions of this study are as follows:

(a) In the previous researches, the signal decomposition method is often used only once in the fusion models. Even though it can improve the prediction performance, its potential has not been fully tapped yet. The decomposed sub-sequences still have noise and unclear signals. In order to make up for this defect, this study establishes a model based on the secondary decomposition, which can reduce the influence of pseudo components on the prediction results and extract more effective information from the data.

(b) The parameters of VMD have rarely been determined scientifically before. This paper applied with KLD algorithm to improve VMD, which can determine the optimal number of sub-sequences’ layers according to the calculated value so as to improve the ability of feature extraction.

(c) The previous model only used closing price data for modeling, ignoring other relevant variables. In order to cover up this gap, this study introduces multiple related variables to build the model based on the closing price, and uses the multivariate analysis technologies including dimensionality reduction and variables selection to improve the prediction performance of the hybrid model.

(d) The proposed model uses the AE as the dimensionality reduction method instead of PCA because stock price has the nonlinear and nonstationary characteristics which are not suitable for PCA. Conversely, AE is a type of neural networks and it can adapt nonlinear data well. Additionally, the methods of feature selection applied with the combination of F test and MI to select the most correlated variables with stock market from underlying factors.

(e) The attention mechanism is introduced into the LSTM model in order to improve the prediction performance of the proposed model. The importance of input data is fully utilized to determine the weight assignment so as to improve the prediction accuracy of the model.

(f) The novel hybrid model is firstly proposed and achieves superior performance.

The remainder of this paper is organized as follows. Section 2 describes the basic methods used in this study. Section 3 presents the framework of the proposed hybrid model. Sector 4 describes the experimental process and empirical results in detail, and Section 5 summarizes the conclusion.

Section snippets

Related methodology

A novel prediction model is established in this study. The proposed framework integrates the advantages of SD, MFA and ALSTM model to improve the prediction accuracy of the stock market price. This section introduced the basic principle of each method employed in the proposed model.

Proposed model

As mentioned in the introduction, this study puts forward a novel stock market price prediction model, and takes four Asian markets as examples to demonstrate the effectiveness of the model. This section describes the modeling process of the novel model in detail. Fig. 1 displays the flow chart of the proposed model.

(1)
The first step is to decompose the closing stock price by VMD. The important parameter in this decomposition method is the number of decomposed modes which is always set manually.

Data description

In this study, stock price data from four Asian markets are used to demonstrate the validity of the novel model, including Shanghai Stock Exchange Composite Index (SSE), Tokyo NIKKEI Index (Japan), Seoul KOSPI Index (Korea) and SET Index (Thailand). The reason for selecting these four sets of data is that these four countries include both frontier countries and emerging countries in Asia, which are of certain explanatory significance. Moreover, the stocks selected are also relatively mature in

Conclusions

For improving the prediction accuracy of stock price in Asia, this research proposes a hybrid model based on secondary decomposition, multi-factor analysis and attention-based LSTM, and improves the final performance of the hybrid model by combining the advantages of each model. According to the empirical results, the main conclusions are as follows: (1) The secondary decomposition method has better performance than single decomposition with regard to extracting the hidden nonlinear

CRediT authorship contribution statement

Jujie Wang: Conceived of the presented idea, Developed the theory and performed the computations, Discussed the results and wrote the paper. Quan Cui: Conceived of the presented idea, Developed the theory and performed the computations, Discussed the results and wrote the paper. Xin Sun: Conceived of the presented idea, Developed the theory and performed the computations, Discussed the results and wrote the paper. Maolin He: Contributed to the revision of the manuscript.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This research was supported by the National Natural Science Foundation of China (Grant No. 71971122 and 71501101) and Postgraduate research and innovation plan project of Jiangsu Province, China (Grant No. SJCX20_0283).

References (48)

AltuveM. et al.
Fundamental heart sounds analysis using improved complete ensemble EMD with adaptive noise
Biocybern. Biomed. Eng.
(2020)
ArasS. et al.
A new model selection strategy in time series forecasting with artificial neural network: IHTS
Neurocomputing
(2016)
AtmajaB.T. et al.
Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM
Speech Commun.
(2021)
BaekY. et al.
ModAugNet: A new forecasting framework for stock market index value with an overfitting prevention LSTM module and a prediction LSTM module
Expert Syst. Appl.
(2018)
BisoiR. et al.
Hybrid variational mode decomposition and evolutionary robust kernel extreme learning machine for stock price and movement prediction on daily basis
Appl. Soft Comput.
(2019)
CaoJ. et al.
Financial time series forecasting model based on CEEMDAN and LSTM
Physica A
(2019)
ChangY.S. et al.
An LSTM-based aggregated model for air pollution forecasting
Atmos. Pollut. Res.
(2020)
ChenY.J. et al.
Integrating principle component analysis and weighted support vector machine for stock trading signals prediction
Neurocomputing
(2018)
DingY.K. et al.
Interpretable spatio-temporal attention LSTM model for flood forcasting
Neurocomputing
(2020)
FanL.W. et al.
An ICA-based support vector regressor scheme for forecasting crude oil prices
Technol. Forecast. Soc. Change
(2016)

FanJ.N. et al.

Robust deep auto-encoding Gaussian process regression for unsupervised anomaly detection

Neurocomputing

(2020)

GueganD.

Chaos in economics and finance

Annu. Rev. Control

(2009)

JakubowskiA. et al.

Truncated moments of perpetuities and a new central limit theorem for GARCH processes without Kesten’s regularity

Stochastic Process. Appl.

(2021)

LinY.J. et al.

Attribute reduction for multi-label learning with fuzzy rough set

Knowl.-Based Syst.

(2018)

MohtiW. et al.

Regional and global integration of Asian stock markets

Res. Int. Bus. Finance

(2019)

PanigrahiS. et al.

A hybrid ETS-ANN model for time series forecasting

Eng. Appl. Artif. Int.

(2017)

PatelJ. et al.

Predicting stock and stock price index movement using trend deterministic data preparation and machine learning techniques

Expert Syst. Appl.

(2015)

PatelJ. et al.

Predicting stock market index using fusion of machine learning techniques

Expert Syst. Appl.

(2015)

RezaeiH. et al.

Stock price prediction using deep learning and frequency decomposition

Expert Syst. Appl.

(2021)

RichhariyaB. et al.

EEG signal classification using universum support vector machine

Expert Syst. Appl.

(2018)

WangJ.J. et al.

Multi-step ahead wind speed prediction based on optimal feature extraction, long short term memory neural network and error

Appl. Energy

(2018)

WangJ. et al.

Forecasting stock market indexes using principle component analysis and stochastic neural networks

Neurocomputing

(2015)

WangJ. et al.

Forecasting stochastic neural network based on financial empirical mode decomposition

Neural Netw.

(2017)

XuJ. et al.

Disentangling the drivers of carbon prices in China’s ETS pilots – An EEMD approach

Technol. Forecast. Soc. Change

(2019)

Cited by (38)

A hierarchical deep model integrating economic facts for stock movement prediction
2024, Engineering Applications of Artificial Intelligence
Accurate stock movement prediction is essential to profit from the stock market. However, this task is challenging due to the complexity and non-stationary nature of the market. Deep learning methods have obtained more attention and success in mining price movement patterns.However, some limitations affect their performances. In general, the stock market is ever-changing, and many factors affect stock movement, so capturing the stock movement patterns is hard without enough prior information. To tackle it, we consider employing economic facts to help improve the deep learning method. In this paper, we propose a novel Hierarchical Deep learning Model that fuses Economic Facts (HDMEF) to predict stock movement from the micro to the macro tiers: the individual, industry, and whole market tiers. Specifically, we present three well-designed modules to separately model them based on the Capital Asset Pricing Model (CAPM), the herding effects, and the holiday effects in the stock market. Experiments on the A-share CSI300 and CSI500 indexes demonstrate that our proposed method performs best on all test phases compared with previous competitive baselines, even an absolute improvement of 2%–3% on some test phases where all the baselines act poor, proving our method is more efficient and robust in different market conditions. In addition, we do an ablation study to analyze the role of various economic effects used in our model, and the results prove that each module is helpful for prediction.
A novel framework for high resolution air quality index prediction with interpretable artificial intelligence and uncertainties estimation
2024, Journal of Environmental Management
Accurate air quality index (AQI) prediction is essential in environmental monitoring and management. Given that previous studies neglect the importance of uncertainty estimation and the necessity of constraining the output during prediction, we proposed a new hybrid model, namely TMSSICX, to forecast the AQI of multiple cities. Firstly, time-varying filtered based empirical mode decomposition (TVFEMD) was adopted to decompose the AQI sequence into multiple internal mode functions (IMF) components. Secondly, multi-scale fuzzy entropy (MFE) was applied to evaluate the complexity of each IMF component and clustered them into high and low-frequency portions. In addition, the high-frequency portion was secondarily decomposed by successive variational mode decomposition (SVMD) to reduce volatility. Then, six air pollutant concentrations, namely CO, SO₂, PM_2.5, PM₁₀, O₃, and NO₂, were used as inputs. The secondary decomposition and preliminary portion were employed as the outputs for the bidirectional long short-term memory network optimized by the snake optimization algorithm (SOABiLSTM) and improved Catboost (ICatboost), respectively. Furthermore, extreme gradient boosting (XGBoost) was applied to ensemble each predicted sub-model to acquire the consequence. Ultimately, we introduced adaptive kernel density estimation (AKDE) for interval estimation. The empirical outcome indicated the TMSSICX model achieved the best performance among the other 23 models across all datasets. Moreover, implementing the XGBoost to ensemble each predicted sub-model led to an 8.73%, 8.94%, and 0.19% reduction in RMSE, compared to SVM. Additionally, by utilizing SHapley Additive exPlanations (SHAP) to assess the impact of the six pollutant concentrations on AQI, the results reveal that PM_2.5 and PM₁₀ had the most notable positive effects on the long-term trend of AQI. We hope this model can provide guidance for air quality management.
Highly fluctuating short-term load forecasting based on improved secondary decomposition and optimized VMD
2024, Sustainable Energy, Grids and Networks
Short Term Load Forecasting (STLF) is a critical task in the power sector, enabling efficient resource allocation and grid management. However, the volatile and complex nature of short-term load series pose significant challenges to forecasting models. Traditional decomposition-prediction models are bottlenecked in that they often lack complexity-based clustering for efficiency and optimization of decomposition for optimal secondary decomposition. In this paper, we summarize the framework of the decomposition-prediction models, and propose the hybrid model to address these limitations. We propose a Sample Entropy-based hierarchical clustering method to cluster components according to complexity and improve the efficiency of secondary decomposition. Additionally, we propose the center frequency method to efficiently optimize the K parameter of VMD, ultimately achieving the optimal decomposition. In summary, firstly, to help minimize the difficulty of prediction, the load series is decomposed twice using Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) and Optimized Variational Mode Decomposition (OVMD). Then, two separate Long Short-Term Memory (LSTM) frameworks are built to predict the components obtained from the two decompositions, thus leveraging the advantages of the previous basic framework. Finally, by superimposing the prediction results, we obtain the output of the proposed model. The Belgian power load dataset is divided into four groups by season for comparison experiments. The results reveal that our model outperforms the benchmark models, with the best average coefficient of determination and mean absolute error of 0.996 and 53.69. Additionally, the limitations of sample entropy in secondary decomposition were revealed through our findings. These insights emphasize the promising contribution that our study brings in enhancing the decomposition-prediction model.
A novel hybrid model for stock price forecasting integrating Encoder Forest and Informer
2023, Expert Systems with Applications
Stock forecasting plays a pivotal role in time series forecasting as it enables informed and effective investment decisions by minimizing risks. In this paper, a novel hybrid model for stock price forecasting is proposed to explore the impact of a decomposition-reconstruction method fused with machine learning models, aiming to enhance the predictive ability of the model. Following the decomposition-prediction-reconstruction principle, the hybrid model incorporates wavelet transform, integrating Encoder Forest (EF) with Informer. To mitigate the influence of long-time series noise on stock forecasting, the original data is decomposed into high-frequency signal components (CD) and low-frequency signal components (CA). The Informer and Encoder Forest are trained to predict the future CA and CD, respectively. The hybrid model is implemented for all stocks in three industries of the china stock market. Several models including MLP, RNN, LSTM, Informer, WT + RNN + DT, WT + LSTM + RF, EMD + Informer + EF, VMD + Informer + EF, CEEMD + Informer + EF, and CEEMDAN + Informer + EF are designed as compared methods to verify the superiority and advancement of the proposed technique. Evaluating the performance of individual models reveals that the prediction accuracy follows the ranking: MLP < RNN < LSTM < Informer. Comparing the results of the hybrid model with those of the individual models demonstrates that the hybrid model improves prediction accuracy. This comparison also indicates that the wavelet transform and tree models can enhance the accuracy of the model without altering the initial ranking of the prediction effect. It is worth noting that WT and EMD-like methods employ different data decomposition approaches, leading to diverse outcomes. Experimental results indicate that WT is better suited for a hybrid model that combines two distinct methods. All experimental results indicated that the proposed hybrid model has higher prediction accuracy, stronger generalization ability, and stronger practicability, which is more suitable for Stock forecasting problems.
Novel optimization approach for realized volatility forecast of stock price index based on deep reinforcement learning model
2023, Expert Systems with Applications
Accurately predicting volatility has always been the focus of government decision-making departments, financial regulators and academia. Therefore, it is very crucial to precisely predict the realized volatility (RV) of the stock price index. In this paper, we take the RV sequences of Shanghai Stock Exchange Composite Index (SSEC), Standard & Poor 500 index (SPX) and Financial Times Stock Exchange Index (FTSE) as the research objects, and propose a predictive model based on optimized variational mode decomposition (VMD), deep learning models including deep belief network (DBN), long short-term memory network (LSTM) and gated recurrent unit (GRU), and reinforcement learning Q-learning algorithm. Firstly, the original RV sequence is decomposed by using the VMD ideal parameters optimized by grey wolf optimizer (GWO) to obtain the intrinsic mode functions (IMFs). Then, DBN, LSTM and GRU are used to predict same IMF simultaneously. Finally, the optimal weights of the above three models are determined by the Q-learning algorithm to construct an integrated model, and the final results are obtained after accumulating the predicted values of each IMF. The predictive performance of the model was evaluated by four loss functions: the mean average error (MAE), mean squared error (MSE), heterogeneous mean average error (HMAE), heterogeneous mean squared error (HMSE) and modified Diebold and Mariano test (MDM). The experimental results show that the constructed GVMD-Q-DBN-LSTM-GRU method has better performance that the comparison model in both emerging and developed markets.
McVCsB: A new hybrid deep learning network for stock index prediction
2023, Expert Systems with Applications
Forecasting the stock composite index is a challenge on account of the abundant noise-induced high degree of non-linearity and non-stationarity. Numerous predictive models based on statistical and intelligent methods have been proposed but using a single model or method cannot afford noise reduction so as to achieve credible results in facilitating regulation and investment. This study presents a novel hybrid model of multi-channel VMD-CBAM-BiLSTM named McVCsB based on an improved decomposition-reconstruction predicting framework for stock index prediction. The variational modal decomposition algorithm (VMD) and convolutional block attention module (CBAM) are adopted to achieve noise filtration and deep-level feature extraction. The denoised and concentrated feature will feed the predicting mechanism of bi-directional long-short-term networks (BiLSTM) to produce a stock index for the next day. The multi-channel input structure is innovated to resolve the inherent cumulative error problem of the decomposition-reconstruction framework by changing the parallel processing manner. We design three levels of model comparison by selecting and constructing a variety of benchmark models. In both statistical and practical assessments, the proposed model outperforms others with high reliability and robustness. According to the empirical results on four stock composite indexes, the McVCsB gives the best performance on China’s SCI out-of-sample prediction with the mean absolute error of 0.0016, root mean square error of 0.0019, symmetric mean absolute percentage error of 0.3862, R $^{2}$ of 0.9989 and Sharpe ratio of 0.6648. Thus, this new proposed model provides an effective predicting tool and can be extensively applied to stock markets in a more widen range.

View all citing articles on Scopus

View full text

Asian stock markets closing index forecast based on secondary decomposition, multi-factor analysis and attention-based LSTM model

Abstract

Introduction

Section snippets

Related methodology

Proposed model

Data description

Conclusions

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Biocybern. Biomed. Eng.

Neurocomputing

Speech Commun.

Expert Syst. Appl.

Appl. Soft Comput.

Physica A

Atmos. Pollut. Res.

Neurocomputing

Neurocomputing

Technol. Forecast. Soc. Change

Neurocomputing

Annu. Rev. Control

Stochastic Process. Appl.

Knowl.-Based Syst.

Res. Int. Bus. Finance

Eng. Appl. Artif. Int.

Expert Syst. Appl.

Expert Syst. Appl.

Expert Syst. Appl.

Expert Syst. Appl.

Appl. Energy

Neurocomputing

Neural Netw.

Technol. Forecast. Soc. Change