Skip to main content
Log in

A novel stock indices hybrid forecasting system based on features extraction and multi-objective optimizer

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

The stock index is a barometer of the market economy. However, there are few reliable methods to forecast the stock market accurately and steadily owing to the complexity and non-linearity of the stock market. In this paper, a novel hybrid forecasting system is established based on the singular spectrum analysis (SSA), a novel multi-features extraction strategy, and the extreme learning machine (ELM) optimized by the multi-objective grey wolf optimizer (MOGWO). The raw stock data is firstly denoised by SSA. To make full use of data information, multiple feature extractors are constructed to obtain different features of the denoised data. The ELM optimized by MOGWO learns different features to acquire more satisfying prediction accuracy. The performance of the developed system has been comprehensively evaluated through three global stock indices datasets. Experimental results reveal that the proposed system has a great advantage over both conventional baseline models and advanced models proposed in recent years.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Lin G, Lin A, Cao J (2021) Multidimensional KNN algorithm based on EEMD and complexity measures in financial time series forecasting. Expert Systems with Applications 168(15):114443

    Article  Google Scholar 

  2. Wang X, Yang K, Liu T (2021) Stock price prediction based on morphological similarity clustering and hierarchical temporal memory. IEEE Access 9:67241–67248

    Article  Google Scholar 

  3. Fama EF (1970) Efficient capital markets: a review of theory and empirical work. The Journal of Finance 25:383–417

    Article  Google Scholar 

  4. Faith C (2007) Way of the turtle: the secret methods that turned ordinary people into legendary traders. McGraw-Hill Companies, New York

    Google Scholar 

  5. Nison S (2001) Japanese Candlestick charting techniques: a contemporary guide to the ancient investment techniques of the far east. Prentice Hall Press, Englewood Cliffs

    Google Scholar 

  6. Rounaghia MM, Zadeh F (2016) Investigation of market efficiency and financial stability between S&P 500 and London Stock Exchange: monthly and yearly forecasting of time series stock returns using ARMA model. Physica A: Statistical Mechanics and its Applications 456:10–21

    Article  Google Scholar 

  7. Ramezanian R, Peymanfar A, Ebrahimi SB (2019) An integrated framework of genetic network programming and multi-layer perceptron neural network for prediction of daily stock return: An application in Tehran stock exchange market. Applied Soft Computing 82:105551

    Article  Google Scholar 

  8. Parmar K, Singh KS, Kumar J (2021) Soft computing model coupled with statistical models to estimate future of stock market. Neural Computing and Applications 33(13):7629–7647

    Article  Google Scholar 

  9. Challa ML, Malepati V, Kolusu S (2020) S&P BSE Sensex and S&P BSE IT return forecasting using ARIMA. Financial Innovation 6(1):793–811

    Article  Google Scholar 

  10. Amini P, Khashei M (2019) A soft intelligent allocation-based hybrid model for uncertain complex time series forecasting. Applied Soft Computing 84:105736

    Article  Google Scholar 

  11. Xing D, Li H, Li J et al (2021) Forecasting price of financial market crash via a new nonlinear potential GARCH model. Physica A: Statistical Mechanics and its Applications 566:125649

    Article  MathSciNet  MATH  Google Scholar 

  12. Wilms I, Rombouts J, Croux C (2021) Multivariate volatility forecasts for stock market indices. International Journal of Forecasting 37(2):484–499

    Article  Google Scholar 

  13. Degiannakis S, Filis G, Hassani H (2018) Forecasting global stock market implied volatility indices. Journal of Empirical Finance 46:111–129

    Article  Google Scholar 

  14. Norouzia N, Fani M (2020) Black gold falls, black plague arise - an Opec crude oil price forecast using a gray prediction model. Upstream Oil and Gas Technology 5:100015

    Article  Google Scholar 

  15. Tsai MC, Cheng CH, Tsai MI (2019) A multifactor fuzzy time-series fitting model for forecasting the stock index. Symmetry-Basel 11(12):1474

    Article  Google Scholar 

  16. Guo H, Pedrycz W, Liu X (2019) Fuzzy time series forecasting based on axiomatic fuzzy set theory. Neural Computing & Applications 31(8):3921–3932

    Article  Google Scholar 

  17. Zhang D, Lou S (2021) The application research of neural network and BP algorithm in stock price pattern classification and prediction. Future Generation Computer Systems 115:872–879

    Article  Google Scholar 

  18. Kumar BS, Ravi V, Miglani R (2021) Predicting Indian stock market using the Psycho-Linguistic features of financial news. Annals of Data Science 8:517–558

    Article  Google Scholar 

  19. Wang Y, Wang L, Yang F et al (2021) Advantages of direct input-to-output connections in neural networks: the Elman network for stock index forecasting. Information Sciences 547:1066–1079

    Article  MathSciNet  MATH  Google Scholar 

  20. Zhang L, Wang J, Wang B (2020) Energy market prediction with novel long short-term memory network: Case study of energy futures index volatility. Energy 211:118634

    Article  Google Scholar 

  21. Alzaeemi SAS, Sathasivam S (2021) Examining the forecasting movement of palm oil price using RBFNN-2SATRA metaheuristic algorithms for logic mining. IEEE Access 9:22542–22557

    Article  Google Scholar 

  22. Yang Y, Wang J (2021) Forecasting wavelet neural hybrid network with financial ensemble empirical mode decomposition and MCID evaluation. Expert Systems with Applications 166:114097

    Article  Google Scholar 

  23. Ribeiro GT, Santos AAP, Mariani VC et al (2021) Novel hybrid model based on echo state neural network applied to the prediction of stock price return volatility. Expert Systems With Applications 184:115490

    Article  Google Scholar 

  24. Yadav A, Jha CK, Sharan A (2020) Optimizing LSTM for time series prediction in Indian stock market. Procedia Computer Science 167:2091–2100

    Article  Google Scholar 

  25. Niu H, Xu K, Wang W (2020) A hybrid stock price index forecasting model based on variational mode decomposition and LSTM network. Applied Intelligence 50(12):4296–4309

    Article  Google Scholar 

  26. Yang Y, Fan C, Xiong H (2021) A novel general-purpose hybrid model for time series forecasting. Appl Intell https://doi.org/10.1007/s10489-021-02442-y

  27. Niu H, Xu K (2020) A hybrid model combining variational mode decomposition and an attention-GRU network for stock price index forecasting. Mathematical Biosciences and Engineering 17(6):7151–7166

    Article  MathSciNet  MATH  Google Scholar 

  28. Zhou F, Zhang Q, Sornette D et al (2019) Cascading logistic regression onto gradient boosted decision trees for forecasting and trading stock indices. Applied Soft Computing 84:105747

    Article  Google Scholar 

  29. Zhang J, Teng Y, Chen W (2019) Support vector regression with modified firefly algorithm for stock price forecasting. Applied Intelligence 49(5):1658–1674

    Article  Google Scholar 

  30. Xu Y, Yang C, Peng S et al (2020) A hybrid two-stage financial stock forecasting algorithm based on clustering and ensemble learning. Applied Intelligence 50(11):3852–3867

    Article  Google Scholar 

  31. Deng Y, Ren Z, Kong Y et al (2016) A hierarchical fused fuzzy deep neural network for data classification. IEEE Transactions on Fuzzy Systems 25(4):1006–1012

    Article  Google Scholar 

  32. Aziz MFA, Mostafa SA, Foozy CFM et al (2021) Integrating Elman recurrent neural network with particle swarm optimization algorithms for an improved hybrid training of multidisciplinary datasets. Expert Systems with Applications 183:115441

    Article  Google Scholar 

  33. Donoho DL (1995) De-noising by soft-thresholding. IEEE Transactions on Information Theory 41(3):613–627

    Article  MathSciNet  MATH  Google Scholar 

  34. Donoho DL, Johnstone IM (1995) Adapting to unknown smoothness via wavelet shrinkage. Journal of the American Statistical Association 90(432):1200–1224

    Article  MathSciNet  MATH  Google Scholar 

  35. Donoho DL, Johnstone IM (1998) Ideal spatial adaptation by wavelet shrinkage. Biometrika 81(3):425–455

    Article  MathSciNet  MATH  Google Scholar 

  36. Wu Z, Huang NE (2009) Ensemble empirical mode decomposition: a noise-assisted data analysis method. Advances in Adaptive Data Analysis 1(1):1–41

    Article  Google Scholar 

  37. Yeh JR, Shieh JS, Huang NE (2010) Complementary ensemble empirical mode decomposition: a novel noise enhanced data analysis method. Advances in Adaptive Data Analysis 2(2):135–156

    Article  MathSciNet  Google Scholar 

  38. Lin Y, Ling BW, Xu N et al (2020) Effectiveness analysis of bio-electronic stimulation therapy to Parkinsons diseases via joint singular spectrum analysis and discrete fourier transform approach. Biomedical Signal Processing and Control 62:102–131

    Article  Google Scholar 

  39. Kalantari M (2021) Forecasting COVID-19 pandemic using optimal singular spectrum analysis. Chaos Solitons & Fractals 142:110547

    Article  MathSciNet  MATH  Google Scholar 

  40. Jiang P, Liu Z, Niu X (2020) A combined forecasting system based on statistical method, artificial neural networks, and deep learning methods for short-term wind speed forecasting. Energy 217:119361

    Article  Google Scholar 

  41. Zhang X, Wang J, Gao Y (2019) A hybrid short-term electricity price forecasting framework: Cuckoo search-based feature selection with singular spectrum analysis and SVM. Energy Economics 81:899–913

    Article  Google Scholar 

  42. Eberhart R, Kennedy J (1995) A new optimizer using particle swarm theory. Proceedings of the 6th international symposium on micro machine and human science, pp 39-43

  43. Holland JH (1992) Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence. MIT Press, Cambridge

    Book  Google Scholar 

  44. Gandomi AH, Yang H, Alavi AH (2013) Cuckoo search algorithm: a metaheuristic approach to solve structural optimization problems. Engineering with Computers 29:17–35

    Article  Google Scholar 

  45. Mirjalili S, Mirjalili SM, Lewis A (2014) Grey Wolf Optimizer. Advances in Engineering Software 69:46–61

    Article  Google Scholar 

  46. Muthiah-Nakarajan V, Noel MM (2016) Galactic swarm optimization: a new global optimization metaheuristic inspired by galactic motion. Applied Soft Computing 38:771–787

    Article  Google Scholar 

  47. Shayanfar H, Gharehchopogh FS (2018) Farmland fertility: a new metaheuristic algorithm for solving continuous optimization problems. Applied Soft Computing 71:728–746

    Article  Google Scholar 

  48. Dhiman G, Kumar V (2019) Seagull optimization algorithm: theory and its applications for large-scale industrial engineering problems. Knowledge-Based Systems 165:169–196

    Article  Google Scholar 

  49. Moosavi SHS, Bardsiri VK (2019) Poor and rich optimization algorithm: a new human-based and multi populations algorithm. Engineering Applications of Artificial Intelligence 86:165–181

    Article  Google Scholar 

  50. Shadravan S, Naji HR, Bardsiri VK (2019) The sailfish optimizer: a novel nature-inspired metaheuristic algorithm for solving constrained engineering optimization problems. Engineering Applications of Artificial Intelligence 80:20–34

    Article  Google Scholar 

  51. Segundo EHDV, Mariani VC, Coelho LDS (2019) Design of heat exchangers using falcon optimization algorithm. Applied Thermal Engineering 156:119–144

    Article  Google Scholar 

  52. Segundo EHDV, Mariani VC, Coelho LDS (2019) Metaheuristic inspired on owls behavior applied to heat exchangers design. Thermal Science and Engineering Progress 14:100431

    Article  Google Scholar 

  53. Yapici H, Cetinkaya N (2019) A new meta-heuristic optimizer: pathfinder algorithm. Applied Soft Computing 78:545–568

    Article  Google Scholar 

  54. Hayyolalam V, Kazem A (2020) Black widow optimization algorithm: a novel meta-heuristic approach for solving engineering optimization problems. Engineering Applications of Artificial Intelligence 87:103249

    Article  Google Scholar 

  55. Li S, Chen H, Wang M et al (2020) Slime mould algorithm: a newmethod for stochastic optimization. Future Generation Computer Systems 111(1):300–323

    Article  Google Scholar 

  56. Faramarzi A, Heidarinejad M, Mirjalili S et al (2020) Marine predators algorithm: a nature-inspired metaheuristic. Expert Systems with Applications 152:113377

    Article  Google Scholar 

  57. Saxena BK, Mishra S, Rao KVS (2021) Offshore wind speed forecasting at different heights by using ensemble empirical mode decomposition and deep learning models. Applied Ocean Research 117:102937

    Article  Google Scholar 

  58. Silva RGD, Ribeiro MHDM, Moreno SR et al (2021) A novel decomposition-ensemble learning framework for multi-step ahead wind energy forecasting. Energy 216:119174

    Article  Google Scholar 

  59. Ribeiro MHDM, Silva RGD, Moreno SR et al (2022) Efficient bootstrap stacking ensemble learning model applied to wind power generation forecasting. International Journal of Electrical Power & Energy Systems 136:107712

    Article  Google Scholar 

  60. Stefenon SF, Ribeiro MHDM, Nied A et al (2022) Time series forecasting using ensemble learning methods for emergency prevention in hydroelectric power plants with dam. Electric Power Systems Research 202:107584

    Article  Google Scholar 

  61. Yu L, Ma M (2021) A memory-trait-driven decomposition-reconstruction-ensemble learning paradigm for oil price forecasting. Applied Soft Computing 111:107699

    Article  Google Scholar 

  62. Li H, Jin F, Sun S et al (2021) A new secondary decomposition ensemble learning approach for carbon price forecasting. Knowledge-Based Systems 214:106686

    Article  Google Scholar 

  63. Efendt R, Ismall Z, Deris MM (2013) Improved weight fuzzy time series as used in the exchange rates forecasting of us dollar to ringgtt malaysia. International Journal of Computational Intelligence and Applications 12(1):1350005

    Article  Google Scholar 

  64. Zhang Y, Chu G, Shen D (2021) The role of investor attention in predicting stock prices: the long short-term memory networks perspective. Finance Research Letters 38:101484

    Article  Google Scholar 

  65. Vautard R, Ghil M (1989) Singular spectrum analysis in nonlinear dynamics, with applications to paleoclimatic time series. Physica D Nonlinear Phenomena 35(3):395–424

    Article  MathSciNet  MATH  Google Scholar 

  66. Vautard R, Yiou P, Ghil M (1992) Singular-spectrum analysis: a toolkit for short, noisy chaotic signals. Physica D 58(1–4):95–126

    Article  Google Scholar 

  67. Mirjalili S, Saremi S, Mirjalili SM et al (2016) Multi-objective grey wolf optimizer: a novel algorithm for multi-criterion optimization. Expert Systems with Applications 47:106–119

    Article  Google Scholar 

  68. Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501

    Article  Google Scholar 

  69. Huang GB, Chen L, Siew CK (2006) Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Transactions on Neural Networks 17(4):879–892

    Article  Google Scholar 

  70. Huang GB, Liang NY, Rong HJ et al (2005) On-line sequential extreme learning machine. In: The IASTED international conference on computational intelligence, Calgary

  71. Deb K, Pratap A, Agarwal S et al (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Transactions on Evolutionary Computation 6(2):182–197

    Article  Google Scholar 

  72. Coello CAC, Pulido GT, Lechuga MS (2004) Handling multiple objectives with particle swarm optimization. IEEE Transactions on Evolutionary Computation 8(3):256–279

    Article  Google Scholar 

  73. Li X, Zhang W, Ding Q (2019) Cross-domain fault diagnosis of rolling element bearings using deep generative neural networks. IEEE Transactions on Industrial Electronics 66(7):5525–5534

    Article  Google Scholar 

  74. Hu C, He S, Wang Y (2020) A classification method to detect faults in a rotating machinery based on kernelled support tensor machine and multilinear principal component analysis. Applied Intelligence 51(4):2609–2621

    Article  Google Scholar 

  75. Huang H, Jia R, Shi X et al (2021) Feature selection and hyper parameters optimization for short-term wind power forecast. Applied Intelligence 51(10):6752–6770

    Article  Google Scholar 

  76. Wei D, Wang J, Niu X et al (2021) Wind speed forecasting system based on gated recurrent units and convolutional spiking neural networks. Applied Energy 292:116842

    Article  Google Scholar 

  77. Yang F, Chen Z, Li J et al (2019) A novel hybrid stock selection method with stock prediction. Applied Soft Computing 80:820–831

    Article  Google Scholar 

Download references

Funding

National Natural Science Foundation of China (11701071).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shaoting Li.

Appendix A

Appendix A

Table 14 shows the hyperparameters of ARIMA and SSA-ARIMA, the order of the difference is determined by Augmented Dickey-Fuller (ADF) test, and the Akaike Information Criterion (AIC) is used to select the lag of periods and the lag of error components [76].

The hyperparameters of LSTM, IWFTS, SSA, ELM and MOGWO are listed in Table 15.

  1. (1)

    Because extracting nonlinear features of stock indices is not a complicated task, LSTM with a simple network structure is adopted in this paper. Meanwhile, a small learning rate (0.05) and a large number of iterations (5000) are adopted to ensure stable and effective extraction of nonlinear features.

  2. (2)

    To extract the trend feature effectively, the first-order difference of the stock data is calculated and divided into 10 equal-length domains, and the triangular membership function is selected to ensure that the first-order difference of the stock data has a reasonable degree of membership in each domain.

  3. (3)

    SSA has two important hyperparameters, but there is no uniform criterion to choose them. Generally, the larger the window length is, the finer the decomposition of the original sequence can be. However, choosing a large window length increases the amount of calculation and produces many singular values approaching 0, and their corresponding subsequences are usually noise sequences that should be removed. Therefore, we set the window length to 20 and select the first 7 subsequences to reconstruct the data (their singular values do not approach 0), which can effectively retain the main feature of stock indices.

  4. (4)

    There are 3 input variables for ELM, too many hidden neurons of ELM lead to overfitting. In addition, the experimental results show that ELM can better learn stock features when Sine is selected as the activation function [77].

  5. (5)

    MOGWO only needs to find 20 initial parameters of ELM, which is a simple task. Therefore, the hyperparameters of MOGWO set in this study are not large.

Table 14 Model hyperparameters I
Table 15 Model hyperparameters II

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, X., Li, X. & Li, S. A novel stock indices hybrid forecasting system based on features extraction and multi-objective optimizer. Appl Intell 52, 11784–11807 (2022). https://doi.org/10.1007/s10489-021-03031-9

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-021-03031-9

Keywords

Navigation