Skip to main content
Log in

An integrative extraction approach for index-tracking portfolio construction and forecasting under a deep learning framework

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

This paper proposed a fusion model of the deep long- and short-term memory network named as deep LSTM and the stochastic dominance named as SD filter method to construct an index-tracking portfolio. We present a practical model that provides investors for portfolio construction using the deep LSTM model on extracting stock features and integrating the dimension deduction ability of the SD approach to the fusion model. Three main conclusions are drawn. First, deep learning in a supervised framework can work in portfolio management with our tuned model. Second, our deep LSTM model with an SD selection filter has enhanced the feature extraction ability to construct an index-tracking portfolio. Third, the linear activator, rectified linear unit, cooperated with SD methods can better reduce the estimation errors than the nonlinear activator in our deep LSTM model. These findings can implement on the portfolio construction in the neural network field. Additionally, entropy suggests to evaluate the learning effect of forecasting. The SD methods also are indicated for prosecuting in other neural network models to extract features of time series data, like transformer-based models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

  1. Babii A, Ghysels E, Striaukas J (2021) Machine learning time series regressions with an application to nowcasting. J Bus Econ Stat 40(3):1–23

    MathSciNet  Google Scholar 

  2. Ban G-Y, El Karoui N, Lim AE (2018) Machine learning and portfolio optimization. Manage Sci 64(3):1136–1154

    Article  Google Scholar 

  3. Bao W, Yue J, Rao Y (2017) A deep learning framework for financial time series using stacked autoencoders and long-short term memory. PLoS ONE 12(7):e0180944

    Article  Google Scholar 

  4. Barak S, Arjmand A, Ortobelli S (2017) Fusion of multiple diverse predictors in stock market. Inf Fusion 36:90–102

    Article  Google Scholar 

  5. Brownlee J (2019) Deep learning for computer vision: image classification, object detection, and face recognition in python. Machine Learning Mastery

  6. Coulombe PG, Leroux M, Stevanovic D, Surprenant S (2020) How is machine learning useful for macroeconomic forecasting? arXiv preprint arXiv:2008.12477

  7. De Prado ML (2018) Advances in financial machine learning. Wiley

    Google Scholar 

  8. Delgado-Bonal A, Marshak A (2019) Approximate entropy and sample entropy: a comprehensive tutorial. Entropy 21(6):541

    Article  MathSciNet  Google Scholar 

  9. Dixon MF, Halperin I (2019) The four horsemen of machine learning in finance. SSRN J. https://doi.org/10.2139/ssrn.3453564

    Article  Google Scholar 

  10. Ghoshal S, Roberts S (2017) Reading the tea leaves: a neural network perspective on technical trading. KDD 2017 mining and learning from time series workshop.

  11. Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT press, Cambridge

    Google Scholar 

  12. Gu S, Kelly B, Xiu D (2020) Empirical asset pricing via machine learning. Rev Financ Stud 33(5):2223–2273

    Article  Google Scholar 

  13. Heaton J, Polson N, Witte JH (2017) Deep learning for finance: deep portfolios. Appl Stoch Model Bus Ind 33(1):3–12

    Article  MathSciNet  Google Scholar 

  14. Henrique BM, Sobreiro VA, Kimura H (2019) Literature review: machine learning techniques applied to financial market prediction. Expert Syst Appl 124:226–251

    Article  Google Scholar 

  15. Hinton G, Srivastava N, Swersky K (2012) Neural networks for machine learning lecture 6a overview of mini-batch gradient descent. Cited on 14(8):2

    Google Scholar 

  16. Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507

    Article  MathSciNet  Google Scholar 

  17. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  Google Scholar 

  18. Hu Y-J, Lin S-J (2019) Deep reinforcement learning for optimizing finance portfolio management. Amity Int Conf Artif Intell (AICAI) 2019:14–20

    Google Scholar 

  19. Kan R, Zhou G (2007) Optimal portfolio choice with parameter uncertainty. J Financ Quantit Anal 42(3):621–656

    Article  Google Scholar 

  20. Kaut, M., & Wallace, S. W. (2003). Evaluation of scenario-generation methods for stochastic programming

  21. Kim Y, Kang D, Jeon M, Lee C (2019) GAN-MP hybrid heuristic algorithm for non-convex portfolio optimization problem. Eng Econ 64(3):196–226

    Article  Google Scholar 

  22. Levy H (2015) Stochastic dominance: investment decision making under uncertainty. Springer, Newyork

    Google Scholar 

  23. Lim B, Zohren S, Roberts S (2019) Enhancing time-series momentum strategies using deep neural networks. J Financ Data Sci 1(4):19–38

    Article  Google Scholar 

  24. Makridakis S, Spiliotis E, Assimakopoulos V (2018) Statistical and Machine Learning forecasting methods: concerns and ways forward. PLoS ONE 13(3):e0194889

    Article  Google Scholar 

  25. Markowitz H (1952) The utility of wealth. J Polit Econ 60(2):151–158

    Article  Google Scholar 

  26. Moody J, Wu L, Liao Y, Saffell M (1998) Performance functions and reinforcement learning for trading systems and portfolios. J Forecast 17(5–6):441–470

    Article  Google Scholar 

  27. Nakamura E (2005) Inflation forecasting using a neural network. Econ Lett 86(3):373–378

    Article  MathSciNet  Google Scholar 

  28. Olah, C. (2015). Understanding lstm networks.

  29. Pei A, Wang J, Fang W (2017) Predicting agent-based financial time series model on lattice fractal with random legendre neural network. Soft Comput 21(7):1693–1708

    Article  Google Scholar 

  30. Roman D, Mitra G, Zverovich V (2013) Enhanced indexation based on second-order stochastic dominance. Eur J Oper Res 228(1):273–281

    Article  MathSciNet  Google Scholar 

  31. Soleymani F, Paquet E (2020) Financial portfolio optimization with online deep reinforcement learning and restricted stacked autoencoder—DeepBreath. Expert Syst Appl 156:113456

    Article  Google Scholar 

  32. Sorzano COS, Vargas J, Montano AP (2014) A survey of dimensionality reduction techniques. arXiv preprint arXiv:1403.2877

  33. Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958

    MathSciNet  Google Scholar 

  34. Sutskever I, Martens J, Dahl G, Hinton G (2013) On the importance of initialization and momentum in deep learning. In: International Conference on Machine Learning. 1139–1147.

  35. Troiano L, Bhandari A, Villa EM (2020) Hands-on deep learning for finance: implement deep learning techniques and algorithms to create powerful trading strategies. Packt Publishing Ltd., Mumbai

    Google Scholar 

  36. Zebari R, Abdulazeez A, Zeebaree D, Zebari D, Saeed J (2020) A comprehensive review of dimensionality reduction techniques for feature selection and feature extraction. J Appl Sci Technol Trends 1(2):56–70

    Article  Google Scholar 

  37. Zhang Z, Zohren S, Roberts S (2019) Extending deep learning models for limit order books to quantile regression. arXiv preprint arXiv:1906.04404

  38. Zhang Z, Zohren S, Roberts S (2020) Deep reinforcement learning for trading. The J Financ Data Sci 2(2):25–40

    Article  Google Scholar 

  39. Wu L, Wang Y, Wu LH (2022) Modeling index tracking portfolio based on stochastic dominance for stock selection. Eng Econ 67(3):172–194

    Article  Google Scholar 

  40. Theodorsson-Norheim E (1986) Kruskal-Wallis test: BASIC computer program to perform nonparametric one-way analysis of variance and multiple comparisons on ranks of several independent samples. Comput Methods Programs Biomed 23(1):57–62

    Article  Google Scholar 

  41. Hajirahimi Z, Khashei M (2023) Hybridization of hybrid structures for time series forecasting: a review. Artif Intell Rev 56(2):1201–1261

    Article  Google Scholar 

  42. Liu F, Kong D, Kong J, Shiying L, Xiao Z, Qing X, Zhou A, Qi J (2022) Pandemic bonds issued by the Chinese government supported post-disaster recovery from COVID-19 pandemic. J Soc Comput 3(2):158–170

    Article  Google Scholar 

  43. Liu F, Kong D, Xiao Z, Zhang X, Zhou A, Qi J (2022) Effect of economic policies on the stock and bond market under the impact of COVID-19. J Saf Sci Resil 3(1):24–38

    Google Scholar 

  44. Fu Z, Liu F, Wang H, Shen S, Zhang J, Qi J, Zhou A (2021) LMR-CBT: Learning modality-fused representations with CB-transformer for multimodal emotion recognition from unaligned multimodal sequences. arXiv preprint arXiv:2112.01697

  45. Tayalı HA, Tolun S (2018) Dimension reduction in mean-variance portfolio optimization. Expert Syst Appl 92:161–169

    Article  Google Scholar 

  46. Bawa VS, Lindenberg EB, Rafsky LC (1979) An efficient algorithm to determine stochastic dominance admissible sets. Manage Sci 25(7):609–622

    Article  Google Scholar 

  47. Al-Khazali O, Lean HH, Samet A (2014) Do Islamic stock indexes outperform conventional stock indexes? A stochastic dominance approach. Pac Basin Financ J 28:29–46

    Article  Google Scholar 

  48. Mitra G, Kyriakis T, Lucas C, Pirbhad M (2003) A review of portfolio planning: Models and systems. Adv Portf Constr Implement. https://doi.org/10.1016/B978-075065448-7.50002-1

    Article  Google Scholar 

  49. Simaan Y (1997) Estimation risk in portfolio selection: the mean variance model versus the mean absolute deviation model. Manage Sci 43(10):1437–1446

    Article  Google Scholar 

  50. Gonin R (1989) Nonlinear Lp-norm estimation. CRC Press, Newyork

    Google Scholar 

Download references

Funding

No funding was received.

Author information

Authors and Affiliations

Authors

Contributions

Yu-Ju Wang: Conceptualization of adopting the index tracking portfolio; writing original draft preparation text, tables, and figures. Conceptualization of LSTM design; validation and supervision. Liang-Hong Wu: Original conceptualization of bridging LSTM and stochastic dominance. Conceptualization of LSTM design; validation; visualization of experimental results; and supervision. Conceptualization of stochastic dominance. Liang-Chuan Wu: Revised and edited the manuscript.

Corresponding author

Correspondence to Liang-Hong Wu.

Ethics declarations

Conflict of interest

Authors declare that there is no conflict of interest of a financial or personal nature.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, YJ., Wu, LH. & Wu, LC. An integrative extraction approach for index-tracking portfolio construction and forecasting under a deep learning framework. J Supercomput 80, 2047–2066 (2024). https://doi.org/10.1007/s11227-023-05538-z

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-023-05538-z

Keywords

Navigation