Skip to main content
Log in

Stock trading decisions using ensemble-based forecasting models: a study of the Indian stock market

  • Original Article
  • Published:
Journal of Banking and Financial Technology Aims and scope Submit manuscript

Abstract

In this paper, a two-phase ensemble framework comprising of various non-classical decomposition models, namely, Empirical Mode Decomposition, Ensemble Empirical Mode Decomposition and Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN), and machine learning models, namely, Artificial Neural Network and Support Vector Regression (SVR), is proposed for predicting the stock prices. In the first phase, historical stock prices are decomposed to a set of subseries. In the second phase, each subseries is forecasted using machine learning algorithms. Lastly, forecasts of individual subseries are added to obtain the final forecasts. The proposed framework is tested on constituents of Nifty index for a period of 8 years ranging from 2008 to 2015. Performance of the models were analysed using root mean square error. Further, the results were validated statistically using Wilcoxon Signed Rank Test and Friedman Test. CEEMDAN-SVR model outperformed the remaining models. In addition, trading rules were illustrated to determine the optimal timing for buying/selling the stocks. Trading rules based on ensemble models yielded higher return on investment compared to traditional Buy-and-Hold strategy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. Difference between the desired output and obtained predicted value

  2. Other tests include KPSS test [34], Phillips–Pheron (PP) test [44] and Dickey–Fuller (DF) test [11]

  3. Ljung–Box statistics [40]

Abbreviations

ANN:

Artificial Neural Network

ARIMA:

AutoRegressive Integrated Moving Average

ARMA:

AutoRegressive Moving Average

BP:

Back Propagation

CEEMDAN:

Complete Ensemble Empirical Mode Decomposition with Adaptive Noise

DWT:

Discrete Wavelet Transform

EMD:

Empirical Mode Decomposition

EEMD:

Ensemble Empirical Mode Decomposition

EMH:

Efficient Market Hypothesis

GARCH:

Generalized Autoregressive Conditional Heteroskedasticity

IMF:

Intrinsic Mode Function

RNN:

Recurrent Neural Network

RSI:

Relative Strength Indicator

SVM:

Support Vector Machine

SVR:

Support Vector Regression

WSRT:

Wilcoxon Signed Rank Test

RMSE:

Root Mean Square Error

References

  1. Atsalakis G, Valavanis K (2009) Surveying stock market forecasting techniques- Part II: soft computing methods. Expert Syst Appl 36(3, Part 2):5932–5941

    Google Scholar 

  2. Atsalakis G, Valavanis K (2013) Surveying stock market forecasting techniques-part I: conventional methods. In: Zopounidis C (ed) Computation optimization in economics and finance research compendium. Nova Science Publishers Inc, New York, pp 49–104

    Google Scholar 

  3. Bettman JL, Sault S, Schultz E (2009) Fundamental and technical analysis: substitutes or complements? Account Financ 49(1):21–36

    Google Scholar 

  4. Bilson C, Brailsford T, Hooper VJ (2001) Selecting macroeconomic variables as explanatory factors of emerging stock market returns. Pac Basin Financ J 9(4):401–426

    Google Scholar 

  5. Booth A, Gerding E, McGroarty F (2015) Performance-weighted ensembles of random forests for predicting price impact. Quant Financ 15(11):1823–1835

    MathSciNet  MATH  Google Scholar 

  6. Chen NF (1991) Financial investment opportunities and the macroeconomy. J Financ 46(2):529–554

    Google Scholar 

  7. Cheng CH, Wei LY (2014) A novel time-series model based on empirical mode decomposition for forecasting TAIEX. Econ Model 36:136–141

    Google Scholar 

  8. Crone S, Guajardo J, Weber R (2006) A study on the ability of support vector regression and neural networks to forecast basic time series patterns. In: Bramer M (ed) Artificial intelligence in theory and practice, IFIP international federation for information processing, vol 217. Springer, Boston, MA, pp 149–158

    Google Scholar 

  9. Crowley PM (2010) Long cycles in growth: explorations using new frequency domain techniques with US data. Bank of Finland research discussion paper no. 6/2010. https://doi.org/10.2139/ssrn.1573641

  10. Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30

    MathSciNet  MATH  Google Scholar 

  11. Dickey DA, Fuller WA (1979) Distribution of the estimators for autoregressive time series with a unit root. J Am Stat Assoc 74(366):427–431

    MathSciNet  MATH  Google Scholar 

  12. Dickey DA, Fuller WA (1981) Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica 49(4):1057–1072

    MathSciNet  MATH  Google Scholar 

  13. Diebold FX, Mariano RS (1995) Comparing predictive accuracy. J Bus Econ Stat 13:253–265

    Google Scholar 

  14. Dietterich TG (2000) Ensemble methods in machine learning. In: proceedings of the first international workshop on multiple classifier systems, MCS ’00, pp. 1–15. Springer-Verlag, London, UK. http://dl.acm.org/citation.cfm?id=648054.743935. Accessed 15 Sept 2016

    Google Scholar 

  15. Engle RF (1982) Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica 50:987–1007

    MathSciNet  MATH  Google Scholar 

  16. Fama EF (1970) Efficient capital markets: a review of theory and empirical work. J Financ 25(2):383–417

    Google Scholar 

  17. Fischer D, Jordan R (1987) Security analysis and portfolio management, 4th edn. Prentice-Hall, Englewood Cliffs, NJ

    Google Scholar 

  18. Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675–701

    MATH  Google Scholar 

  19. Friedman M (1940) A comparison of alternative tests of significance for the problem of m rankings. Ann Math Stat 11(1):86–92

    MathSciNet  MATH  Google Scholar 

  20. Göçken M, Özçalıcı M, Boru A, Dosdoğru AT (2019) Stock price prediction using hybrid soft computing models incorporating parameter tuning and input variable selection. Neural Comput Appl 31(2):577–592. https://doi.org/10.1007/s00521-017-3089-2

    Article  Google Scholar 

  21. Hsu CM (2014) An integrated portfolio optimisation procedure based on data envelopment analysis, artificial bee colony algorithm and genetic programming. Int J Syst Sci 45(12):2645–2664

    MathSciNet  MATH  Google Scholar 

  22. Huang N, Shen Z, Long S, Wu M, Shih H, Zheng Q, Yen N, Tung C, Liu H (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc Lond A Math Phys Eng Sci 454(1971):903–995

    MathSciNet  MATH  Google Scholar 

  23. Huang NE, Wu ML, Qu W, Long SR, Shen SSP (2003) Applications of Hilbert Huang transform to non-stationary financial time series analysis. Appl Stoch Models Bus Ind 19(3):245–268

    MathSciNet  MATH  Google Scholar 

  24. Huang SC (2011) Forecasting stock indices with wavelet domain kernel partial least square regressions. Appl Soft Comput 11(8):5433–5443

    Google Scholar 

  25. Jothimani D, Shankar R, Yadav SS (2015) Discrete wavelet transform-based prediction of stock index: a study on National Stock Exchange fifty index. J Financ Manag Anal 28(2):35–49

    Google Scholar 

  26. Jothimani D, Shankar R, Yadav SS (2015) Forecasting of stock index using empirical mode decomposition and artificial neural network. In: Baydogan MG, Huang S, Oztekin A (eds) Proceedings of the 2015 INFORMS workshop on data mining and analytics, Philadelphia, pp. 1–6. Available at: https://pdfs.semanticscholar.org/783d/818475ac87b6834f605cbbf5357fdcc52b31.pdf. Accessed 27 Jan 2018

  27. Jothimani D, Shankar R, Yadav SS (2016) A comparative study of ensemble-based forecasting models for stock index prediction. In: MWAIS 2016 Proceedings. Paper 5, http://aisel.aisnet.org/mwais2016/5. Accessed 27 Jan 2018

  28. Jothimani D, Shankar R, Yadav SS (2016) A hybrid emd-ann model for stock price prediction. In: Panigrahi BK, Suganthan PN, Das S, Satapathy SC (eds) Swarm, evolutionary, and memetic computing. Springer, Cham, pp 60–70

    Google Scholar 

  29. Jothimani D, Shankar R, Yadav SS (2017) Ensemble of non-classical decomposition models and machine learning models for stock index prediction. In: MWAIS 2017 proceedings. Paper 17. https://aisel.aisnet.org/mwais2017/17/. Accessed 18 Sept 2018

  30. Kao LJ, Chiu CC, Lu CJ, Chang CH (2013) A hybrid approach by integrating wavelet-based feature extraction with MARS and SVR for stock index forecasting. Decis Support Syst 54(3):1228–1244

    Google Scholar 

  31. Kara Y, Boyacioglu MA, Baykan ÖK (2011) Predicting direction of stock price index movement using artificial neural networks and support vector machines: the sample of the istanbul stock exchange. Expert Syst Appl 38(5):5311–5319

    Google Scholar 

  32. Khashei M, Bijari M (2011) A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Appl Soft Comput 11(2):2664–2675 (The impact of soft computing for the progress of artificial intelligence)

    Google Scholar 

  33. Kia AN, Haratizadeh S, Shouraki SB (2018) A hybrid supervised semi-supervised graph-based model to predict one-day ahead movement of global stock markets and commodity prices. Expert Syst Appl 105:159–173

    Google Scholar 

  34. Kwiatkowski D, Phillips P, Schmidt P, Shin Y (1992) Testing the null hypothesis of stationarity against the alternative of a unit root. J Econom 54:159–178

    MATH  Google Scholar 

  35. Lahmiri S (2014) Wavelet low- and high-frequency components as features for predicting stock prices with backpropagation neural networks. J King Saud Univ Comput Inf Sci 26(2):218–227

    Google Scholar 

  36. Lin C, Hsu C, Chang C (2003) A practical guide to support vector classification. Tech. rep., Department of computer science and information engineering. National Taiwan University, Taipei

    Google Scholar 

  37. Liu H, Chen C, Tian H, Li Y (2012) A hybrid model for wind speed prediction using empirical mode decomposition and artificial neural networks. Renew Energy 48:545–556

    Google Scholar 

  38. Lo AW, MacKinlay AC (1988) Stock market prices do not follow random walks: evidence from a simple specification test. Rev Financ Stud 1(1):41–66

    Google Scholar 

  39. Magdalena L (2010) What is soft computing? Revisiting possible answers. Int J Comput Intell Syst 3(2):148–159. https://doi.org/10.1080/18756891.2010.9727686

    Article  MathSciNet  Google Scholar 

  40. McLeod AI, Li WK (1983) Diagnostic checking ARMA time series models using squared-residual autocorrelations. J Time Ser Anal 4:269–273

    MathSciNet  MATH  Google Scholar 

  41. Nemeyi PB (1963) Distribution-free multiple comparisons. PhD thesis. Princeton University, Princeton

  42. Opitz D, Maclin R (1999) Popular ensemble methods: an empirical study. J Artif Intell Res 11:169–198

    MATH  Google Scholar 

  43. Ortega L, Khashanah K (2014) A neuro-wavelet model for the short-term forecasting of high-frequency time series of stock returns. J Forecast 33(2):134–146

    MathSciNet  Google Scholar 

  44. Phillips P, Perron P (1988) Testing for a unit root in time series regression. Biometrika 75:335–346

    MathSciNet  MATH  Google Scholar 

  45. Ren Y, Suganthan P, Srikanth N (2015) Ensemble methods for wind and solar power forecasting: a state-of-the-art review. Renew Sustain Energy Rev 50:82–91

    Google Scholar 

  46. Riedmiller M, Braun H (1993) A direct adaptive method for faster backpropagation learning: the RPROP algorithm. IEEE Int Conf Neural Netw 1:586–591

    Google Scholar 

  47. Sapankevych N, Sankar R (2009) Time series prediction using support vector machines: a survey. Comput Intell Magazine IEEE 4(2):24–38

    Google Scholar 

  48. Theodosiou M (2011) Forecasting monthly and quarterly time series using STL decomposition. Int J Forecast 27(4):1178–1195

    Google Scholar 

  49. Torres ME, Colominas MA, Schlotthauer G, Flandrin P (2011) A complete ensemble empirical mode decomposition with adaptive noise. In: 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP), Prague, pp 4144–4147

  50. Tsai CF, Hsu YF, Yen DC (2014) A comparative study of classifier ensembles for bankruptcy prediction. Appl Soft Comput 24(C):977–984

    Google Scholar 

  51. Vapnik VN (1995) The Nature of statistical learning theory. Springer, New York Inc, New York

    MATH  Google Scholar 

  52. Wang J, Hou R, Wang C, Shen L (2016) Improved v-support vector regression model based on variable selection and brain storm optimization for stock price forecasting. Appl Soft Comput 49:164–178

    Google Scholar 

  53. Wu G, Lo S (2010) Effects of data normalization and inherent-factor on decision of optimal coagulant dosage in water treatment by artificial neural network. Expert Syst Appl 37(7):4974–4983

    Google Scholar 

  54. Wu Z, Huang NE (2009) Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv Adapt Data Anal 1(1):1–41

    Google Scholar 

  55. Yao J, Tan C, Poh HL (1999) Neural networks for technical analysis: a study on KLCI. Int J Theor Appl Financ 02(02):221–241

    MATH  Google Scholar 

  56. Yeh CY, Huang CW, Lee SJ (2011) A multiple-kernel support vector regression approach for stock market price forecasting. Expert Syst Appl 38(3):2177–2186

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dhanya Jothimani.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jothimani, D., Yadav, S.S. Stock trading decisions using ensemble-based forecasting models: a study of the Indian stock market. J BANK FINANC TECHNOL 3, 113–129 (2019). https://doi.org/10.1007/s42786-019-00009-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s42786-019-00009-7

Keywords

Navigation