Stock trading decisions using ensemble-based forecasting models: a study of the Indian stock market

Jothimani, Dhanya; Yadav, Surendra S.

doi:10.1007/s42786-019-00009-7

Stock trading decisions using ensemble-based forecasting models: a study of the Indian stock market

Original Article
Published: 04 June 2019

Volume 3, pages 113–129, (2019)
Cite this article

Journal of Banking and Financial Technology Aims and scope Submit manuscript

458 Accesses
19 Citations
Explore all metrics

Abstract

In this paper, a two-phase ensemble framework comprising of various non-classical decomposition models, namely, Empirical Mode Decomposition, Ensemble Empirical Mode Decomposition and Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN), and machine learning models, namely, Artificial Neural Network and Support Vector Regression (SVR), is proposed for predicting the stock prices. In the first phase, historical stock prices are decomposed to a set of subseries. In the second phase, each subseries is forecasted using machine learning algorithms. Lastly, forecasts of individual subseries are added to obtain the final forecasts. The proposed framework is tested on constituents of Nifty index for a period of 8 years ranging from 2008 to 2015. Performance of the models were analysed using root mean square error. Further, the results were validated statistically using Wilcoxon Signed Rank Test and Friedman Test. CEEMDAN-SVR model outperformed the remaining models. In addition, trading rules were illustrated to determine the optimal timing for buying/selling the stocks. Trading rules based on ensemble models yielded higher return on investment compared to traditional Buy-and-Hold strategy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A two-stage model for stock price prediction based on variational mode decomposition and ensemble machine learning method

Article 11 June 2023

Stock Index Forecasting Using Time Series Decomposition-Based and Machine Learning Models

Stock Price Forecasting with Empirical Mode Decomposition Based Ensemble $$\nu $$ -Support Vector Regression Model

Notes

Difference between the desired output and obtained predicted value
Other tests include KPSS test [34], Phillips–Pheron (PP) test [44] and Dickey–Fuller (DF) test [11]
Ljung–Box statistics [40]

Abbreviations

ANN:: Artificial Neural Network
ARIMA:: AutoRegressive Integrated Moving Average
ARMA:: AutoRegressive Moving Average
BP:: Back Propagation
CEEMDAN:: Complete Ensemble Empirical Mode Decomposition with Adaptive Noise
DWT:: Discrete Wavelet Transform
EMD:: Empirical Mode Decomposition
EEMD:: Ensemble Empirical Mode Decomposition
EMH:: Efficient Market Hypothesis
GARCH:: Generalized Autoregressive Conditional Heteroskedasticity
IMF:: Intrinsic Mode Function
RNN:: Recurrent Neural Network
RSI:: Relative Strength Indicator
SVM:: Support Vector Machine
SVR:: Support Vector Regression
WSRT:: Wilcoxon Signed Rank Test
RMSE:: Root Mean Square Error

References

Atsalakis G, Valavanis K (2009) Surveying stock market forecasting techniques- Part II: soft computing methods. Expert Syst Appl 36(3, Part 2):5932–5941
Google Scholar
Atsalakis G, Valavanis K (2013) Surveying stock market forecasting techniques-part I: conventional methods. In: Zopounidis C (ed) Computation optimization in economics and finance research compendium. Nova Science Publishers Inc, New York, pp 49–104
Google Scholar
Bettman JL, Sault S, Schultz E (2009) Fundamental and technical analysis: substitutes or complements? Account Financ 49(1):21–36
Google Scholar
Bilson C, Brailsford T, Hooper VJ (2001) Selecting macroeconomic variables as explanatory factors of emerging stock market returns. Pac Basin Financ J 9(4):401–426
Google Scholar
Booth A, Gerding E, McGroarty F (2015) Performance-weighted ensembles of random forests for predicting price impact. Quant Financ 15(11):1823–1835
MathSciNet MATH Google Scholar
Chen NF (1991) Financial investment opportunities and the macroeconomy. J Financ 46(2):529–554
Google Scholar
Cheng CH, Wei LY (2014) A novel time-series model based on empirical mode decomposition for forecasting TAIEX. Econ Model 36:136–141
Google Scholar
Crone S, Guajardo J, Weber R (2006) A study on the ability of support vector regression and neural networks to forecast basic time series patterns. In: Bramer M (ed) Artificial intelligence in theory and practice, IFIP international federation for information processing, vol 217. Springer, Boston, MA, pp 149–158
Google Scholar
Crowley PM (2010) Long cycles in growth: explorations using new frequency domain techniques with US data. Bank of Finland research discussion paper no. 6/2010. https://doi.org/10.2139/ssrn.1573641
Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
MathSciNet MATH Google Scholar
Dickey DA, Fuller WA (1979) Distribution of the estimators for autoregressive time series with a unit root. J Am Stat Assoc 74(366):427–431
MathSciNet MATH Google Scholar
Dickey DA, Fuller WA (1981) Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica 49(4):1057–1072
MathSciNet MATH Google Scholar
Diebold FX, Mariano RS (1995) Comparing predictive accuracy. J Bus Econ Stat 13:253–265
Google Scholar
Dietterich TG (2000) Ensemble methods in machine learning. In: proceedings of the first international workshop on multiple classifier systems, MCS ’00, pp. 1–15. Springer-Verlag, London, UK. http://dl.acm.org/citation.cfm?id=648054.743935. Accessed 15 Sept 2016
Google Scholar
Engle RF (1982) Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica 50:987–1007
MathSciNet MATH Google Scholar
Fama EF (1970) Efficient capital markets: a review of theory and empirical work. J Financ 25(2):383–417
Google Scholar
Fischer D, Jordan R (1987) Security analysis and portfolio management, 4th edn. Prentice-Hall, Englewood Cliffs, NJ
Google Scholar
Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675–701
MATH Google Scholar
Friedman M (1940) A comparison of alternative tests of significance for the problem of m rankings. Ann Math Stat 11(1):86–92
MathSciNet MATH Google Scholar
Göçken M, Özçalıcı M, Boru A, Dosdoğru AT (2019) Stock price prediction using hybrid soft computing models incorporating parameter tuning and input variable selection. Neural Comput Appl 31(2):577–592. https://doi.org/10.1007/s00521-017-3089-2
Article Google Scholar
Hsu CM (2014) An integrated portfolio optimisation procedure based on data envelopment analysis, artificial bee colony algorithm and genetic programming. Int J Syst Sci 45(12):2645–2664
MathSciNet MATH Google Scholar
Huang N, Shen Z, Long S, Wu M, Shih H, Zheng Q, Yen N, Tung C, Liu H (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc Lond A Math Phys Eng Sci 454(1971):903–995
MathSciNet MATH Google Scholar
Huang NE, Wu ML, Qu W, Long SR, Shen SSP (2003) Applications of Hilbert Huang transform to non-stationary financial time series analysis. Appl Stoch Models Bus Ind 19(3):245–268
MathSciNet MATH Google Scholar
Huang SC (2011) Forecasting stock indices with wavelet domain kernel partial least square regressions. Appl Soft Comput 11(8):5433–5443
Google Scholar
Jothimani D, Shankar R, Yadav SS (2015) Discrete wavelet transform-based prediction of stock index: a study on National Stock Exchange fifty index. J Financ Manag Anal 28(2):35–49
Google Scholar
Jothimani D, Shankar R, Yadav SS (2015) Forecasting of stock index using empirical mode decomposition and artificial neural network. In: Baydogan MG, Huang S, Oztekin A (eds) Proceedings of the 2015 INFORMS workshop on data mining and analytics, Philadelphia, pp. 1–6. Available at: https://pdfs.semanticscholar.org/783d/818475ac87b6834f605cbbf5357fdcc52b31.pdf. Accessed 27 Jan 2018
Jothimani D, Shankar R, Yadav SS (2016) A comparative study of ensemble-based forecasting models for stock index prediction. In: MWAIS 2016 Proceedings. Paper 5, http://aisel.aisnet.org/mwais2016/5. Accessed 27 Jan 2018
Jothimani D, Shankar R, Yadav SS (2016) A hybrid emd-ann model for stock price prediction. In: Panigrahi BK, Suganthan PN, Das S, Satapathy SC (eds) Swarm, evolutionary, and memetic computing. Springer, Cham, pp 60–70
Google Scholar
Jothimani D, Shankar R, Yadav SS (2017) Ensemble of non-classical decomposition models and machine learning models for stock index prediction. In: MWAIS 2017 proceedings. Paper 17. https://aisel.aisnet.org/mwais2017/17/. Accessed 18 Sept 2018
Kao LJ, Chiu CC, Lu CJ, Chang CH (2013) A hybrid approach by integrating wavelet-based feature extraction with MARS and SVR for stock index forecasting. Decis Support Syst 54(3):1228–1244
Google Scholar
Kara Y, Boyacioglu MA, Baykan ÖK (2011) Predicting direction of stock price index movement using artificial neural networks and support vector machines: the sample of the istanbul stock exchange. Expert Syst Appl 38(5):5311–5319
Google Scholar
Khashei M, Bijari M (2011) A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Appl Soft Comput 11(2):2664–2675 (The impact of soft computing for the progress of artificial intelligence)
Google Scholar
Kia AN, Haratizadeh S, Shouraki SB (2018) A hybrid supervised semi-supervised graph-based model to predict one-day ahead movement of global stock markets and commodity prices. Expert Syst Appl 105:159–173
Google Scholar
Kwiatkowski D, Phillips P, Schmidt P, Shin Y (1992) Testing the null hypothesis of stationarity against the alternative of a unit root. J Econom 54:159–178
MATH Google Scholar
Lahmiri S (2014) Wavelet low- and high-frequency components as features for predicting stock prices with backpropagation neural networks. J King Saud Univ Comput Inf Sci 26(2):218–227
Google Scholar
Lin C, Hsu C, Chang C (2003) A practical guide to support vector classification. Tech. rep., Department of computer science and information engineering. National Taiwan University, Taipei
Google Scholar
Liu H, Chen C, Tian H, Li Y (2012) A hybrid model for wind speed prediction using empirical mode decomposition and artificial neural networks. Renew Energy 48:545–556
Google Scholar
Lo AW, MacKinlay AC (1988) Stock market prices do not follow random walks: evidence from a simple specification test. Rev Financ Stud 1(1):41–66
Google Scholar
Magdalena L (2010) What is soft computing? Revisiting possible answers. Int J Comput Intell Syst 3(2):148–159. https://doi.org/10.1080/18756891.2010.9727686
Article MathSciNet Google Scholar
McLeod AI, Li WK (1983) Diagnostic checking ARMA time series models using squared-residual autocorrelations. J Time Ser Anal 4:269–273
MathSciNet MATH Google Scholar
Nemeyi PB (1963) Distribution-free multiple comparisons. PhD thesis. Princeton University, Princeton
Opitz D, Maclin R (1999) Popular ensemble methods: an empirical study. J Artif Intell Res 11:169–198
MATH Google Scholar
Ortega L, Khashanah K (2014) A neuro-wavelet model for the short-term forecasting of high-frequency time series of stock returns. J Forecast 33(2):134–146
MathSciNet Google Scholar
Phillips P, Perron P (1988) Testing for a unit root in time series regression. Biometrika 75:335–346
MathSciNet MATH Google Scholar
Ren Y, Suganthan P, Srikanth N (2015) Ensemble methods for wind and solar power forecasting: a state-of-the-art review. Renew Sustain Energy Rev 50:82–91
Google Scholar
Riedmiller M, Braun H (1993) A direct adaptive method for faster backpropagation learning: the RPROP algorithm. IEEE Int Conf Neural Netw 1:586–591
Google Scholar
Sapankevych N, Sankar R (2009) Time series prediction using support vector machines: a survey. Comput Intell Magazine IEEE 4(2):24–38
Google Scholar
Theodosiou M (2011) Forecasting monthly and quarterly time series using STL decomposition. Int J Forecast 27(4):1178–1195
Google Scholar
Torres ME, Colominas MA, Schlotthauer G, Flandrin P (2011) A complete ensemble empirical mode decomposition with adaptive noise. In: 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP), Prague, pp 4144–4147
Tsai CF, Hsu YF, Yen DC (2014) A comparative study of classifier ensembles for bankruptcy prediction. Appl Soft Comput 24(C):977–984
Google Scholar
Vapnik VN (1995) The Nature of statistical learning theory. Springer, New York Inc, New York
MATH Google Scholar
Wang J, Hou R, Wang C, Shen L (2016) Improved v-support vector regression model based on variable selection and brain storm optimization for stock price forecasting. Appl Soft Comput 49:164–178
Google Scholar
Wu G, Lo S (2010) Effects of data normalization and inherent-factor on decision of optimal coagulant dosage in water treatment by artificial neural network. Expert Syst Appl 37(7):4974–4983
Google Scholar
Wu Z, Huang NE (2009) Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv Adapt Data Anal 1(1):1–41
Google Scholar
Yao J, Tan C, Poh HL (1999) Neural networks for technical analysis: a study on KLCI. Int J Theor Appl Financ 02(02):221–241
MATH Google Scholar
Yeh CY, Huang CW, Lee SJ (2011) A multiple-kernel support vector regression approach for stock market price forecasting. Expert Syst Appl 38(3):2177–2186
Google Scholar

Download references

Author information

Authors and Affiliations

Data Science Lab, Ryerson University, Toronto, Canada
Dhanya Jothimani
Department of Management Studies, Indian Institute of Technology Delhi, New Delhi, India
Surendra S. Yadav

Authors

Dhanya Jothimani
View author publications
You can also search for this author in PubMed Google Scholar
Surendra S. Yadav
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dhanya Jothimani.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jothimani, D., Yadav, S.S. Stock trading decisions using ensemble-based forecasting models: a study of the Indian stock market. J BANK FINANC TECHNOL 3, 113–129 (2019). https://doi.org/10.1007/s42786-019-00009-7

Download citation

Received: 01 November 2018
Accepted: 21 May 2019
Published: 04 June 2019
Issue Date: October 2019
DOI: https://doi.org/10.1007/s42786-019-00009-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Stock trading decisions using ensemble-based forecasting models: a study of the Indian stock market

Abstract

Access this article

Similar content being viewed by others

A two-stage model for stock price prediction based on variational mode decomposition and ensemble machine learning method

Stock Index Forecasting Using Time Series Decomposition-Based and Machine Learning Models

Stock Price Forecasting with Empirical Mode Decomposition Based Ensemble $$\nu $$ -Support Vector Regression Model

Notes

Abbreviations

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Stock trading decisions using ensemble-based forecasting models: a study of the Indian stock market

Abstract

Access this article

Similar content being viewed by others

A two-stage model for stock price prediction based on variational mode decomposition and ensemble machine learning method

Stock Index Forecasting Using Time Series Decomposition-Based and Machine Learning Models

Stock Price Forecasting with Empirical Mode Decomposition Based Ensemble $$\nu $$ -Support Vector Regression Model

Notes

Abbreviations

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation