Abstract
In this paper, a two-phase ensemble framework comprising of various non-classical decomposition models, namely, Empirical Mode Decomposition, Ensemble Empirical Mode Decomposition and Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN), and machine learning models, namely, Artificial Neural Network and Support Vector Regression (SVR), is proposed for predicting the stock prices. In the first phase, historical stock prices are decomposed to a set of subseries. In the second phase, each subseries is forecasted using machine learning algorithms. Lastly, forecasts of individual subseries are added to obtain the final forecasts. The proposed framework is tested on constituents of Nifty index for a period of 8 years ranging from 2008 to 2015. Performance of the models were analysed using root mean square error. Further, the results were validated statistically using Wilcoxon Signed Rank Test and Friedman Test. CEEMDAN-SVR model outperformed the remaining models. In addition, trading rules were illustrated to determine the optimal timing for buying/selling the stocks. Trading rules based on ensemble models yielded higher return on investment compared to traditional Buy-and-Hold strategy.
Similar content being viewed by others
Abbreviations
- ANN:
-
Artificial Neural Network
- ARIMA:
-
AutoRegressive Integrated Moving Average
- ARMA:
-
AutoRegressive Moving Average
- BP:
-
Back Propagation
- CEEMDAN:
-
Complete Ensemble Empirical Mode Decomposition with Adaptive Noise
- DWT:
-
Discrete Wavelet Transform
- EMD:
-
Empirical Mode Decomposition
- EEMD:
-
Ensemble Empirical Mode Decomposition
- EMH:
-
Efficient Market Hypothesis
- GARCH:
-
Generalized Autoregressive Conditional Heteroskedasticity
- IMF:
-
Intrinsic Mode Function
- RNN:
-
Recurrent Neural Network
- RSI:
-
Relative Strength Indicator
- SVM:
-
Support Vector Machine
- SVR:
-
Support Vector Regression
- WSRT:
-
Wilcoxon Signed Rank Test
- RMSE:
-
Root Mean Square Error
References
Atsalakis G, Valavanis K (2009) Surveying stock market forecasting techniques- Part II: soft computing methods. Expert Syst Appl 36(3, Part 2):5932–5941
Atsalakis G, Valavanis K (2013) Surveying stock market forecasting techniques-part I: conventional methods. In: Zopounidis C (ed) Computation optimization in economics and finance research compendium. Nova Science Publishers Inc, New York, pp 49–104
Bettman JL, Sault S, Schultz E (2009) Fundamental and technical analysis: substitutes or complements? Account Financ 49(1):21–36
Bilson C, Brailsford T, Hooper VJ (2001) Selecting macroeconomic variables as explanatory factors of emerging stock market returns. Pac Basin Financ J 9(4):401–426
Booth A, Gerding E, McGroarty F (2015) Performance-weighted ensembles of random forests for predicting price impact. Quant Financ 15(11):1823–1835
Chen NF (1991) Financial investment opportunities and the macroeconomy. J Financ 46(2):529–554
Cheng CH, Wei LY (2014) A novel time-series model based on empirical mode decomposition for forecasting TAIEX. Econ Model 36:136–141
Crone S, Guajardo J, Weber R (2006) A study on the ability of support vector regression and neural networks to forecast basic time series patterns. In: Bramer M (ed) Artificial intelligence in theory and practice, IFIP international federation for information processing, vol 217. Springer, Boston, MA, pp 149–158
Crowley PM (2010) Long cycles in growth: explorations using new frequency domain techniques with US data. Bank of Finland research discussion paper no. 6/2010. https://doi.org/10.2139/ssrn.1573641
Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
Dickey DA, Fuller WA (1979) Distribution of the estimators for autoregressive time series with a unit root. J Am Stat Assoc 74(366):427–431
Dickey DA, Fuller WA (1981) Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica 49(4):1057–1072
Diebold FX, Mariano RS (1995) Comparing predictive accuracy. J Bus Econ Stat 13:253–265
Dietterich TG (2000) Ensemble methods in machine learning. In: proceedings of the first international workshop on multiple classifier systems, MCS ’00, pp. 1–15. Springer-Verlag, London, UK. http://dl.acm.org/citation.cfm?id=648054.743935. Accessed 15 Sept 2016
Engle RF (1982) Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica 50:987–1007
Fama EF (1970) Efficient capital markets: a review of theory and empirical work. J Financ 25(2):383–417
Fischer D, Jordan R (1987) Security analysis and portfolio management, 4th edn. Prentice-Hall, Englewood Cliffs, NJ
Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675–701
Friedman M (1940) A comparison of alternative tests of significance for the problem of m rankings. Ann Math Stat 11(1):86–92
Göçken M, Özçalıcı M, Boru A, Dosdoğru AT (2019) Stock price prediction using hybrid soft computing models incorporating parameter tuning and input variable selection. Neural Comput Appl 31(2):577–592. https://doi.org/10.1007/s00521-017-3089-2
Hsu CM (2014) An integrated portfolio optimisation procedure based on data envelopment analysis, artificial bee colony algorithm and genetic programming. Int J Syst Sci 45(12):2645–2664
Huang N, Shen Z, Long S, Wu M, Shih H, Zheng Q, Yen N, Tung C, Liu H (1998) The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc Lond A Math Phys Eng Sci 454(1971):903–995
Huang NE, Wu ML, Qu W, Long SR, Shen SSP (2003) Applications of Hilbert Huang transform to non-stationary financial time series analysis. Appl Stoch Models Bus Ind 19(3):245–268
Huang SC (2011) Forecasting stock indices with wavelet domain kernel partial least square regressions. Appl Soft Comput 11(8):5433–5443
Jothimani D, Shankar R, Yadav SS (2015) Discrete wavelet transform-based prediction of stock index: a study on National Stock Exchange fifty index. J Financ Manag Anal 28(2):35–49
Jothimani D, Shankar R, Yadav SS (2015) Forecasting of stock index using empirical mode decomposition and artificial neural network. In: Baydogan MG, Huang S, Oztekin A (eds) Proceedings of the 2015 INFORMS workshop on data mining and analytics, Philadelphia, pp. 1–6. Available at: https://pdfs.semanticscholar.org/783d/818475ac87b6834f605cbbf5357fdcc52b31.pdf. Accessed 27 Jan 2018
Jothimani D, Shankar R, Yadav SS (2016) A comparative study of ensemble-based forecasting models for stock index prediction. In: MWAIS 2016 Proceedings. Paper 5, http://aisel.aisnet.org/mwais2016/5. Accessed 27 Jan 2018
Jothimani D, Shankar R, Yadav SS (2016) A hybrid emd-ann model for stock price prediction. In: Panigrahi BK, Suganthan PN, Das S, Satapathy SC (eds) Swarm, evolutionary, and memetic computing. Springer, Cham, pp 60–70
Jothimani D, Shankar R, Yadav SS (2017) Ensemble of non-classical decomposition models and machine learning models for stock index prediction. In: MWAIS 2017 proceedings. Paper 17. https://aisel.aisnet.org/mwais2017/17/. Accessed 18 Sept 2018
Kao LJ, Chiu CC, Lu CJ, Chang CH (2013) A hybrid approach by integrating wavelet-based feature extraction with MARS and SVR for stock index forecasting. Decis Support Syst 54(3):1228–1244
Kara Y, Boyacioglu MA, Baykan ÖK (2011) Predicting direction of stock price index movement using artificial neural networks and support vector machines: the sample of the istanbul stock exchange. Expert Syst Appl 38(5):5311–5319
Khashei M, Bijari M (2011) A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Appl Soft Comput 11(2):2664–2675 (The impact of soft computing for the progress of artificial intelligence)
Kia AN, Haratizadeh S, Shouraki SB (2018) A hybrid supervised semi-supervised graph-based model to predict one-day ahead movement of global stock markets and commodity prices. Expert Syst Appl 105:159–173
Kwiatkowski D, Phillips P, Schmidt P, Shin Y (1992) Testing the null hypothesis of stationarity against the alternative of a unit root. J Econom 54:159–178
Lahmiri S (2014) Wavelet low- and high-frequency components as features for predicting stock prices with backpropagation neural networks. J King Saud Univ Comput Inf Sci 26(2):218–227
Lin C, Hsu C, Chang C (2003) A practical guide to support vector classification. Tech. rep., Department of computer science and information engineering. National Taiwan University, Taipei
Liu H, Chen C, Tian H, Li Y (2012) A hybrid model for wind speed prediction using empirical mode decomposition and artificial neural networks. Renew Energy 48:545–556
Lo AW, MacKinlay AC (1988) Stock market prices do not follow random walks: evidence from a simple specification test. Rev Financ Stud 1(1):41–66
Magdalena L (2010) What is soft computing? Revisiting possible answers. Int J Comput Intell Syst 3(2):148–159. https://doi.org/10.1080/18756891.2010.9727686
McLeod AI, Li WK (1983) Diagnostic checking ARMA time series models using squared-residual autocorrelations. J Time Ser Anal 4:269–273
Nemeyi PB (1963) Distribution-free multiple comparisons. PhD thesis. Princeton University, Princeton
Opitz D, Maclin R (1999) Popular ensemble methods: an empirical study. J Artif Intell Res 11:169–198
Ortega L, Khashanah K (2014) A neuro-wavelet model for the short-term forecasting of high-frequency time series of stock returns. J Forecast 33(2):134–146
Phillips P, Perron P (1988) Testing for a unit root in time series regression. Biometrika 75:335–346
Ren Y, Suganthan P, Srikanth N (2015) Ensemble methods for wind and solar power forecasting: a state-of-the-art review. Renew Sustain Energy Rev 50:82–91
Riedmiller M, Braun H (1993) A direct adaptive method for faster backpropagation learning: the RPROP algorithm. IEEE Int Conf Neural Netw 1:586–591
Sapankevych N, Sankar R (2009) Time series prediction using support vector machines: a survey. Comput Intell Magazine IEEE 4(2):24–38
Theodosiou M (2011) Forecasting monthly and quarterly time series using STL decomposition. Int J Forecast 27(4):1178–1195
Torres ME, Colominas MA, Schlotthauer G, Flandrin P (2011) A complete ensemble empirical mode decomposition with adaptive noise. In: 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP), Prague, pp 4144–4147
Tsai CF, Hsu YF, Yen DC (2014) A comparative study of classifier ensembles for bankruptcy prediction. Appl Soft Comput 24(C):977–984
Vapnik VN (1995) The Nature of statistical learning theory. Springer, New York Inc, New York
Wang J, Hou R, Wang C, Shen L (2016) Improved v-support vector regression model based on variable selection and brain storm optimization for stock price forecasting. Appl Soft Comput 49:164–178
Wu G, Lo S (2010) Effects of data normalization and inherent-factor on decision of optimal coagulant dosage in water treatment by artificial neural network. Expert Syst Appl 37(7):4974–4983
Wu Z, Huang NE (2009) Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv Adapt Data Anal 1(1):1–41
Yao J, Tan C, Poh HL (1999) Neural networks for technical analysis: a study on KLCI. Int J Theor Appl Financ 02(02):221–241
Yeh CY, Huang CW, Lee SJ (2011) A multiple-kernel support vector regression approach for stock market price forecasting. Expert Syst Appl 38(3):2177–2186
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jothimani, D., Yadav, S.S. Stock trading decisions using ensemble-based forecasting models: a study of the Indian stock market. J BANK FINANC TECHNOL 3, 113–129 (2019). https://doi.org/10.1007/s42786-019-00009-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42786-019-00009-7