Abstract
Stock trend prediction is challenging due to complex stock behavior including high volatility. Leveraging an additional source of information, such as social media, can improve predictions, social media data is highly unstructured. To address these challenges, we propose a Multi-Stage TCN-LSTM Hybrid Attentive Network (MSHAN), which utilizes historical stock data with selected technical indicators and weighted social media information to predict daily stock movement. MSHAN uses a multi-stage architecture and hybrid neural networks to capture long-range sequential context dependencies of time-series data and unstructured textual information. We present results using extensive experiments on the actual market public dataset StockNet and demonstrate that MSHAN outperforms other baselines and can successfully predict the directional movement of daily stock prices. The ablation study results show the effectiveness of each component of MSHAN. The market simulation analysis suggests that the application of MSHAN can enhance trading profits.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abu-Mostafa, Y.S., Atiya, A.F.: Introduction to financial forecasting. Appl. Intell. 6, 205–213 (1996)
Bai, S., Kolter, J.Z., Koltun, V.: An empirical evaluation of generic convolutional and recurrent networks for sequence modeling (2018)
Lin, T., Guo, T., Aberer, K.: Hybrid neural networks for learning the trend in time series. In: IJCAI 2017, pp. 2273–2279 (2017)
Wang, J., Sun, T., Liu, B., Cao, Y., Zhu, H.: CLVSA: a convolutional LSTM based variational sequence-to-sequence model with attention for predicting trends of financial markets. Papers arXiv:2104.04041, arXiv.org, April 2021
Farha, Y.A., Gall, J.: MS-TCN: multi-stage temporal convolutional network for action segmentation. In: CVPR, pp. 3570–3579 (2019)
Cowles, A., Jones, H.E.: Some a posteriori probabilities in stock market action. Econometrica 5(3), 280–294 (1937)
Kohli, P.P.S., Zargar, S., Arora, S., Gupta, P.: Stock prediction using machine learning algorithms. In: Malik, H., Srivastava, S., Sood, Y.R., Ahmad, A. (eds.) Applications of Artificial Intelligence Techniques in Engineering. AISC, vol. 698, pp. 405–414. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-1819-1_38
Nti, I.K., Adekoya, A., Weyori, B.: A comprehensive evaluation of ensemble learning for stock-market prediction. J. Big Data 7, 20 (2020)
Dingli, A., Fournier, K.: Financial time series forecasting - a deep learning approach. Int. J. Mach. Learn. Comput. 7(5), 118–122 (2017)
Hoseinzade, E., Haratizadeh, S., Khoeini, A.: U-CNNpred: a universal CNN-based predictor for stock markets (2019)
Selvin, S., Ravi, V., Gopalakrishnan, E., Menon, V., Kp, S.: Stock price prediction using LSTM, RNN and CNN-sliding window model, pp. 1643–1647, September 2017
Nelson, D., Pereira, A., de Oliveira, R.: Stock market’s price movement prediction with LSTM neural networks, pp. 1419–1426, May 2017
Chen, H., Xiao, K., Sun, J., Wu, S.: A double-layer neural network framework for high-frequency forecasting. ACM Trans. Manage. Inf. Syst. 7, 1–17 (2017)
Song, Y., Lee, J.W., Lee, J.: A study on novel filtering and relationship between input-features and target-vectors in a deep learning model for stock price prediction. Appl. Intell. 49, 897–911 (2019)
Nguyen, T.-T., Yoon, S.: A novel approach to short-term stock price movement prediction using transfer learning. Appl. Sci. 9(22), 4745 (2019)
Feng, F., Chen, H., He, X., Ding, J., Sun, M., Chua, T.-S.: Enhancing stock movement prediction with adversarial training. In: IJCAI (2019)
Tetlock, P.C., Saar-tsechansky, M., Macskassy, S.: More than words: Quantifying language to measure firms’ fundamentals. J. Finan. 63, 1437–1467 (2008)
Hu, Z., Liu, W., Bian, J., Liu, X., Liu, T.: Listening to chaotic whispers: a deep learning framework for news-oriented stock trend prediction. In: WSDM 2018 (2018)
Pagolu, V.S., Reddy, K.N., Panda, G., Majhi, B.: Sentiment analysis of twitter data for predicting stock market movements. In: SCOPES, pp. 1345–1350 (2016)
Nguyen, T.H., Shirai, K.: Topic modeling based sentiment analysis on social media for stock market prediction, (Beijing, China), pp. 1354–1364. ACL (2015)
Xu, Y., Cohen, S.B.: Stock movement prediction from tweets and historical prices, pp. 1970–1979. Association for Computational Linguistics (2018)
Sawhney, R., Agarwal, S., Wadhwa, A., Shah, R.R.: Deep attentive learning for stock movement prediction from social media text and company correlations. In: EMNLP. Association for Computational Linguistics, November 2020
Vaswani, A., et al.: Attention is all you need. CoRR, arXiv:1706.03762 (2017)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997)
Wang, Y., Li, Q., Huang, Z., Li, J.: EAN: event attention network for stock price trend prediction based on sentimental embedding. In: WebSci 2019, pp. 311–320 (2019)
Qiu, J., Wang, B., Zhou, C.: Forecasting stock prices with long-short term memory neural network based on attention mechanism. PLOS One 15(1), e0227222 (2020)
Chen, Y., Wei, Z., Huang, X.: Incorporating corporation relationship via graph convolutional neural networks for stock price prediction. In: CIKM 2018, pp. 1655–1658 (2018)
Kim, R., So, C.H., Jeong, M., Lee, S., Kim, J., Kang, J.: Hats: a hierarchical graph attention network for stock movement prediction (2019)
Yang, L., Ng, T.L.J., Smyth, B., Dong, R.: HTML: hierarchical transformer-based multi-task learning for volatility prediction. In: WWW 2020, pp. 441–451 (2020)
Jeanblanc, M., Yor, M., Chesney, M.: Mathematical Methods for Financial Markets, June 2009
Chandrashekar, G., Sahin, F.: A survey on feature selection methods. Comput. Electr. Eng. 40(1), 16–28 (2014)
Ng, A.Y.: Feature selection, l1 vs. l2 regularization, and rotational invariance. In: ICML, ICML 2004 (New York, NY, USA), p. 78 (2004)
Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. CoRR, arXiv:1409.0473 (2015)
Cer, D., et al.: Universal sentence encoder for English. In: EMNLP, pp. 169–174. ACL, November 2018
Pennington, J., Socher, R., Manning, C.: GloVe: global vectors for word representation. In: EMNLP, (Doha, Qatar), pp. 1532–1543. ACL, October 2014
Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding, pp. 4171–4186. ACL (2019)
Akbik, A., Blythe, D., Vollgraf, R.: Contextual string embeddings for sequence labeling. In: COLING, pp. 1638–1649 (2018)
Ding, X., Zhang, Y., Liu, T., Duan, J.: Knowledge-driven event embedding for stock prediction. In: COLING, (Osaka, Japan), pp. 2133–2142, December 2016
Brown, R.: Smoothing, Forecasting and Prediction of Discrete Time Series. Dover Phoenix Editions, Dover Publications (2004)
Qin, Y., Song, D., Cheng, H., Cheng, W., Jiang, G., Cottrell, G.W.: A dual-stage attention-based recurrent neural network for time series prediction. In: IJCAI 2017 (2017)
Lavrenko, V., Schmill, M., Lawrie, D., Ogilvie, P., Jensen, D., Allan, J.: Mining of concurrent text and time series. In: KDD, pp. 37–44 (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Gong, J., Eldardiry, H. (2021). Multi-stage Hybrid Attentive Networks for Knowledge-Driven Stock Movement Prediction. In: Mantoro, T., Lee, M., Ayu, M.A., Wong, K.W., Hidayanto, A.N. (eds) Neural Information Processing. ICONIP 2021. Lecture Notes in Computer Science(), vol 13111. Springer, Cham. https://doi.org/10.1007/978-3-030-92273-3_41
Download citation
DOI: https://doi.org/10.1007/978-3-030-92273-3_41
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92272-6
Online ISBN: 978-3-030-92273-3
eBook Packages: Computer ScienceComputer Science (R0)