Abstract
The market analysis is one of the important tasks for text mining. In this situation, Web news has an important role to predict stock prices. In this paper, we propose a method to predict the Nikkei Stock Average, which is one of the most important stock market indexes. We extract viewpoints from experts’ articles for analyzing Web news. The extracted words are index words in the vector space of a machine learning technique. We also incorporate word embedding and bootstrap approaches into our method. It predicts “UP” or “DOWN” of the next day by using the articles of a day. We also evaluate our method with not only one-day prediction but also simulated trading. The experimental result shows that index words based on expert articles were effective for both one-day prediction and simulated trading.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
Since the number of BoWs is approximately 10,000, BoK-E is approximately 10% of the BoW.
- 4.
- 5.
Note that we never use experts’ articles for the vectorization. We just use expert’s articles for extracting index words for the vector space.
- 6.
More precisely, the model predicts the next day’s “UP” or “DOWN” from the target day.
- 7.
Loss-cutting is a financial word. It is an order to buy or sell stocks automatically in the case that unrealized capital losses become larger.
- 8.
Short selling is a financial word and the practice of selling stocks that are not currently owned by borrowing them from a securities company.
- 9.
More properly, the borrowed stocks are paid back to the securities company. The balance is the benefit of the day.
- 10.
Although we evaluated other methods, such as the method with BoT-E, the BoK-E also produced the best performance in terms of this simulated trading.
References
Bollen, J., Mao, H., Zeng, X.-J.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)
Lee, H., Surdeanu, M., MacCartney, B., Jurafsky, D.: On the importance of text analysis for stock price prediction. In: Proceedings of the 9th Edition of the Language Resources and Evaluation Conference (LREC), pp. 1170–1175 (2014)
Nguyen, T.H., Shirai, K.: Topic modeling based sentiment analysis on social media for stock market prediction. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, pp. 1354–1364 (2015)
Izumi, K., Goto, T., Matsui, T.: Trading tests of long-term market forecast by text mining. In: Proceedings of 2010 IEEE International Conference on Data Mining Workshops (ICDMW), pp. 935–942 (2010)
Schumaker, R.P., Chen, H.: A discrete stock price prediction engine based on financial news. Computer 43(1), 51–56 (2010)
Ding, X., Zhang, Y., Liu, T., Duan, J.: Using structured events to predict stock price movement: an empirical investigation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1415–1425 (2014)
Xie, B., Passonneau, R.J., Wu, L., Creamer, G.G.: Semantic frames to predict stock price movement. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 873–883 (2013)
Peng, Y., Jiang, H.: Leverage financial news to predict stock price movements using word embeddings and deep neural networks. In: Proceedings of NAACL-HLT 2016, pp. 374–379 (2016)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of NIPS (2013)
Bar-Haim, R., Dinur, E., Feldman, R., Fresko, M., Goldstein, G.: Identifying and following expert investors in stock microblogs. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pp. 1310–1319 (2011)
Kudo, T., Yamamoto, K., Matsumoto, Y.: Applying conditional random fields to Japanese morphological analysis. In: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP-2004), pp. 230–237 (2004)
Kudo, T., Matsumoto, Y.: Japanese dependency analysis using cascaded chunking. In: CoNLL 2002: Proceedings of the 6th Conference on Natural Language Learning 2002 (COLING 2002 Post-Conference Workshops), pp. 63–69 (2002)
Vapnik, V.N.: Statistical Learning Theory. Wiley, New York (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Ichinose, K., Shimada, K. (2018). Stock Market Prediction Using Keywords from Expert Articles. In: Ghazali, R., Deris, M., Nawi, N., Abawajy, J. (eds) Recent Advances on Soft Computing and Data Mining. SCDM 2018. Advances in Intelligent Systems and Computing, vol 700. Springer, Cham. https://doi.org/10.1007/978-3-319-72550-5_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-72550-5_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-72549-9
Online ISBN: 978-3-319-72550-5
eBook Packages: EngineeringEngineering (R0)