Abstract
This paper explores the use of machine learning techniques in classifying financial news for the purpose of predicting stock price movements. The current body of literature on the subject is small, and the reported results are mixed. During the course of this paper we attempt to identify some causes for the divergent results, and devise experiments that account for weaknesses in existing research. A corpus of Thomson Reuter newswires was collected from Dow Jones’ Factiva for seven large stocks. Each article was then linked with the associated price gap of the trading day following the article’s publish date. Utilizing a sequential minimal optimization based support vector machine along with a WordNet-transformed bag-of-words representation, predictions were made in the form of long and short signals. Another variant of the system was also evaluated, wherein Latent Semantic Analysis was employed to process the input data. The signals were conditioned on a set of thresholds, meaning that trade signals were only generated when the predicted values exceeded certain threshold values. Higher thresholds were associated with higher accuracy but a lower number of trading signals. Overall the results were promising.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Kennedy, A., Inkpen, D.: Sentiment classification of movie reviews using contextual valence shifters. Computational Intelligence 22(2), 110–125 (2006)
Whitelaw, C., Garg, N., Argamon, S.: Using appraisal groups for sentiment analysis. In: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, pp. 625–631 (2005)
Thomson Reuters Launches Social Media Sentiment Analysis Offering, http://thomsonreuters.com/content/press_room/financial/2012_03_07_social_media_sentiment_analysis_offering
WiseWindow Enters Strategic Relationship with Thoms Reuters to DeliverReal-Time Consumer Index Score for 300 Public Companies Covering Six Industry Groups, http://www.bloomberg.com/article/2012-02-03/aGzt5TYWYfiM.html
Schumaker, R.P., Chen, H.: Textual analysis of stock market prediction using breaking financial news: The AZFin text system. ACM Transactions on Information Systems 27(2) (2009)
Gidófalvi, G., Elkan, C.: Using news articles to predict stock price movements. Department of Computer Science and Engineering. University of California, San Diego (2001)
Mittermayer, M.-A., Knolmayer, G.F.: Newscats: A news categorization and trading system. In: Sixth International Conference on Data Mining, pp. 1002–1007. IEEE (2006)
Koppel, M., Shtrimberg, I.: Good news or bad news? let the market decide. Computing attitude and affect in text: Theory and application, pp. 297-301 (2006)
WordNet lexical database, http://wordnet.princeton.edu/
eSignal data provider service, http://www.esignal.com/
Weka: Data Mining Software, http://www.cs.waikato.ac.nz/ml/weka/
Kakkonen, T., Myller, N., Sutinen, E., Timonen, J.: Comparison of dimension reduction methods for automated essay grading. Natural Language Engineering 1, 1–16 (2005)
Zhang, D., Zhu, Z.: A fast approximate algorithm for large-scale Latent Semantic Indexing. In: Third International Conference on Digital Information Management, pp. 626–631. IEEE (2008)
Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 4(3), 199–222 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hollum, A.T.G., Mosch, B.P., Szlávik, Z. (2013). Economic Sentiment: Text-Based Prediction of Stock Price Movements with Machine Learning and WordNet. In: Ali, M., Bosse, T., Hindriks, K.V., Hoogendoorn, M., Jonker, C.M., Treur, J. (eds) Recent Trends in Applied Artificial Intelligence. IEA/AIE 2013. Lecture Notes in Computer Science(), vol 7906. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38577-3_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-38577-3_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38576-6
Online ISBN: 978-3-642-38577-3
eBook Packages: Computer ScienceComputer Science (R0)