Skip to main content

Economic Sentiment: Text-Based Prediction of Stock Price Movements with Machine Learning and WordNet

  • Conference paper
  • 4153 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7906))

Abstract

This paper explores the use of machine learning techniques in classifying financial news for the purpose of predicting stock price movements. The current body of literature on the subject is small, and the reported results are mixed. During the course of this paper we attempt to identify some causes for the divergent results, and devise experiments that account for weaknesses in existing research. A corpus of Thomson Reuter newswires was collected from Dow Jones’ Factiva for seven large stocks. Each article was then linked with the associated price gap of the trading day following the article’s publish date. Utilizing a sequential minimal optimization based support vector machine along with a WordNet-transformed bag-of-words representation, predictions were made in the form of long and short signals. Another variant of the system was also evaluated, wherein Latent Semantic Analysis was employed to process the input data. The signals were conditioned on a set of thresholds, meaning that trade signals were only generated when the predicted values exceeded certain threshold values. Higher thresholds were associated with higher accuracy but a lower number of trading signals. Overall the results were promising.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kennedy, A., Inkpen, D.: Sentiment classification of movie reviews using contextual valence shifters. Computational Intelligence 22(2), 110–125 (2006)

    Article  MathSciNet  Google Scholar 

  2. Whitelaw, C., Garg, N., Argamon, S.: Using appraisal groups for sentiment analysis. In: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, pp. 625–631 (2005)

    Google Scholar 

  3. Thomson Reuters Launches Social Media Sentiment Analysis Offering, http://thomsonreuters.com/content/press_room/financial/2012_03_07_social_media_sentiment_analysis_offering

  4. WiseWindow Enters Strategic Relationship with Thoms Reuters to DeliverReal-Time Consumer Index Score for 300 Public Companies Covering Six Industry Groups, http://www.bloomberg.com/article/2012-02-03/aGzt5TYWYfiM.html

  5. Schumaker, R.P., Chen, H.: Textual analysis of stock market prediction using breaking financial news: The AZFin text system. ACM Transactions on Information Systems 27(2) (2009)

    Google Scholar 

  6. Gidófalvi, G., Elkan, C.: Using news articles to predict stock price movements. Department of Computer Science and Engineering. University of California, San Diego (2001)

    Google Scholar 

  7. Mittermayer, M.-A., Knolmayer, G.F.: Newscats: A news categorization and trading system. In: Sixth International Conference on Data Mining, pp. 1002–1007. IEEE (2006)

    Google Scholar 

  8. Koppel, M., Shtrimberg, I.: Good news or bad news? let the market decide. Computing attitude and affect in text: Theory and application, pp. 297-301 (2006)

    Google Scholar 

  9. WordNet lexical database, http://wordnet.princeton.edu/

  10. eSignal data provider service, http://www.esignal.com/

  11. Weka: Data Mining Software, http://www.cs.waikato.ac.nz/ml/weka/

  12. Kakkonen, T., Myller, N., Sutinen, E., Timonen, J.: Comparison of dimension reduction methods for automated essay grading. Natural Language Engineering 1, 1–16 (2005)

    Google Scholar 

  13. Zhang, D., Zhu, Z.: A fast approximate algorithm for large-scale Latent Semantic Indexing. In: Third International Conference on Digital Information Management, pp. 626–631. IEEE (2008)

    Google Scholar 

  14. Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 4(3), 199–222 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hollum, A.T.G., Mosch, B.P., Szlávik, Z. (2013). Economic Sentiment: Text-Based Prediction of Stock Price Movements with Machine Learning and WordNet. In: Ali, M., Bosse, T., Hindriks, K.V., Hoogendoorn, M., Jonker, C.M., Treur, J. (eds) Recent Trends in Applied Artificial Intelligence. IEA/AIE 2013. Lecture Notes in Computer Science(), vol 7906. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38577-3_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38577-3_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38576-6

  • Online ISBN: 978-3-642-38577-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics