skip to main content
10.1145/3383455.3422557acmconferencesArticle/Chapter ViewAbstractPublication PagesicaifConference Proceedingsconference-collections
research-article

Choosing news topics to explain stock market returns

Published:07 October 2021Publication History

ABSTRACT

We analyze methods for selecting topics in news articles to explain stock returns. We find, through empirical and theoretical results, that supervised Latent Dirichlet Allocation (sLDA) implemented through Gibbs sampling in a stochastic EM algorithm will often overfit returns to the detriment of the topic model. We obtain better out-of-sample performance through a random search of plain LDA models. A branching procedure that reinforces effective topic assignments often performs best. We test these methods on an archive of over 90,000 news articles about S&P 500 firms.

References

  1. David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. JMLR 3, Jan (2003), 993--1022.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Jordan Boyd-Graber and Philip Resnik. 2010. Holistic sentiment analysis across languages: Multilingual supervised latent dirichlet allocation. In EMNLP. 45--55.Google ScholarGoogle Scholar
  3. John Campbell. 1991. A variance decomposition for stock returns. The Economic Journal (1991), 157--179.Google ScholarGoogle Scholar
  4. Thomas L Griffiths and Mark Steyvers. 2004. Finding scientific topics. PNAS 101, suppl 1 (2004), 5228--5235.Google ScholarGoogle ScholarCross RefCross Ref
  5. Michael Hughes, Gabriel Hope, Leah Weiner, Thomas McCoy, Roy Perlis, Erik Sudderth, and Finale Doshi-Velez. 2018. Semi-supervised prediction-constrained topic models. In AISTAT. 1067--1076.Google ScholarGoogle Scholar
  6. Tim Loughran and Bill McDonald. 2011. When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. Journal of Finance 66, 35--65.Google ScholarGoogle Scholar
  7. Jon D Mcauliffe and David M Blei. 2008. Supervised topic models. In NeurIPS. 121--128.Google ScholarGoogle Scholar
  8. Thien Hai Nguyen and Kiyoaki Shirai. 2015. Topic modeling based sentiment analysis on social media for stock market prediction. In ACL. 1354--1364.Google ScholarGoogle Scholar
  9. Richard Roll. 1988. R2. Journal of Finance 43 (3), 541--566.Google ScholarGoogle Scholar
  10. P. Tetlock, M. Saar-Tsechansky, and S. Macskassy. 2008. More Than Words: Quantifying Language to Measure Firms' Fundamentals. Journal of Finance 63, 3 (2008), 1437--1467.Google ScholarGoogle ScholarCross RefCross Ref
  11. Paul C Tetlock. 2014. Information transmission in finance. Annu. Rev. Financ. Econ. 6, 1, 365--384.Google ScholarGoogle Scholar
  12. Cheng Zhang and Hedvig Kjellström. 2014. How to supervise topic models. In ECCV. Springer, 500--515.Google ScholarGoogle Scholar

Index Terms

  1. Choosing news topics to explain stock market returns

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ICAIF '20: Proceedings of the First ACM International Conference on AI in Finance
        October 2020
        422 pages
        ISBN:9781450375849
        DOI:10.1145/3383455

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 7 October 2021

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader