Skip to main content

Named Entity Recognition from Financial Press Releases

  • Conference paper
  • First Online:
Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2014)

Abstract

This paper explores a previous model’s use of named entity recognition to predict the changes in stock prices from financial news. Detecting company mentions in the articles is crucial for this task, and we modified these methods to gain additional mentions. We first expanded upon the rules of the named entity recognition from the original model. We also incorporated coreference resolution and modified an existing toolkit to be compatible with our specific domain. After these two adjustments, the number of instances captured increased significantly. Although this did not necessarily improve the overall prediction performance, the results give us an opportunity to explore reasons why the scores stayed around the same, and a full analysis will allow us to achieve our goals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    S&P 500 is an equity market index that includes 500 publicly traded companies in leading industries.

  2. 2.

    http://www.ark.cs.cmu.edu/SEMAFOR.

References

  1. O’Connor, B., Stewart, B.M., Smith, N.A.: Learning to extract international relations from political context. In: Proceedings of the 51st Annual Meeting of the ACL, Sofia, Bulgaria, pp. 1094–1104 (2013)

    Google Scholar 

  2. Xie, B., Passonneau, R.J., Wu, L., Creamer, G.: Semantic frames to predict stock price movement. In: Proceedings of the 51st Annual Meeting of the ACL, Sofia, Bulgaria, pp. 873–883 (2013)

    Google Scholar 

  3. Ravi, S., Knight, K., Soricut, R.: Automatic prediction of parser accuracy. In: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, Honolulu, Hawaii, pp. 887–896 (2008)

    Google Scholar 

  4. McClosky, D., Charniak, E., Johnson, M.: Automatic domain adaptation for parsing. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL. HLT 2010, Stroudsburg, PA, USA, pp. 28–36 (2010)

    Google Scholar 

  5. Roux, J.L., Foster, J., Wagner, J., Samad, R., Kaljahi, Z., Bryl, A.: DUC-Paris13 systems for the SANCL 2012 shared task (2012)

    Google Scholar 

  6. Bulyko, I., Ostendorf, M.: Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures. In: Proceedings of HLT-NAACL 2003, pp. 7–9 (2003)

    Google Scholar 

  7. Sarikaya, R., Gravano, A., Gao, Y.: Rapid language model development using external resources for new spoken dialog domains. In: International Congress of Acoustics. Speech, and Signal Processing (ICASSP), Philadelphia, PA, USA, pp. 573–576. IEEE Signal Processing Society (2005)

    Google Scholar 

  8. Rosenfeld, B., Feldman, R.: Using corpus statistics on entities to improve semi-supervised relation extraction from the web. In: Proceedings of the 45th Annual Meeting of the ACL, ACL 2007, 23–30 June, 2007, Prague, Czech Republic (2007)

    Google Scholar 

  9. Feldman, R., Rosenfeld, B., Bar-Haim, R., Fresko, M.: The stock sonar - sentiment analysis of stocks based on a hybrid approach. In: Proceedings of the Twenty-Third Conference on Innovative Applications of Artificial Intelligence, 9–11 August 2011, San Francisco, California, USA (2011)

    Google Scholar 

  10. Lee, H., Chang, A., Peirsman, Y., Chambers, N., Surdeanu, M., Jurafsky, D.: Deterministic coreference resolution based on entity-centric, precision-ranked rules. Comput. Linguist. 39, 885–916 (2013)

    Article  Google Scholar 

  11. Tetlock, P.C.: Giving content to investor sentiment: the role of media in the stock market. J. Fin. (2007)

    Google Scholar 

  12. Gentzkow, M., Shapiro, J.M.: What drives media slant? Evidence from U.S. daily newspapers. Econometrica 78, 35–71 (2010)

    Article  MathSciNet  Google Scholar 

  13. Engelberg, J., Parsons, C.A.: The causal impact of media in financial markets. J. Fin. 66, 67–97 (2011)

    Article  Google Scholar 

  14. Devitt, A., Ahmad, K.: Sentiment polarity identification in financial news: a cohesion-based approach. In: Proceedings of the 45th Annual Meeting of the ACL, Prague, Czech Republic, pp. 984–991 (2007)

    Google Scholar 

  15. Haider, S.A., Mehrotra, R.: Corporate news classification and valence prediction: a supervised approach. In: Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis. WASSA 2011, Portland, Oregon, pp. 175–181 (2011)

    Google Scholar 

  16. Chua, C., Milosavljevic, M., Curran, J.R.: A sentiment detection engine for internet stock message boards. In: Proceedings of the Australasian Language Technology Association Workshop 2009, Sydney, Australia, pp. 89–93 (2009)

    Google Scholar 

  17. Bar-Haim, R., Dinur, E., Feldman, R., Fresko, M., Goldstein, G.: Identifying and following expert investors in stock microblogs. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, pp. 1310–1319 (2011)

    Google Scholar 

  18. Kogan, S., Levin, D., Routledge, B.R., Sagi, J.S., Smith, N.A.: Predicting risk from financial reports with regression. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the ACL. NAACL 2009, Stroudsburg, PA, USA, pp. 272–280 (2009)

    Google Scholar 

  19. Luss, R., d’Aspremont, A.: Predicting abnormal returns from news using text classification (2008). CoRR, abs/0809.2792

  20. Wentland, W., Johannes Knopp, C.S., Hartung, M.: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration. In: (ELRA), E.L.R.A., ed.: Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), Marrakech, Morocco (2008)

    Google Scholar 

  21. Manning, C.D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S.J., McClosky, D.: The Stanford CoreNLP natural language processing toolkit. In: Proceedings of 52nd Annual Meeting of the ACL, pp. 55–60 (2014)

    Google Scholar 

  22. Moschitti, A.: Making tree kernels practical for natural language learning. In: Proceedings of the 11th Conference of the European Chapter of the ACL (2006)

    Google Scholar 

  23. Fillmore, C.J.: Frame semantics and the nature of language. Ann. N.Y. Acad. Sci. 280, 20–32 (1976)

    Article  Google Scholar 

  24. Das, D., Smith, N.A.: Semi-supervised frame-semantic parsing for unknown predicates. In: Proceedings of the 49th Annual Meeting of the ACL. HLT 2011, Stroudsburg, PA, USA, pp. 1435–1444 (2011)

    Google Scholar 

  25. Das, D., Smith, N.A.: Graph-based lexicon expansion with sparsity-inducing penalties. In: HLT-NAACL, pp. 677–687 (2012)

    Google Scholar 

  26. Whissel, C.M.: The dictionary of affect in language. Emot. Theory Res. Experience 39, 113–131 (1989)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tifara Ramelson .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Passonneau, R.J., Ramelson, T., Xie, B. (2015). Named Entity Recognition from Financial Press Releases. In: Fred, A., Dietz, J., Aveiro, D., Liu, K., Filipe, J. (eds) Knowledge Discovery, Knowledge Engineering and Knowledge Management. IC3K 2014. Communications in Computer and Information Science, vol 553. Springer, Cham. https://doi.org/10.1007/978-3-319-25840-9_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25840-9_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25839-3

  • Online ISBN: 978-3-319-25840-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics