Skip to main content

Poincaré Embeddings in the Task of Named Entity Recognition

  • Conference paper
  • First Online:
Advances in Computational Intelligence (MICAI 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12469))

Included in the following conference series:

  • 927 Accesses

Abstract

Hyperbolic embeddings have become important in many natural language processing tasks due to their great ability to capture latent hierarchical data and to encode valuable syntactic and semantic information. We study and consider the ability of Poincaré embeddings to get the most similar nodes to a given node when trying to recognize named entities in a set of text documents. In this paper, we propose a classifier model for the NER (Named Entity Recognition) task by implementing Poincaré embeddings and by using the most frequent n-grams and their Part-of-Speech (POS) structures from the training dataset. We found that POS structures and n-grams help to map possible named entities, while using Poincaré embeddings manage to affirm and refine this recognition, improving the recognition of named entities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Definitions of used classes are explained in the original source and can be found in https://dictionary.cambridge.org/dictionary/english/.

  2. 2.

    The Web-based Tagger Tool can be found in https://lkesymposium.tudublin.ie/Tagger/.

References

  1. Jansen, B.J., Rieh, S.: The seventeen theoretical constructs of information searching and information retrieval. J. Am. Soc. Inf. Sci. Technol. 61, 1517–1534 (2010)

    Google Scholar 

  2. Hasegawa, T., Sekine, S., Grishman, R.: Discovering relations among named entities from large corpora. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 415. Association for Computational Linguistics (2004)

    Google Scholar 

  3. Grishman, R., Sundheim, B.: Design of the MUC-6 evaluation. In: Proceedings Sixth Message Understanding Conference (MUC-6), pp. 1–11 (1996)

    Google Scholar 

  4. Nickel, M., Kiela, D.: Poincaré embeddings for learning hierarchical representations. In: Advances in Neural Information Processing Systems, pp. 6338–6347 (2017)

    Google Scholar 

  5. Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)

    Article  Google Scholar 

  6. Lay, D.C.: Linear Algebra and Its Applications, 5th edn, pp. 255–262. Addison Wesley Publishing Company, Boston (2018)

    Google Scholar 

  7. Sketch Engine: POS tagger (2020). https://www.sketchengine.eu/my_keywords/pos-tagger/

  8. Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly Media, Sebastopol (2009)

    MATH  Google Scholar 

  9. Muñoz, D., Pérez-Téllez, F., Pinto, D.: Collaborative web-based tagger for named entities in the task of information extraction. Pistas Educativas 40, 877–893 (2018)

    Google Scholar 

  10. Jurafsky, D., Martin, J.H.: Information extraction. In: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, pp. 725–743 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Muñoz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Muñoz, D., Pérez, F., Pinto, D. (2020). Poincaré Embeddings in the Task of Named Entity Recognition. In: Martínez-Villaseñor, L., Herrera-Alcántara, O., Ponce, H., Castro-Espinoza, F.A. (eds) Advances in Computational Intelligence. MICAI 2020. Lecture Notes in Computer Science(), vol 12469. Springer, Cham. https://doi.org/10.1007/978-3-030-60887-3_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-60887-3_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60886-6

  • Online ISBN: 978-3-030-60887-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics