Abstract
Hyperbolic embeddings have become important in many natural language processing tasks due to their great ability to capture latent hierarchical data and to encode valuable syntactic and semantic information. We study and consider the ability of Poincaré embeddings to get the most similar nodes to a given node when trying to recognize named entities in a set of text documents. In this paper, we propose a classifier model for the NER (Named Entity Recognition) task by implementing Poincaré embeddings and by using the most frequent n-grams and their Part-of-Speech (POS) structures from the training dataset. We found that POS structures and n-grams help to map possible named entities, while using Poincaré embeddings manage to affirm and refine this recognition, improving the recognition of named entities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Definitions of used classes are explained in the original source and can be found in https://dictionary.cambridge.org/dictionary/english/.
- 2.
The Web-based Tagger Tool can be found in https://lkesymposium.tudublin.ie/Tagger/.
References
Jansen, B.J., Rieh, S.: The seventeen theoretical constructs of information searching and information retrieval. J. Am. Soc. Inf. Sci. Technol. 61, 1517–1534 (2010)
Hasegawa, T., Sekine, S., Grishman, R.: Discovering relations among named entities from large corpora. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 415. Association for Computational Linguistics (2004)
Grishman, R., Sundheim, B.: Design of the MUC-6 evaluation. In: Proceedings Sixth Message Understanding Conference (MUC-6), pp. 1–11 (1996)
Nickel, M., Kiela, D.: Poincaré embeddings for learning hierarchical representations. In: Advances in Neural Information Processing Systems, pp. 6338–6347 (2017)
Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)
Lay, D.C.: Linear Algebra and Its Applications, 5th edn, pp. 255–262. Addison Wesley Publishing Company, Boston (2018)
Sketch Engine: POS tagger (2020). https://www.sketchengine.eu/my_keywords/pos-tagger/
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly Media, Sebastopol (2009)
Muñoz, D., Pérez-Téllez, F., Pinto, D.: Collaborative web-based tagger for named entities in the task of information extraction. Pistas Educativas 40, 877–893 (2018)
Jurafsky, D., Martin, J.H.: Information extraction. In: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, pp. 725–743 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Muñoz, D., Pérez, F., Pinto, D. (2020). Poincaré Embeddings in the Task of Named Entity Recognition. In: Martínez-Villaseñor, L., Herrera-Alcántara, O., Ponce, H., Castro-Espinoza, F.A. (eds) Advances in Computational Intelligence. MICAI 2020. Lecture Notes in Computer Science(), vol 12469. Springer, Cham. https://doi.org/10.1007/978-3-030-60887-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-60887-3_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60886-6
Online ISBN: 978-3-030-60887-3
eBook Packages: Computer ScienceComputer Science (R0)