Abstract
In this article we present a simple method to extract Spanish nouns with the linguistic property of “human” animacy. We describe a non-supervised method based on lexical patterns and on a person name list enlarged from a collection of newspaper texts. Results were obtained from the Web filters and estimation methods are proposed to validate them.
Work done with partial support of Mexican Government (CONACyT, SNI, CGEPI-IPN).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aissen, J.: Differential Object Marking: Iconicity vs. Economy. Natural Language and Linguistic Theory 21(3), 435–483 (2003)
Altmann, L.J.P., Kemper, S.: Effects of Age, Animacy, and Activation Order on Sentence Production. Language and Cognitive Processes 21(1), 322–354 (2006)
Berenguer, C.R., Cruz Pastor Ferrán, M.: ¿Cuánto dura/tarda la clase de Español?: una reflexión sobre determinados usos verbales en Español. In: Lengua y cultura en la enseñanza del Español a extranjeros. Actas del VII Congreso de ASELE, pp. 397i–406i. Ediciones de la Universidad de Castilla la Mancha (1998)
Brants, T., Franz, A.: Web 1T 5-gram Version 1 Linguistic Data Consortium (2006)
Fleischman, M., Echihabi, A., Hovy, E.: Offline Strategies for Online Question Answering: Answering Questions before They are Asked. In: Proceedings of the ACL Conference, pp. 1–7 (2003)
Foundalis, H.E.: Evolution of Gender in Indo-European Languages. In: Proceedings of the 24th Annual Conference of the Cognitive Science Society, Fairfax, VA, pp. 304–309 (2002)
Galicia-Haro, S.N.: Using Electronic Texts for an Annotated Corpus Building. In: 4th Mexican International Conference on Computer Science, ENC, Mexico, pp. 26–33 (2003)
Heng, J., Lin, D.: Gender and Animacy Knowledge Discovery from Web-Scale N-Grams for Unsupervised Person Mention Detection. In: Proceedings of PACLIC (2009)
Lin, D.: Automatic Retrieval and Clustering of Similar Words. In: Proceedings of the 17th International Conference on Computational Linguistics, pp. 768–774 (1998)
Orăsan, C., Evans, R.: Learning to Identify Animate References. In: Proceedings of the Workshop on Computational Natural Language Learning, ACL (2001)
Orăsan, C., Evans, R.: NP Animacy Resolution for Anaphora Resolution. Journal of Artificial Intelligence Research 29, 79–103 (2007)
Øvrelid, L.: Empirical Evaluations of Animacy Annotation. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pp. 630–638 (2009)
Paşca, M., Van Durme, B.: What You Seek Is What You Get: Extraction of Class Attributes from Query Logs. In: Proceedings of the International Joint Conference on Artificial Intelligence 2007, pp. 2832–2837 (2007)
von Heusinger, K., Kaiser, G.A.: Differential Object Marking and the Lexical Semantics of Verbs in Spanish. In: Kaiser, G.A., Leonetti, M. (eds.) Proceedings of the Workshop Definiteness, Specificity and Animacy in Ibero-Romance Languages, pp. 85–110 (2007)
von Heusinger, K., Kaiser, G.A.: The Interaction of Animacy, Definiteness and Specificity in Spanish. In: von Heusinger, K., Kaiser, G.A. (eds.) Proceedings of the Workshop: Semantic and Syntactic Aspects of Specificity, Romance Languages, pp. 41–65. Universität Konstanz, Konstanz (2003)
Yamamoto, M.: Animacy and Reference: A Cognitive Approach to Corpus Linguistics. Studies in Language Companion Series, vol. 46. John Benjamins, Amsterdam (1999)
Zaenen, A., Carletta, J., Garretson, G., Bresnan, J., Koontz-Garboden, A., Nikitina, T., O’Connor, M.C., Wasow, T.: Animacy Encoding in English: Why and How. In: Proceedings of the 2004 ACL Workshop on Discourse Annotation, pp. 118–125 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Galicia-Haro, S.N., Gelbukh, A.F. (2010). Extracting Human Spanish Nouns. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2010. Lecture Notes in Computer Science(), vol 6231. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15760-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-15760-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15759-2
Online ISBN: 978-3-642-15760-8
eBook Packages: Computer ScienceComputer Science (R0)