Abstract
The objective of this work is to automatically determine, in an unsupervised manner, Spanish prepositional phrases of the type preposition – nominal phrase – preposition (P(NP(P) that behave in a sentence as a lexical unit and their semantic and syntactic properties cannot be deduced from the corresponding properties of each simple form, e.g., por medio de (by means of), a fin de (in order to), con respecto a (with respect to). We show that idiomatic P(NP(P combinations have some statistical properties distinct from those of usual idiomatic collocations. We also explore a way to differentiate P(NP(P combinations that could perform either as a regular prepositional phrase or as idiomatic prepositional phrase.
Work partially supported by Mexican Government (CONACyT, SNI, CGPI-IPN, PIFI-IPN).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Banerjee, S., Pedersen, T.: The Design, Implementation, and Use of the Ngram Statistic Package. In: Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics, México (2003)
Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. In: Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, pp. 76–83 (1989)
Degand, L., Bestgen, Y.: Towards automatic retrieval of idioms in French newspaper corpora. Literary and Linguistic Computing 18, 249–259 (2003)
Diccionario de María Moliner.: Diccionario de Uso del Español. Primera edición versión electrónica (CD-ROM) Editorial Gredos, S. A (1996)
Dunning, T.: Accurate methods for the statistics of surprise and coincidence. Computational Linguistics 19(1), 61–74 (1993)
Evert, S., Krenn, B.: Methods for the Qualitative Evaluation of Lexical Association. In: Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics, Toulouse, France, pp. 188–195 (2001)
Galicia-Haro, S.N.: Using Electronic Texts for an Annotated Corpus Building. In: 4th Mexican International Conference on Computer Science, ENC 2003, Mexico, pp. 26–33 (2003)
Graña Gil, J., Barcala Rodríguez, F.M., Vilares Ferro, J.: Formal Methods of Tokenization for Part-of-Speech Tagging. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 240–249. Springer, Heidelberg (2002)
Justeson, J.S., Katz, S.M.: Technical Terminology: Some Linguistic properties and an algorithm for identification in text. Natural Language Engineering 1, 9–27 (1995)
Manning, C.D., Schutze, H.: Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge (1999)
Mel’cuk, I.: Dependency Syntax: Theory and Practice. State University of New York Press, New York (1988)
Nañez Fernández, E.: Diccionario de construcciones sintácticas del español. Preposiciones. Madrid, España, Editorial de la Universidad Autónoma de Madrid (1995)
Rayson, P., Berridge, D., Francis, B.: Extending the Cochran rule for the comparison of word frequencies between corpora. In: Purnelle, G. (ed.) Le poids des mots: Proc. of 7th International Conf. on Statistical analysis of textual data (JADT 2004), vol. II, pp. 926–936. Presses universitaires de Louvain (2004)
Real Academia Española.: Diccionario de la Real Academia Española, 21 edición, Espasa, Calpe (1995) (CD-ROM)
Seco, M.: Gramática esencial del español, introducción al estudio de la lengua, Segunda edición revisada y aumentada, Madrid, Espasa Calpe (1989)
Villada, B., Bouma, G.: A corpus-based approach to the acquisition of collocational prepositional phrases. In: Proceedings of EURALEX 2002, Copenhagen. Denmark, pp. 153–158 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Galicia-Haro, S.N., Gelbukh, A. (2005). Towards the Automatic Learning of Idiomatic Prepositional Phrases. In: Gelbukh, A., de Albornoz, Á., Terashima-Marín, H. (eds) MICAI 2005: Advances in Artificial Intelligence. MICAI 2005. Lecture Notes in Computer Science(), vol 3789. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11579427_79
Download citation
DOI: https://doi.org/10.1007/11579427_79
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29896-0
Online ISBN: 978-3-540-31653-4
eBook Packages: Computer ScienceComputer Science (R0)