Towards the Automatic Learning of Idiomatic Prepositional Phrases

  • Conference paper
MICAI 2005: Advances in Artificial Intelligence (MICAI 2005)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3789)

Abstract

The objective of this work is to automatically determine, in an unsupervised manner, Spanish prepositional phrases of the type preposition – nominal phrase – preposition (P–NP–P) that behave in a sentence as a lexical unit and whose semantic and syntactic properties cannot be deduced from the corresponding properties of each simple form, e.g., por medio de (by means of), a fin de (in order to), con respecto a (with respect to). We show that idiomatic P–NP–P combinations have some statistical properties distinct from those of usual idiomatic collocations. We also explore a way to differentiate P–NP–P combinations that could perform either as a regular prepositional phrase or as an idiomatic prepositional phrase.

Work partially supported by Mexican Government (CONACyT, SNI, CGPI-IPN, PIFI-IPN).
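As a rough illustration of the kind of candidate extraction the abstract alludes to, the sketch below counts preposition, word, preposition trigrams in a tokenized text. It is not the authors' method: the preposition list, the function name `count_pnp_candidates`, the sample sentence, and the simplification of the nominal phrase to a single token are all assumptions made for this example.

```python
from collections import Counter

# Assumption: a small, illustrative subset of Spanish simple prepositions.
PREPOSITIONS = {"a", "con", "de", "en", "por", "para", "sin", "sobre"}


def count_pnp_candidates(tokens):
    """Count trigrams shaped like preposition, word, preposition.

    Simplification (assumption): the nominal phrase is a single token,
    and any non-preposition token is accepted in the middle slot.
    """
    counts = Counter()
    for first, middle, last in zip(tokens, tokens[1:], tokens[2:]):
        if (
            first in PREPOSITIONS
            and last in PREPOSITIONS
            and middle not in PREPOSITIONS
        ):
            counts[(first, middle, last)] += 1
    return counts


# Hypothetical sample text, lowercased and split on whitespace.
tokens = (
    "lo hizo por medio de un amigo y por medio de una carta a fin de ayudar"
).split()
candidates = count_pnp_candidates(tokens)

# List candidates from most to least frequent.
for trigram, freq in candidates.most_common():
    print(" ".join(trigram), freq)
```

Raw frequency only proposes candidates. Separating idiomatic from regular combinations would need further statistics of the kind the paper investigates.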

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Galicia-Haro, S.N., Gelbukh, A. (2005). Towards the Automatic Learning of Idiomatic Prepositional Phrases. In: Gelbukh, A., de Albornoz, Á., Terashima-Marín, H. (eds) MICAI 2005: Advances in Artificial Intelligence. MICAI 2005. Lecture Notes in Computer Science, vol 3789. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11579427_79

  • DOI: https://doi.org/10.1007/11579427_79

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29896-0

  • Online ISBN: 978-3-540-31653-4

  • eBook Packages: Computer Science (R0)
