Abstract
This paper presents an algorithm based on heuristic rules in order to solve Spanish definite description references. This algorithm is applied to an information extraction system for Spanish language. These heuristic rules are extracted from the study of an unrestricted corpus. This algorithm solves identity co-reference produced by a definite description whose relation with its antecedents can be solved with syntactic or semantic information. This module achieves a precision of 95.3% in classification task (anaphoric or non-anaphoric) and a average precision of 78% in Conference topics: Natural Language Processing
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
P. Christopherson. The Articles: A study of their theory and use in English. E. Munksgaard, Copenhagen, 1939.
H. H. Clark. Bridging. In P. Johnson-Laird and P Wason, editors, Thinking: readings in cognitive science, pages 411–420. Cambridge: CUP, 1977.
J. Fukumoto, F. Masui, M. Shimohata, and M. Sasaki. Oki Electric Industry: Description of the Oki System as used for MUC-7. http://www.muc.saic.com/proceedings/, 1998.
R. Gaizauskas and Y. Wilks. Information Extraction: Beyond Document Retrieval. Journal of Documentation, 54(1):70–105, January 1998.
R. Garigliano, A. Urbanowicz, and D. J. Nettleton. University of Durham: Description of the LOLITA System as used in MUC-7. In Publishers [15].
J. A. Hawkins. Definiteness and indefiniteness. Humanities Press, Atlantic High-lands, NJ, 1978.
K. Humphreys, R. Gaizauskas, S. Azzam, C. Huyck, and B. Mitchell. University of Sheffield: Description of the LaSIE-II System as used for MUC-7. In Publishers [15].
F. Llopis, R. Mutano-noz, A. Suárez, and A. Montoyo. EXIT: Propuesta de un sistema de extracción de información de textos notariales. Revista Nováatica, 133:26–30, 1998.
R. Mutano-noz, A. Montoyo, F. Llopis, and A. Suárez. Reconocimiento de entidades en el sistema EXIT. Procesamiento del Lenguaje Natural, 23:47–53, september 1998.
R. Mutano-noz and M. Palomar. Processing of Spanish Definite Descriptions with the Same Head. In Dimitris N. Christodoulakis, editor, Proceeding of NLP2000: Filling the gap between theory and practice, Lectures Notes in Artificial Intelligence vol. 1835, pages 212–220, Patras, Greece, June 2000. Springer-Verlag.
R. Mutano-noz, M. Palomar, and A. Ferrández. Processing of Spanish Definite Descriptions. In O. Cairo, E.L. Sucar, and F.J. Cantu, editors, Proceeding of Mexican International Conference on Artificial Intelligence, Lectures Notes in Artificial Intelligence vol. 1793, pages 526–537, Acapulco, Mexico, April 2000. Springer-Verlag.
M. Palomar, A. Ferrández, L. Moreno, M. Saiz-Noeda, R. Mutano-noz, P. Martínez-Barco, J. Peral, and B. Navarro. A Robust Partial Parsing Strategy based on the Slot Unification Grammars. In Proceeding of 6e Conférence annuelle sur le Traitement Automatique des Langues Naturelles. TALN’99, pages 263–272, Cargèse, Corse, July 1999.
M. Poesio and R. Vieira. A Corpus-Based Investigation of Definite Description Use. Computational Linguistics. MIT Press, 24:183–216, 1998.
E. Prince. Toward a taxonomy of given-newinformation. In P. Cole, editor, Radical Pragmatics. Academic Press, New York, pages 223–256, 1981.
Morgan Kaufman Publishers, editor. Proceedings of Seventh Message Understandig Conference, http://www.muc.saic.com/proceedings/, Spring 1998.
R. Vieira and M. Poesio. Corpus-based and computational aproach to anaphora, chapter Processing definite descriptions in corpora. S. Botley and T. McEnery eds. UCL Press, London, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Palomar, M., Muñoz, R. (2000). Definite Descriptions in an Information Extraction System. In: Monard, M.C., Sichman, J.S. (eds) Advances in Artificial Intelligence. IBERAMIA SBIA 2000 2000. Lecture Notes in Computer Science(), vol 1952. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44399-1_33
Download citation
DOI: https://doi.org/10.1007/3-540-44399-1_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41276-2
Online ISBN: 978-3-540-44399-5
eBook Packages: Springer Book Archive