Abstract
This text proposes a method for automatic analysis of predicates for discovery (PD) in Spanish. A PD is a predicative unit that projects an argument structure (AS) whose meaning alludes to ‘something that is found by someone -or something- somewhere’ (e.g., ‘encontrar’, ‘hallar’). This type of task is useful in fields such as medicine, since it offers the possibility of automatically identifying findings of interest (diseases, test results, etc.) in large text corpora. The present work is based on Lexicon Grammar (LG), which proposes a formalization from the nature of arguments (object classes) and transformational possibilities. The methodology is carried out as follows: (i) manual identification of PDs from a corpus of gynecology and obstetrics; (ii) elaboration of LG tables for each PD, where object classes are categorized and possible transformations are listed; and (iii) computational modeling. For the last stage, electronic dictionaries and computer-generated grammars were built in NooJ. The algorithm with automatically detected and generated ASs from PDs (325 grammatical sentences) was evaluated against an annotated corpus (1000 manually-annotated sentences, randomly extracted from a corpus of 5 million words). Results gave 98% accuracy, 88% coverage, and 92% F-measure.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Gross, M.: Méthodes en syntaxe, 1st edn. Hermann, Paris (1996)
Silberztein, M.: Formalizing Natural Languages. The NooJ Approach. 1st. edn. ISTE, London (2016)
Messina, S., Langella, A.: Paraphrases V<-> in one class of psychological predicates. In: Monti, J., Monteleone, M., di Buono, M. (eds.) Formalizing Natural Languages with NooJ 2014, pp. 140–149. Newcastle, Cambridge (2015)
Tolone, E.: Conversión de las tablas del Léxico-Gramática del francés en el léxico LGLex. In: 2nd Argentinian Workshop on Natural Language Processing. Universidad de Córdoba, Córdoba (2012)
Palma, S.: Hacia un enfoque semántico de las expresiones idiomáticas. In: Coursera, J., Dijan, M., Gaspara, A. (eds.) La lingüística francesa. Situación y perspectivas a finales del siglo XX, pp. 313–321. Prensas de la Universidad de Zaragoza, Zaragoza (1994)
Gross, G.: Manual de análisis lingüístico. Aproximación sintáctico-semántica al léxico. 1st. edn. Editorial UOC, Barcelona (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Jacobsen, J., Koza, W., Muñoz, M., Saiz, F. (2021). Formalizing Predicates for Discovery Under the Lexicon Grammar Framework. In: Bigey, M., Richeton, A., Silberztein, M., Thomas, I. (eds) Formalizing Natural Languages: Applications to Natural Language Processing and Digital Humanities. NooJ 2021. Communications in Computer and Information Science, vol 1520. Springer, Cham. https://doi.org/10.1007/978-3-030-92861-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-92861-2_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92860-5
Online ISBN: 978-3-030-92861-2
eBook Packages: Computer ScienceComputer Science (R0)