Skip to main content

Formalizing Predicates for Discovery Under the Lexicon Grammar Framework

  • Conference paper
  • First Online:
Formalizing Natural Languages: Applications to Natural Language Processing and Digital Humanities (NooJ 2021)

Abstract

This text proposes a method for automatic analysis of predicates for discovery (PD) in Spanish. A PD is a predicative unit that projects an argument structure (AS) whose meaning alludes to ‘something that is found by someone -or something- somewhere’ (e.g., ‘encontrar’, ‘hallar’). This type of task is useful in fields such as medicine, since it offers the possibility of automatically identifying findings of interest (diseases, test results, etc.) in large text corpora. The present work is based on Lexicon Grammar (LG), which proposes a formalization from the nature of arguments (object classes) and transformational possibilities. The methodology is carried out as follows: (i) manual identification of PDs from a corpus of gynecology and obstetrics; (ii) elaboration of LG tables for each PD, where object classes are categorized and possible transformations are listed; and (iii) computational modeling. For the last stage, electronic dictionaries and computer-generated grammars were built in NooJ. The algorithm with automatically detected and generated ASs from PDs (325 grammatical sentences) was evaluated against an annotated corpus (1000 manually-annotated sentences, randomly extracted from a corpus of 5 million words). Results gave 98% accuracy, 88% coverage, and 92% F-measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Gross, M.: Méthodes en syntaxe, 1st edn. Hermann, Paris (1996)

    Google Scholar 

  2. Silberztein, M.: Formalizing Natural Languages. The NooJ Approach. 1st. edn. ISTE, London (2016)

    Google Scholar 

  3. Messina, S., Langella, A.: Paraphrases V<-> in one class of psychological predicates. In: Monti, J., Monteleone, M., di Buono, M. (eds.) Formalizing Natural Languages with NooJ 2014, pp. 140–149. Newcastle, Cambridge (2015)

    Google Scholar 

  4. Tolone, E.: Conversión de las tablas del Léxico-Gramática del francés en el léxico LGLex. In: 2nd Argentinian Workshop on Natural Language Processing. Universidad de Córdoba, Córdoba (2012)

    Google Scholar 

  5. Palma, S.: Hacia un enfoque semántico de las expresiones idiomáticas. In: Coursera, J., Dijan, M., Gaspara, A. (eds.) La lingüística francesa. Situación y perspectivas a finales del siglo XX, pp. 313–321. Prensas de la Universidad de Zaragoza, Zaragoza (1994)

    Google Scholar 

  6. Gross, G.: Manual de análisis lingüístico. Aproximación sintáctico-semántica al léxico. 1st. edn. Editorial UOC, Barcelona (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Walter Koza .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jacobsen, J., Koza, W., Muñoz, M., Saiz, F. (2021). Formalizing Predicates for Discovery Under the Lexicon Grammar Framework. In: Bigey, M., Richeton, A., Silberztein, M., Thomas, I. (eds) Formalizing Natural Languages: Applications to Natural Language Processing and Digital Humanities. NooJ 2021. Communications in Computer and Information Science, vol 1520. Springer, Cham. https://doi.org/10.1007/978-3-030-92861-2_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-92861-2_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-92860-5

  • Online ISBN: 978-3-030-92861-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics