Abstract
Representing the predicate argument structure of the medical domain (ASMD) is important for automatic text analyses. This work aims to describe the ASMD through verbs and transformational possibilities. Computer resources were constructed from 100 selected biomedical verbs (corpus CCM2009). Firstly, these were analyzed to determine the quantity and type of arguments, and classified these arguments into object classes (OC). Secondly, we established possible transformations for each ASMD. With this information, we created computer models on NooJ for the detection and automatic creation of ASMD in a corpus. This work involved the elaboration of electronic dictionaries, syntactic recognition, and generative grammars. The detection was performed on a corpus of 188,000 words conformed by texts from the gynecology and obstetrics area, achieving the following results: 100% accuracy, 96.92% coverage, and 98% F-measure. NooJ grammars provided grammatical sentences of each ASMD involving different transformations admitted by each particular class.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Pustejovsky, J.: The Generative Lexicon, 1st edn. MIT Press, Cambridge (1996)
Kim, Y., Meystre, S.: Ensemble method-based extraction of medication and related information from clinical texts. J. Am. Med. Inform. Assoc. 27(1), 31–38 (2020)
Yehia, E., Boshnak, H., AbdelGaber, S., Abdo, A., Elzanfaly, D.: Ontology-based clinical information extraction from physician’s free-text. J. Biomed. Inform. 98, 1–7 (2019)
Gross, M.: Méthodes en syntaxe. Hermann, Paris (1996)
Gross, G.: Manual de análisis lingüístico. Aproximación sintáctico-semántica al léxico. Editorial UOC, Barcelona (2014)
Silberztein, M.: Formalizing natural languages. The NooJ approach. ISTE, London (2016)
Grishman, R.: Twenty-five years of information extraction. Nat. Lang. Eng. 6, 677–692 (2019)
Chen, L., et al.: Clinical trial cohort selection based on multi-level rule-based natural language processing system. J. Am. Inf. Assoc. 26(11), 1218–1226 (2019)
Messina, S., Langella, A.: Paraphrases V<-> in one class of psychological predicates. In: Monti, J., Monteleone, M., di Buono, M. (eds.) Formalizing Natural Languages with NooJ 2014, pp. 140–149. Newcastle, Cambridge (2015)
Real Academia Nacional de Medicina: Diccionario de Términos Médico. Editorial Panamericana, Buenos Aires (2012)
Real Academia Española: Diccionario esencial de la lengua española. Espasa Calpe, Madrid
Snomed CT: Guía de introducción a Snomed CT. https://confluence.ihtsdotools.org/display/DOCSTARTES. Accessed 24 Sept 2021
Burdiles, G.: Descripción de la organización del género Caso Clínico de la medicina a partir del corpus CCM-2009. Ph. D. thesis. Pontificia Universidad Católica de Valparaíso, Valparaíso (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Koza, W., Suy, C. (2021). Automatic Detection and Generation of Argument Structures Within the Medical Domain. In: Bigey, M., Richeton, A., Silberztein, M., Thomas, I. (eds) Formalizing Natural Languages: Applications to Natural Language Processing and Digital Humanities. NooJ 2021. Communications in Computer and Information Science, vol 1520. Springer, Cham. https://doi.org/10.1007/978-3-030-92861-2_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-92861-2_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92860-5
Online ISBN: 978-3-030-92861-2
eBook Packages: Computer ScienceComputer Science (R0)