Medical Entity and Relation Extraction from Narrative Clinical Records in Italian Language

Diomaiuta, Crescenzo; Mercorella, Maria; Ciampi, Mario; De Pietro, Giuseppe

doi:10.1007/978-3-319-59480-4_13

Crescenzo Diomaiuta⁷,
Maria Mercorella⁷,
Mario Ciampi⁷ &
…
Giuseppe De Pietro⁷

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 76))

Included in the following conference series:

International Conference on Intelligent Interactive Multimedia Systems and Services

1730 Accesses

Abstract

Applying Natural Language Processing techniques enables to unlock precious information contained in free text clinical reports. In this paper, we propose a system able to annotate medical entities in narrative records. Considering that existing NLP systems mainly concern entity recognition in English language, we propose an NLP pipeline to manage clinical free text in Italian. The overall architecture includes a spell checker, sentence detector, word tokenizer, part-of-speech tagger, dictionary lookup annotator, and parsing rules annotator. Essentially, it uses a rule-based approach to extract relevant concepts regarding patient’s conditions, administered medications, or performed procedures, detecting their attributes, negated forms, and relations expressions. The indexing of the documents allows the user to retrieve relevant information, increasing his/her medical knowledge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

FAROO spelling correction (2016). http://blog.faroo.com/category/spelling-correction/
IBM watson explorer (2016). https://www.ibm.com/us-en/marketplace/content-analytics
Mongo database (2016). https://www.mongodb.com/
Snowball resources (2016). http://snowball.tartarus.org/
UIMA home (2016). https://uima.apache.org/
UMLS documentation (2016). https://www.nlm.nih.gov/research/umls/
Alicante, A., Corazza, A., Isgrò, F., Silvestri, S.: Unsupervised entity and relation extraction from clinical records in italian. Comput. Biol. Med. 72, 263–275 (2016)
Article Google Scholar
Attardi, G., Cozza, V., Sartiano, D.: Adapting linguistic tools for the analysis of Italian medical records (2014)
Google Scholar
Attardi, G., Cozza, V., Sartiano, D.: UniPi: Recognition of mentions of disorders in clinical text. In: Proceedings of the 8th International Workshop on Semantic Evaluation, pp. 754–760 (2014)
Google Scholar
Attardi, G., Cozza, V., Sartiano, D.: Annotation and extraction of relations from Italian medical records. In: IIR (2015)
Google Scholar
Byrd, R.J., Steinhubl, S.R., Sun, J., Ebadollahi, S., Stewart, W.F.: Automatic identification of heart failure diagnostic criteria, using text analysis of clinical notes from electronic health records. Int. J. Med. Informatics 83(12), 983–992 (2014)
Article Google Scholar
De Bruijn, B., Martin, J.: Getting to the (c)ore of knowledge: mining biomedical literature. Int. J. Med. Informatics 67(1), 7–18 (2002)
Article Google Scholar
Doan, S., Conway, M., Phuong, T.M., Ohno-Machado, L.: Natural language processing in biomedicine: a unified system architecture overview. In: Clinical Bioinformatics, pp. 275–294 (2014)
Google Scholar
Esuli, A., Marcheggiani, D., Sebastiani, F.: An enhanced CRFs-based system for information extraction from radiology reports. J. Biomed. Inform. 46(3), 425–435 (2013)
Article Google Scholar
Friedman, C., Shagina, L., Lussier, Y., Hripcsak, G.: Automated encoding of clinical documents based on natural language processing. J. Am. Med. Inform. Assoc. 11(5), 392–402 (2004)
Article Google Scholar
Garla, V., Re, V.L., Dorey-Stein, Z., Kidwai, F., Scotch, M., Womack, J., Justice, A., Brandt, C.: The yale cTAKES extensions for document classification: architecture and application. J. Am. Med. Inform. Assoc. 18(5), 614–620 (2011)
Article Google Scholar
Hardeniya, N.: NLTK Essentials. Packt Publishing Ltd. (2015)
Google Scholar
Johnson, S.B., Bakken, S., Dine, D., Hyun, S., Mendonça, E., Morrison, F., Bright, T., Van Vleck, T., Wrenn, J., Stetson, P.: An electronic health record based on structured narrative. J. Am. Med. Inform. Assoc. 15(1), 54–64 (2008)
Article Google Scholar
Kunze, M., Rösner, D.: UIMA for NLP based researchers workplaces in medical domains. In: Towards Enhanced Interoperability for Large HLT Systems: UIMA for NLP, p. 20 (2008)
Google Scholar
Lin, C.H., Lai, W.S., Lee, L.H., Tsao, H.M., Liou, D.M.: An entry generation pipeline for converting free-text medical document into clinical document architecture document with entry-level. In: 2014 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), pp. 505–508. IEEE (2014)
Google Scholar
McCray, A.T., Aronson, A.R., Browne, A.C., Rindflesch, T.C., Razi, A., Srinivasan, S.: UMLS knowledge for biomedical language processing. Bull. Med. Libr. Assoc. 81(2), 184 (1993)
Google Scholar
Meystre, S.M., Savova, G.K., Kipper-Schuler, K.C., Hurdle, J.F., et al.: Extracting information from textual documents in the electronic health record: a review of recent research. Yearb. Med. Inform. 35(128), 44 (2008)
Google Scholar
Reyes-Ortiz, J.A., González-Beltrán, B.A., Gallardo-López, L.: Clinical decision support systems: a survey of NLP-based approaches from unstructured data. In: 2015 26th International Workshop on Database and Expert Systems Applications (DEXA), pp. 163–167. IEEE (2015)
Google Scholar
Savova, G.K., Masanz, J.J., Ogren, P.V., Zheng, J., Sohn, S., Kipper-Schuler, K.C., Chute, C.G.: Mayo clinical text analysis and knowledge extraction system (cTAKES): architecture, component evaluation and applications. J. Am. Med. Inform. Assoc. 17(5), 507–513 (2010)
Article Google Scholar
Skeppstedt, M., Kvist, M., Nilsson, G.H., Dalianis, H.: Automatic recognition of disorders, findings, pharmaceuticals and body structures from clinical text: an annotation and machine learning study. J. Biomed. Inform. 49, 148–158 (2014)
Article Google Scholar

Download references

Author information

Authors and Affiliations

National Research Council of Italy, Institute of High Performance Computing and Networking - ICAR, Via Pietro Castellino 111, 80131, Naples, Italy
Crescenzo Diomaiuta, Maria Mercorella, Mario Ciampi & Giuseppe De Pietro

Authors

Crescenzo Diomaiuta
View author publications
You can also search for this author in PubMed Google Scholar
Maria Mercorella
View author publications
You can also search for this author in PubMed Google Scholar
Mario Ciampi
View author publications
You can also search for this author in PubMed Google Scholar
Giuseppe De Pietro
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Maria Mercorella .

Editor information

Editors and Affiliations

National Research Council of Italy (CNR-ICAR), Institute for High-Performance Computing and Networking, Naples, Italy
Giuseppe De Pietro
National Research Council of Italy (CNR-ICAR), Institute for High-Performance Computing and Networking, Naples, Italy
Luigi Gallo
Fern Barrow, Bournemouth University, Poole, Dorset, United Kingdom
Robert J. Howlett
University of Canberra, Canberra, Aust Capital Terr, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Diomaiuta, C., Mercorella, M., Ciampi, M., De Pietro, G. (2018). Medical Entity and Relation Extraction from Narrative Clinical Records in Italian Language. In: De Pietro, G., Gallo, L., Howlett, R., Jain, L. (eds) Intelligent Interactive Multimedia Systems and Services 2017. KES-IIMSS-18 2018. Smart Innovation, Systems and Technologies, vol 76. Springer, Cham. https://doi.org/10.1007/978-3-319-59480-4_13

Download citation

DOI: https://doi.org/10.1007/978-3-319-59480-4_13
Published: 28 May 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59479-8
Online ISBN: 978-3-319-59480-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics