Evaluating and Improving Annotation Tools for Medical Forms

Lin, Ying-Chi; Christen, Victor; Groß, Anika; Cardoso, Silvio Domingos; Pruski,  Cédric; Da Silveira, Marcos; Rahm, Erhard

doi:10.1007/978-3-319-69751-2_1

Ying-Chi Lin¹⁶,
Victor Christen¹⁶,
Anika Groß¹⁶,
Silvio Domingos Cardoso^17,18,
Cédric Pruski¹⁷,
Marcos Da Silveira¹⁷ &
…
Erhard Rahm¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 10649))

Included in the following conference series:

International Conference on Data Integration in the Life Sciences

604 Accesses

Abstract

The annotation of entities with concepts from standardized terminologies and ontologies is of high importance in the life sciences to enhance semantic interoperability, information retrieval and meta-analysis. Unfortunately, medical documents such as clinical forms or electronic health records are still rarely annotated despite the availability of some tools to automatically determine possible annotations. In this study, we comparatively evaluate the quality of two such tools, cTAKES and MetaMap, as well as of a recently proposed annotation approach from our group for annotating medical forms. We also investigate how to improve the match quality of the tools by post-filtering computed annotations as well as by combining several annotation approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Annotating Medical Forms Using UMLS

Semantic annotation in biomedicine: the current landscape

Article Open access 22 September 2017

A Reuse-Based Annotation Approach for Medical Documents

Notes

1.
https://medical-data-models.org.
2.
https://sites.google.com/site/shareclefehealth/.
3.
Clinical Text Analysis and Knowledge Extraction System http://ctakes.apache.org.
4.
Unstructured Information Management Architecture [16] https://uima.apache.org.
5.
Medical Dictionary for Regulatory Activities.
6.
Open-access and Collaborative (OAC) Consumer Health Vocabulary (CHV).
7.
US Extension to Systematized Nomenclature of Medicine-Clinical Terms.

References

Abedi, V., Zand, R., Yeasin, M., Faisal, F.E.: An automated framework for hypotheses generation using literature. BioData Min. 5(1), 13 (2012)
Article Google Scholar
Aronson, A.R., Lang, F.M.: An overview of MetaMap: historical perspective and recent advances. J. Am. Med. Inform. Assoc. 17(3), 229–236 (2010)
Article Google Scholar
Campos, D., Matos, S., Oliveira, J.: Current methodologies for biomedical named entity recognition. In: Biological Knowledge Discovery Handbook: Preprocessing, Mining, and Postprocessing of Biological Data, pp. 839–868 (2013)
Google Scholar
Christen, V., Groß, A., Rahm, E.: A reuse-based annotation approach for medical documents. In: Groth, P., Simperl, E., Gray, A., Sabou, M., Krötzsch, M., Lecue, F., Flöck, F., Gil, Y. (eds.) ISWC 2016. LNCS, vol. 9981, pp. 135–150. Springer, Cham (2016). doi:10.1007/978-3-319-46523-4_9
Chapter Google Scholar
Christen, V., Groß, A., Varghese, J., Dugas, M., Rahm, E.: Annotating medical forms using UMLS. In: Ashish, N., Ambite, J.-L. (eds.) DILS 2015. LNCS, vol. 9162, pp. 55–69. Springer, Cham (2015). doi:10.1007/978-3-319-21843-4_5
Chapter Google Scholar
Dai, M., Shah, N.H., Xuan, W., Musen, M.A., Watson, S.J., Athey, B.D., Meng, F., et al.: An efficient solution for mapping free text to ontology terms. In: AMIA Summit on Translational Bioinformatics 21 (2008)
Google Scholar
Doan, S., Conway, M., Phuong, T.M., Ohno-Machado, L.: Natural language processing in biomedicine: a unified system architecture overview. In: Trent, R. (ed.) Clinical Bioinformatics. Methods in Molecular Biology (Methods and Protocols), vol 1168, pp. 275–294. Humana Press, New York (2014)
Google Scholar
Dugas, M., Neuhaus, P., Meidt, A., Doods, J., Storck, M., Bruland, P., Varghese, J.: Portal of medical data models: information infrastructure for medical research and healthcare. Database: The Journal of Biological Databases and Curation p. bav121 (2016)
Google Scholar
Friedman, C., Shagina, L., Lussier, Y., Hripcsak, G.: Automated encoding of clinical documents based on natural language processing. J. Am. Med. Inform. Assoc. 11(5), 392–402 (2004)
Article Google Scholar
Funk, C., Baumgartner, W., Garcia, B., Roeder, C., Bada, M., Cohen, K.B., Hunter, L.E., Verspoor, K.: Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters. BMC Bioinform. 15(1), 1–29 (2014)
Article Google Scholar
Heinemann, F., Huber, T., Meisel, C., Bundschus, M., Leser, U.: Reflection of successful anticancer drug development processes in the literature. Drug Discovery Today 21(11), 1740–1744 (2016)
Article Google Scholar
Humphrey, S.M., Rogers, W.J., Kilicoglu, H., Demner-Fushman, D., Rindflesch, T.C.: Word sense disambiguation by selecting the best semantic type based on Journal Descriptor Indexing: Preliminary experiment. J. Am. Soc. Inform. Sci. Technol. 57(1), 96–113 (2006)
Article Google Scholar
LePendu, P., Iyer, S., Fairon, C., Shah, N.H., et al.: Annotation analysis for testing drug safety signals using unstructured clinical notes. J. Biomed. Semant. 3(S-1), S5 (2012)
Google Scholar
McCray, A.T., Srinivasan, S., Browne, A.C.: Lexical methods for managing variation in biomedical terminologies. In: Proceedings of the Annual Symposium on Computer Application in Medical Care, pp. 235–239 (1994)
Google Scholar
Oellrich, A., Collier, N., Smedley, D., Groza, T.: Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes. PLoS ONE 10(1), e0116040 (2015)
Article Google Scholar
Savova, G.K., Masanz, J.J., Ogren, P.V., Zheng, J., Sohn, S., Kipper-Schuler, K.C., Chute, C.G.: Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. J. Am. Med. Inform. Assoc. 17(5), 507–513 (2010)
Article Google Scholar
Shah, N.H., Bhatia, N., Jonquet, C., Rubin, D., Chiang, A.P., Musen, M.A.: Comparison of concept recognizers for building the open biomedical annotator. BMC Bioinform. 10(Suppl. 9), S14–S14 (2009)
Article Google Scholar
Sohn, S., Kocher, J.P.A., Chute, C.G., Savova, G.K.: Drug side effect extraction from clinical narratives of psychiatry and psychology patients. J. Am. Med. Inform. Assoc. 18(Suppl. 1), i144–i149 (2011)
Article Google Scholar
Sohn, S., Savova, G.K.: Mayo clinic smoking status classification system: extensions and improvements. In: AMIA Annual Symposium Proceedings, pp. 619–623 (2009)
Google Scholar
Tanenblatt, M.A., Coden, A., Sominsky, I.L.: The ConceptMapper approach to named entity recognition. In: Proceedings of 7th Language Resources and Evaluation Conference (LREC), pp. 546–551 (2010)
Google Scholar
Tseytlin, E., Mitchell, K., Legowski, E., Corrigan, J., Chavan, G., Jacobson, R.S.: NOBLE-Flexible concept recognition for large-scale biomedical natural language processing. BMC Bioinform. 17(1), 32 (2016)
Article Google Scholar
University of Pittsburgh: TIES-Text Information Extraction System (2017). http://ties.dbmi.pitt.edu/
Zheng, J., Chapman, W.W., Miller, T.A., Lin, C., Crowley, R.S., Savova, G.K.: A system for coreference resolution for the clinical narrative. J. Am. Med. Inform. Assoc. 19(4), 660 (2012)
Article Google Scholar
Zou, Q., Chu, W.W., Morioka, C., Leazer, G.H., Kangarloo, H.: Indexfinder: a knowledge-based method for indexing clinical texts. In: AMIA Annual Symposium Proceedings, pp. 763–767 (2003)
Google Scholar

Download references

Acknowledgment

This work is funded by the German Research Foundation (DFG) (grant RA 497/22-1, “ELISA - Evolution of Semantic Annotations”), German Federal Ministry of Education and Research (BMBF) (grant 031L0026, “Leipzig Health Atlas”) and National Research Fund Luxembourg (FNR) (grant C13/IS/5809134).

Author information

Authors and Affiliations

Department of Computer Science, Universität Leipzig, Leipzig, Germany
Ying-Chi Lin, Victor Christen, Anika Groß & Erhard Rahm
LIST, Luxembourg Institute of Science and Technology, Esch-sur-Alzette, Luxembourg
Silvio Domingos Cardoso, Cédric Pruski & Marcos Da Silveira
LRI, University of Paris-Sud XI, Orsay, France
Silvio Domingos Cardoso

Authors

Ying-Chi Lin
View author publications
You can also search for this author in PubMed Google Scholar
Victor Christen
View author publications
You can also search for this author in PubMed Google Scholar
Anika Groß
View author publications
You can also search for this author in PubMed Google Scholar
Silvio Domingos Cardoso
View author publications
You can also search for this author in PubMed Google Scholar
Cédric Pruski
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Da Silveira
View author publications
You can also search for this author in PubMed Google Scholar
Erhard Rahm
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ying-Chi Lin .

Editor information

Editors and Affiliations

Luxembourg Institute of Science and Technology, Esch-sur-Alzette, Luxembourg
Marcos Da Silveira
Luxembourg Institute of Science and Technology, Esch-sur-Alzette, Luxembourg
Cédric Pruski
Université du Luxembourg, Esch-sur-Alzette, Luxembourg
Reinhard Schneider

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lin, YC. et al. (2017). Evaluating and Improving Annotation Tools for Medical Forms. In: Da Silveira, M., Pruski, C., Schneider, R. (eds) Data Integration in the Life Sciences. DILS 2017. Lecture Notes in Computer Science(), vol 10649. Springer, Cham. https://doi.org/10.1007/978-3-319-69751-2_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-69751-2_1
Published: 24 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69750-5
Online ISBN: 978-3-319-69751-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Evaluating and Improving Annotation Tools for Medical Forms

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Annotating Medical Forms Using UMLS

Semantic annotation in biomedicine: the current landscape

A Reuse-Based Annotation Approach for Medical Documents

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Evaluating and Improving Annotation Tools for Medical Forms

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Annotating Medical Forms Using UMLS

Semantic annotation in biomedicine: the current landscape

A Reuse-Based Annotation Approach for Medical Documents

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation