Automatic Semantic Labeling of Medical Texts with Feature Structures

Mykowiecka, Agnieszka; Marciniak, Małgorzata

doi:10.1007/978-3-642-23538-2_7

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6836)

Included in the following conference series:

International Conference on Text, Speech and Dialogue

Abstract

This paper presents the results of testing two approaches to the automatic semantic labeling of medical data. For a chosen domain (diabetic patients’ discharge records), a set of domain-related concepts was identified. The annotated resource is produced by the first application, which is rule-based: it relies on the output of two related rule-based information extraction (IE) systems, post-processed so that the label structures are simpler and the annotation boundaries more precise. The second application is a machine learning approach based on conditional random fields (CRF), which uses the results of the first application as training data. Both applications were evaluated by comparing their output with manually corrected documents.
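
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes annotations are flat, non-overlapping token spans with a single label each, whereas the paper labels text with feature structures. It also assumes BIO tagging as the CRF training format and exact span matching as the evaluation criterion, neither of which the abstract states. The tokens, the label names `HBA1C` and `DIAB_TYPE`, and the helpers `spans_to_bio` and `prf` are hypothetical.

```python
from typing import List, Tuple

# Assumed representation: (start_token, end_token_exclusive, label).
Span = Tuple[int, int, str]

def spans_to_bio(n_tokens: int, spans: List[Span]) -> List[str]:
    """Turn non-overlapping token spans into one BIO tag per token."""
    tags = ["O"] * n_tokens
    for start, end, label in spans:
        tags[start] = "B-" + label
        for i in range(start + 1, end):
            tags[i] = "I-" + label
    return tags

def prf(predicted: List[Span], gold: List[Span]) -> Tuple[float, float, float]:
    """Exact-match precision, recall and F1 over labeled spans."""
    pred_set, gold_set = set(predicted), set(gold)
    tp = len(pred_set & gold_set)
    precision = tp / len(pred_set) if pred_set else 0.0
    recall = tp / len(gold_set) if gold_set else 0.0
    f1 = 2 * precision * recall / (precision + recall) if tp else 0.0
    return precision, recall, f1

# Hypothetical example data, not taken from the paper.
tokens = ["HbA1c", "8.5", "%", ",", "diabetes", "type", "2"]
rule_based = [(0, 3, "HBA1C"), (4, 6, "DIAB_TYPE")]  # rule-based output
gold = [(0, 3, "HBA1C"), (4, 7, "DIAB_TYPE")]        # manually corrected

# Step 1: rule-based annotations become per-token CRF training labels.
training_tags = spans_to_bio(len(tokens), rule_based)

# Step 2: any system's spans are scored against the corrected documents.
precision, recall, f1 = prf(rule_based, gold)
```

In this sketch the second rule-based span ends one token too early, so it does not count as a match. A CRF trained on `training_tags` would inherit such boundary errors, which is one reason to evaluate both systems against manually corrected documents rather than against each other.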

Author information

Authors and Affiliations

Institute of Computer Science, Polish Academy of Sciences, J. K. Ordona 21, 01-237, Warsaw, Poland
Agnieszka Mykowiecka & Małgorzata Marciniak

Editor information

Editors and Affiliations

Department of Computer Sciences, University of West Bohemia, Univerzitní 22, 306 14, Pilsen, Czech Republic
Ivan Habernal
Faculty of Applied Sciences, Dept. of Computer Science and Engineering, University of West Bohemia, Univerzitní 8, 306 14, Pilsen, Czech Republic
Václav Matoušek

About this paper

Cite this paper

Mykowiecka, A., Marciniak, M. (2011). Automatic Semantic Labeling of Medical Texts with Feature Structures. In: Habernal, I., Matoušek, V. (eds) Text, Speech and Dialogue. TSD 2011. Lecture Notes in Computer Science, vol 6836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23538-2_7

DOI: https://doi.org/10.1007/978-3-642-23538-2_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23537-5
Online ISBN: 978-3-642-23538-2
eBook Packages: Computer Science, Computer Science (R0)