Compositional Information Extraction Methodology from Medical Reports

Rani, Pratibha; Reddy, Raghunath; Mathur, Devika; Bandyopadhyay, Subhadip; Laha, Arijit

doi:10.1007/978-3-642-20152-3_30

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6588)

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

Abstract

The health care industry is currently undergoing a huge expansion in several respects, and advances in Clinical Informatics (CI) are an important part of that expansion. One goal of CI is to apply information technology to improve patient care through two major applications: electronic health care data management and information extraction from medical documents. This paper focuses on the second application. To manage and use information effectively, the important and relevant information buried in a huge corpus of unstructured text must be separated out according to context. Information Extraction (IE) from unstructured text is therefore a key technology in CI, covering sub-topics such as extraction of biomedical entities and relations, passage- and paragraph-level information extraction, ontological study of diseases and treatments, summarization, and topic identification. Although the literature reports promising results for individual IE tasks, an integrated approach to contextually relevant IE from medical documents is not readily apparent. To this end, we propose a compositional approach that integrates contextually constructed (domain-specific) IE modules to improve knowledge support for patient care activity. The input to this composite system is free-format medical case reports containing stage-wise information that follows the evolution path of a patient care activity. The output is a compilation of the extracted information organized under tags such as past medical history, sign/symptoms, test and test results, diseases, treatment and follow up. The outcome is intended to help health care professionals explore a large corpus of medical case studies and select only the relevant component-level information according to their need or interest.
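The input/output contract described in the abstract (free-format case report in, extracted text grouped under tags out) can be illustrated with a minimal sketch. This is not the authors' system: the abstract does not say how the modules are built, so the keyword patterns, the sentence splitter, and the names `keyword_module`, `MODULES`, `extract_report`, and `SAMPLE_REPORT` are all assumptions introduced for illustration. Only the tag names come from the abstract.

```python
import re
from typing import Callable, Dict, List

# An extractor takes the report text and returns the sentences it judges relevant.
Extractor = Callable[[str], List[str]]


def split_sentences(report: str) -> List[str]:
    """Naive sentence splitter (assumption): break after '.', '!' or '?'."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", report) if s.strip()]


def keyword_module(pattern: str) -> Extractor:
    """Hypothetical stand-in for a domain-specific IE module.

    It keeps every sentence that matches a keyword pattern; a real module
    could be a trained classifier, a dictionary lookup, and so on.
    """
    regex = re.compile(pattern, re.IGNORECASE)

    def extract(report: str) -> List[str]:
        return [s for s in split_sentences(report) if regex.search(s)]

    return extract


# One module per output tag. Tag names follow the abstract; patterns are invented.
MODULES: Dict[str, Extractor] = {
    "past medical history": keyword_module(r"\bhistory of\b"),
    "sign/symptoms": keyword_module(r"\bpresented with\b"),
    "test and test results": keyword_module(r"\b(?:ECG|MRI|CT|X-ray)\b"),
    "diseases": keyword_module(r"\bdiagnos(?:is|ed)\b"),
    "treatment": keyword_module(r"\btreated with\b"),
    "follow up": keyword_module(r"\bfollow[- ]up\b"),
}


def extract_report(report: str,
                   modules: Dict[str, Extractor] = MODULES) -> Dict[str, List[str]]:
    """Compose the modules: run each one and file its output under its tag."""
    return {tag: module(report) for tag, module in modules.items()}


# Invented example text, not taken from any real case report.
SAMPLE_REPORT = (
    "A 54-year-old man with a history of hypertension presented with chest pain. "
    "An ECG showed ST elevation. He was diagnosed with myocardial infarction. "
    "He was treated with aspirin. At six-month follow-up he remained well."
)

if __name__ == "__main__":
    for tag, sentences in extract_report(SAMPLE_REPORT).items():
        print(f"{tag}: {sentences}")
```

Because each module is just a function from text to extracted items, a module can be replaced or added without touching the others, which is one plausible reading of "compositional" here. A sentence may also appear under more than one tag, as the first sentence of the sample does.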



Author information

Authors and Affiliations

International Institute of Information Technology, Hyderabad, India
Pratibha Rani & Raghunath Reddy
SETLabs, Infosys Technologies Ltd., Hyderabad, India
Devika Mathur, Subhadip Bandyopadhyay & Arijit Laha


Editor information

Editors and Affiliations

Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong, China
Jeffrey Xu Yu
Department of Computer Science, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro (373-1 Guseong-dong), 305-701, Yuseong-gu, Daejeon, Korea
Myoung Ho Kim
Institute for Computer Science and Business Information Systems (ICB), University of Duisburg-Essen, Schützenbahn 70, 45117, Essen, Germany
Rainer Unland


About this paper

Cite this paper

Rani, P., Reddy, R., Mathur, D., Bandyopadhyay, S., Laha, A. (2011). Compositional Information Extraction Methodology from Medical Reports. In: Yu, J.X., Kim, M.H., Unland, R. (eds) Database Systems for Advanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20152-3_30


DOI: https://doi.org/10.1007/978-3-642-20152-3_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20151-6
Online ISBN: 978-3-642-20152-3
eBook Packages: Computer Science, Computer Science (R0)
