Automatic Classification of Text Documents Presenting Radiology Examinations

Kłos, Monika; Żyłkowski, Jarosław; Spinczyk, Dominik

doi:10.1007/978-3-319-91211-0_43

Monika Kłos¹⁸,
Jarosław Żyłkowski¹⁹ &
Dominik Spinczyk¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 762))

Included in the following conference series:

International Conference on Information Technologies in Biomedicine

472 Accesses

Abstract

The paper presents the classification of text documents presenting radiology examinations, taking into consideration two groups: cases with aneurysms and those without it. A database containing descriptions of 1284 cases was classified using the maximum entropy algorithm and frequent phrase extraction. It was revealed that the best method was the classifier using the maximum entropy algorithm based on nouns. The best result obtained was 90% of sensitivity and 70% of specificity. The worse diagnostic capacity demonstrates frequent phrase extraction algorithm. The other classifiers turned out to be less effective, than the random ones.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Spinczyk, D., Dzieciatko, M.: Similarity search for the content of the medical records. In: Information Technologies in Medicine. Advances in Intelligent Systems and Computing, vol. 471, pp. 489–501 (2016)
Google Scholar
Ahmad, M.: Machine learning approach to text mining: a review. J. Adv. Res. Comput. Sci. Softw. Eng. 4, 1125–1131 (2014)
Google Scholar
Berger, A., Pietra, V., Pietra, S.: A maximum entropy approach to natural language processing. Comput. Linguist. 22, 39–71 (1996)
Google Scholar
Botist, T., Nguyen, M., Woo, E., Markatou, M., Ball, R.: Text mining for the vaccine adverse event reporting system: medical text classifiaction using informative feature selection. J. Am. Med. Inform. Assoc. 18, 631–638 (2011)
Article Google Scholar
Khachidze, M., Tsintsadze, M., Archuadze, M.: Natural language processing based instrument for classification of free text medical records. BioMed. Res. Int. 2016, 10 (2016)
Article Google Scholar
Ningam, K., Lafferty, J., McCallum A.: Using maximum entropy for text classification (1999)
Google Scholar
Ningam, K., McCallum, A., Thrun, A., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39, 103–134 (2000)
Article Google Scholar
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Association for Computational Linguistics, vol. 22 (2002)
Google Scholar

Download references

Acknowledgement

This research was supported by the Polish Ministry of Science and Silesian University of Technology statutory financial support partially by grant No. BK-200/RIB1/2016 and grant No. BK-200/RIB1/2017.

Author information

Authors and Affiliations

Faculty of Biomedical Engineering, Silesian University of Technology, Roosevelta 40, 41-800, Zabrze, Poland
Monika Kłos & Dominik Spinczyk
Second Department of Clinical Radiology, Medical University of Warsaw, Banacha 1a, 02-097, Warszawa, Poland
Jarosław Żyłkowski

Authors

Monika Kłos
View author publications
You can also search for this author in PubMed Google Scholar
Jarosław Żyłkowski
View author publications
You can also search for this author in PubMed Google Scholar
Dominik Spinczyk
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dominik Spinczyk .

Editor information

Editors and Affiliations

Faculty of Biomedical Engineering, Silesian University of Technology, Zabrze, Poland
Ewa Pietka
Faculty of Biomedical Engineering, Silesian University of Technology, Zabrze, Poland
Pawel Badura
Faculty of Biomedical Engineering, Silesian University of Technology, Zabrze, Poland
Jacek Kawa
Faculty of Biomedical Engineering, Silesian University of Technology, Zabrze, Poland
Wojciech Wieclawek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kłos, M., Żyłkowski, J., Spinczyk, D. (2019). Automatic Classification of Text Documents Presenting Radiology Examinations. In: Pietka, E., Badura, P., Kawa, J., Wieclawek, W. (eds) Information Technology in Biomedicine. ITIB 2018. Advances in Intelligent Systems and Computing, vol 762. Springer, Cham. https://doi.org/10.1007/978-3-319-91211-0_43

Download citation

DOI: https://doi.org/10.1007/978-3-319-91211-0_43
Published: 06 June 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91210-3
Online ISBN: 978-3-319-91211-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics