Abstract
The paper presents the classification of text documents presenting radiology examinations, taking into consideration two groups: cases with aneurysms and those without it. A database containing descriptions of 1284 cases was classified using the maximum entropy algorithm and frequent phrase extraction. It was revealed that the best method was the classifier using the maximum entropy algorithm based on nouns. The best result obtained was 90% of sensitivity and 70% of specificity. The worse diagnostic capacity demonstrates frequent phrase extraction algorithm. The other classifiers turned out to be less effective, than the random ones.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Spinczyk, D., Dzieciatko, M.: Similarity search for the content of the medical records. In: Information Technologies in Medicine. Advances in Intelligent Systems and Computing, vol. 471, pp. 489–501 (2016)
Ahmad, M.: Machine learning approach to text mining: a review. J. Adv. Res. Comput. Sci. Softw. Eng. 4, 1125–1131 (2014)
Berger, A., Pietra, V., Pietra, S.: A maximum entropy approach to natural language processing. Comput. Linguist. 22, 39–71 (1996)
Botist, T., Nguyen, M., Woo, E., Markatou, M., Ball, R.: Text mining for the vaccine adverse event reporting system: medical text classifiaction using informative feature selection. J. Am. Med. Inform. Assoc. 18, 631–638 (2011)
Khachidze, M., Tsintsadze, M., Archuadze, M.: Natural language processing based instrument for classification of free text medical records. BioMed. Res. Int. 2016, 10 (2016)
Ningam, K., Lafferty, J., McCallum A.: Using maximum entropy for text classification (1999)
Ningam, K., McCallum, A., Thrun, A., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Mach. Learn. 39, 103–134 (2000)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Association for Computational Linguistics, vol. 22 (2002)
Acknowledgement
This research was supported by the Polish Ministry of Science and Silesian University of Technology statutory financial support partially by grant No. BK-200/RIB1/2016 and grant No. BK-200/RIB1/2017.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Kłos, M., Żyłkowski, J., Spinczyk, D. (2019). Automatic Classification of Text Documents Presenting Radiology Examinations. In: Pietka, E., Badura, P., Kawa, J., Wieclawek, W. (eds) Information Technology in Biomedicine. ITIB 2018. Advances in Intelligent Systems and Computing, vol 762. Springer, Cham. https://doi.org/10.1007/978-3-319-91211-0_43
Download citation
DOI: https://doi.org/10.1007/978-3-319-91211-0_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91210-3
Online ISBN: 978-3-319-91211-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)