Abstract
Although the transmission of Hepatitis C through blood transfusion is getting less and less prevalent with the use of anti-HCV tests for blood donors, availability and practice of screening remain low in developing countries. This results in Hepatitis C patients who are unaware of their condition until it worsens to become chronic liver diseases that are diagnosed through more costly or invasive methods–liver biopsy and radiology scans. Due to these limitations of the current methods of diagnosis, this study seeks to develop a machine learning model to diagnose patients with different stages of liver disease: hepatitis c, liver fibrosis, and cirrhosis. In this research, machine learning algorithms were applied to a dataset containing HCV patient information, and the algorithms were evaluated for their accuracy and performance in classifying the patients with the proper diagnosis. Findings from the study indicated that XGBoost can most accurately classify patients with an accuracy score of 95.48, but other algorithms used had high accuracy scores as well: the algorithm with the lowest accuracy score–Decision Tree–still had a score of 92.66. The second experiment also showed that the Isolation Forest algorithm could detect and isolate the suspect blood donors of the data with a relatively high accuracy of 93.22%. As both experiments of the study yielded a machine learning model of high accuracy, the algorithms used can be implemented into a diagnostic kit for liver disease to be used in developing countries where accessibility to current diagnosis tools is limited.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sebastiani, G.: Chronic hepatitis C and liver fibrosis. World J. Gastroenterol. 20(32), 11033–11053 (2014)
Healthline. https://www.healthline.com/health/hepatitis-c-fibrosis-score#fibrosis-score. Accessed 15 Jan 2022
NHS website. https://www.nhs.uk/conditions/hepatitis-c/diagnosis/. Accessed 15 Jan 2022
Selvarajah, S., Busch, M.P.: Transfusion transmission of HCV, a long but successful road map to safety. Antivir. Ther. 17(7 Pt B), 1423–1429 (2012)
Prati, D.: Transmission of hepatitis C virus by blood transfusions and other medical procedures: a global review. J. Hepatol. 45(4), 607–616 (2006)
Frija, G., et al.: How to improve access to medical imaging in low- and middle-income countries? EClinicalMedicine 38, 101034 (2021)
Bajpai, M., Gupta, E., Choudhary, A.: Hepatitis C virus: screening, diagnosis, and interpretation of laboratory assays. Asian J. Transf. Sci. 8(1), 19 (2014)
Akella, A., Akella, S. Applying machine learning to evaluate for fibrosis in chronic hepatitis C. MedRxiv (2020)
Abd El-Salam, S.M., et al.: Performance of machine learning approaches on prediction of esophageal varices for Egyptian chronic hepatitis C patients. Inform. Med. Unlocked 17, 100267 (2019)
Chicco, D., Jurman, G.: An ensemble learning approach for enhanced classification of patients with hepatitis and cirrhosis. IEEE Access 9, 24485–24498 (2021)
Ahammed, K., Satu, M.S., Khan, M.I., Whaiduzzaman, M.: Predicting Infectious state of hepatitis C virus affected patient’s applying machine learning methods. In: 2020 IEEE Region 10 Symposium (TENSYMP) (2020)
UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/HCV+data. Accessed 17 Jan 2022
Chen, T., Guestrin, C.: XGBoost. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016)
Cheon, M.J., Lee, D.H., Joo, H.S., Lee, O.: Deep learning based hybrid approach of detecting fraudulent transactions. J. Theor. Appl. Inf. Technol. 99(16), 4044–4054 (2021)
Liu, F.T., Ting, K.M., Zhou, Z.H.: Isolation forest. In: 2008 Eighth IEEE International Conference on Data Mining (2008)
Ferreira, P., Le, D. C., Zincir-Heywood, N.: Exploring feature normalization and temporal information for machine learning based insider threat detection. In: 2019 15th International Conference on Network and Service Management (CNSM) (2019)
CDC. https://www.cdc.gov/hepatitis/statistics/2019surveillance/Figure3.6.htm. Accessed 17 Jan 2022
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sun, T. (2023). Diagnosis of Hepatitis C Patients via Machine Learning Approach: XGBoost and Isolation Forest. In: Arai, K. (eds) Proceedings of the Future Technologies Conference (FTC) 2022, Volume 1. FTC 2022 2022. Lecture Notes in Networks and Systems, vol 559. Springer, Cham. https://doi.org/10.1007/978-3-031-18461-1_43
Download citation
DOI: https://doi.org/10.1007/978-3-031-18461-1_43
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-18460-4
Online ISBN: 978-3-031-18461-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)