Skip to main content

Diagnosis of Hepatitis C Patients via Machine Learning Approach: XGBoost and Isolation Forest

  • Conference paper
  • First Online:
Proceedings of the Future Technologies Conference (FTC) 2022, Volume 1 (FTC 2022 2022)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 559))

Included in the following conference series:

  • 739 Accesses

Abstract

Although the transmission of Hepatitis C through blood transfusion is getting less and less prevalent with the use of anti-HCV tests for blood donors, availability and practice of screening remain low in developing countries. This results in Hepatitis C patients who are unaware of their condition until it worsens to become chronic liver diseases that are diagnosed through more costly or invasive methods–liver biopsy and radiology scans. Due to these limitations of the current methods of diagnosis, this study seeks to develop a machine learning model to diagnose patients with different stages of liver disease: hepatitis c, liver fibrosis, and cirrhosis. In this research, machine learning algorithms were applied to a dataset containing HCV patient information, and the algorithms were evaluated for their accuracy and performance in classifying the patients with the proper diagnosis. Findings from the study indicated that XGBoost can most accurately classify patients with an accuracy score of 95.48, but other algorithms used had high accuracy scores as well: the algorithm with the lowest accuracy score–Decision Tree–still had a score of 92.66. The second experiment also showed that the Isolation Forest algorithm could detect and isolate the suspect blood donors of the data with a relatively high accuracy of 93.22%. As both experiments of the study yielded a machine learning model of high accuracy, the algorithms used can be implemented into a diagnostic kit for liver disease to be used in developing countries where accessibility to current diagnosis tools is limited.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Sebastiani, G.: Chronic hepatitis C and liver fibrosis. World J. Gastroenterol. 20(32), 11033–11053 (2014)

    Article  Google Scholar 

  2. Healthline. https://www.healthline.com/health/hepatitis-c-fibrosis-score#fibrosis-score. Accessed 15 Jan 2022

  3. NHS website. https://www.nhs.uk/conditions/hepatitis-c/diagnosis/. Accessed 15 Jan 2022

  4. Selvarajah, S., Busch, M.P.: Transfusion transmission of HCV, a long but successful road map to safety. Antivir. Ther. 17(7 Pt B), 1423–1429 (2012)

    Article  Google Scholar 

  5. Prati, D.: Transmission of hepatitis C virus by blood transfusions and other medical procedures: a global review. J. Hepatol. 45(4), 607–616 (2006)

    Article  MathSciNet  Google Scholar 

  6. Frija, G., et al.: How to improve access to medical imaging in low- and middle-income countries? EClinicalMedicine 38, 101034 (2021)

    Article  Google Scholar 

  7. Bajpai, M., Gupta, E., Choudhary, A.: Hepatitis C virus: screening, diagnosis, and interpretation of laboratory assays. Asian J. Transf. Sci. 8(1), 19 (2014)

    Article  Google Scholar 

  8. Akella, A., Akella, S. Applying machine learning to evaluate for fibrosis in chronic hepatitis C. MedRxiv (2020)

    Google Scholar 

  9. Abd El-Salam, S.M., et al.: Performance of machine learning approaches on prediction of esophageal varices for Egyptian chronic hepatitis C patients. Inform. Med. Unlocked 17, 100267 (2019)

    Article  Google Scholar 

  10. Chicco, D., Jurman, G.: An ensemble learning approach for enhanced classification of patients with hepatitis and cirrhosis. IEEE Access 9, 24485–24498 (2021)

    Article  Google Scholar 

  11. Ahammed, K., Satu, M.S., Khan, M.I., Whaiduzzaman, M.: Predicting Infectious state of hepatitis C virus affected patient’s applying machine learning methods. In: 2020 IEEE Region 10 Symposium (TENSYMP) (2020)

    Google Scholar 

  12. UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/HCV+data. Accessed 17 Jan 2022

  13. Chen, T., Guestrin, C.: XGBoost. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016)

    Google Scholar 

  14. Cheon, M.J., Lee, D.H., Joo, H.S., Lee, O.: Deep learning based hybrid approach of detecting fraudulent transactions. J. Theor. Appl. Inf. Technol. 99(16), 4044–4054 (2021)

    Google Scholar 

  15. Liu, F.T., Ting, K.M., Zhou, Z.H.: Isolation forest. In: 2008 Eighth IEEE International Conference on Data Mining (2008)

    Google Scholar 

  16. Ferreira, P., Le, D. C., Zincir-Heywood, N.: Exploring feature normalization and temporal information for machine learning based insider threat detection. In: 2019 15th International Conference on Network and Service Management (CNSM) (2019)

    Google Scholar 

  17. CDC. https://www.cdc.gov/hepatitis/statistics/2019surveillance/Figure3.6.htm. Accessed 17 Jan 2022

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ting Sun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sun, T. (2023). Diagnosis of Hepatitis C Patients via Machine Learning Approach: XGBoost and Isolation Forest. In: Arai, K. (eds) Proceedings of the Future Technologies Conference (FTC) 2022, Volume 1. FTC 2022 2022. Lecture Notes in Networks and Systems, vol 559. Springer, Cham. https://doi.org/10.1007/978-3-031-18461-1_43

Download citation

Publish with us

Policies and ethics