Abstract
Liver disease contributes to major morbidities and mortalities globally. Around 2 million people die annually worldwide from liver diseases. Diagnosing liver disease is usually made by measuring levels of biomarkers that could be enzymes linked to liver function tests and/or proteins. However, not often the levels of these biomarkers change in cases of liver diseases. The majority of diagnosis based on biomarkers occur when the liver is partially or fully damaged. On the other hand, early diagnosis of the disease can prevent further complication, decrease the burden on healthcare systems and save lives. Subsequently, this research evaluated machine learning algorithms for early prediction of liver disease. These algorithms included logistic regression, decision tree, random forest, adaptive boosting, extreme gradient boost, support vector machine and Naïve Bayes and were applied to a dataset of patients with and without liver diseases after data preprocessing. Metrics for evaluation included accuracy, precision, recall, AUC-ROC and F1-score. Based on these metrics and after hyperparameter tuning, support vector machine, random forest, adaptive gradient boost and extreme gradient boost showed to be the best performing models. However, the performance of these models did not exclude overfitting that could be related to the low sample size of the dataset. Future work involves applying these algorithms to a larger sample size of patients and more features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Williams R (2006) Global challenges in liver disease. Hepatology 44(3):521–526
Asrani SK, Devarbhavi H, Eaton J, Kamath PS (2019) Burden of liver diseases in the world. J Hepatol 70(1):151–171. https://doi.org/10.1016/j.jhep.2018.09.014
Limdi JK, Hyde GM (2003) Evaluation of abnormal liver function tests. Postgrad Med J 79(932):307–312
Ghosh M, Raihan MMS, Raihan M, Akter L, Bairagi AK, Alshamrani SS, Masud M (2021) A comparative analysis of machine learning algorithms to predict liver disease. Intell Autom Soft Comput 30(3)
Vijayarani S, Dhayanand S (2015) Liver disease prediction using SVM and Naïve Bayes algorithms. Int J Sci, Eng Technol Res (IJSETR) 4(4):816–820
Priya MB, Juliet PL, Tamilselvi PR (2018) Performance analysis of liver disease prediction using machine learning algorithms. Int Res J Eng Technol 5(1):206–211
Rahman AS, Shamrat FJM, Tasnim Z, Roy J, Hossain SA (2019) A comparative study on liver disease prediction using supervised machine learning algorithms. Int J Sci Technol Res 8(11):419–422
Wu CC, Yeh WC, Hsu WD, Islam MM, Nguyen PA, Poly TN, Wang YC, Yang HC, Li YC (2019) Prediction of fatty liver disease using machine learning algorithms. Comput Meth Programs Biomed 170:23–29
Mostafa F, Hasan E, Williamson M, Khan H (2021) Statistical machine learning approaches to liver disease prediction. Livers 1(4):294–312
Gupta S, Karanth G, Pentapati N, Prasad VB (2020) A web based framework for liver disease diagnosis using combined machine learning models. In: 2020 international conference on smart electronics and communication (ICOSEC). IEEE, pp 421–428
Vats V, Zhang L, Chatterjee S, Ahmed S, Enziama E, Tepe K (2018) A comparative analysis of unsupervised machine techniques for liver disease prediction. In: 2018 IEEE international symposium on signal processing and information technology (ISSPIT). IEEE, pp 486–489
University of California Irvine machine learning repository (2023). Available at: https://archive.ics.uci.edu/datasets. Accessed 26 August 2023
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357
LaValley MP (2008) Logistic regression. Circulation 117(18):2395–2399
Kingsford C, Salzberg SL (2008) What are decision trees? Nat Biotechnol 26(9):1011–1013
Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H, Chen K, Mitchell R, Cano I, Zhou T (2015) Xgboost: extreme gradient boosting. R package version 0.4–2, 1(4):1–4
Khan F, Ahamed J, Kadry S, Ramasamy LK (2020) Detecting malicious URLs using binary classification through ada boost algorithm. Int J Electr Comput Eng (2088–8708) 10(1)
Rish I (2001) An empirical study of the naive Bayes classifier. IJCAI 2001 Workshop Empirical Meth Artif Intell 3(22):41–46
Germani G, Theocharidou E, Adam R, Karam V, Wendon J, O’Grady J, Burra P, Senzolo M, Mirza D, Castaing D, Klempnauer J, Burroughs AK (2012) Liver transplantation for acute liver failure in Europe: outcomes over 20 years from the ELTR database. J Hepatol 57(2):288–296
Alizargar A, Chang YL, Tan TH (2023) Performance comparison of machine learning approaches on Hepatitis C prediction employing data mining techniques. Bioengineering 10(4):481
Tanwar N, Rahman KF (2021) Machine learning in liver disease diagnosis: current progress and future opportunities. In: IOP conference series: materials science and engineering, vol 1022, no 1. IOP Publishing, p 012029
Pal M (2005) Random forest classifier for remote sensing classification. Int J Remote Sens 26(1):217–222
Pisner DA, Schnyer DM (2020) Support vector machine. In: Machine learning. Academic Press, pp 101–121
Acknowledgements
The authors would like to thank UITAR International University for supporting this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Geddam, S., Assi, S., Naghavipour, H., Jayabalan, M., Al-Hamid, A., Al-Jumeily OBE, D. (2024). Evaluation of Machine Learning Algorithms for Early Prediction of Liver Disease. In: Bee Wah, Y., Al-Jumeily OBE, D., Berry, M.W. (eds) Data Science and Emerging Technologies. DaSET 2023. Lecture Notes on Data Engineering and Communications Technologies, vol 191. Springer, Singapore. https://doi.org/10.1007/978-981-97-0293-0_37
Download citation
DOI: https://doi.org/10.1007/978-981-97-0293-0_37
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0292-3
Online ISBN: 978-981-97-0293-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)