Abstract
Class imbalance in skin lesion cancer is pronounced, which in turn impacts the performance of deep learning models for classification tasks. In the case of using dermoscopic images, training a convolutional neural network-based ensemble deep learning model on 7 classes from the International Skin Imaging Collaboration (ISIC) dataset 2018 with the 2 minor classes as malignant yields decent performance in classifying skin lesions. However, the predictive ability of the model on unseen data, ISIC 2019 is unsatisfactory. To narrow the gap of overall accuracy between the training and testing tasks, Part 1 of this study conducts 2 different sampling techniques: augmentation and clustering algorithms. The results of this study—Part 2, show that 4 samples taken from Self-Organizing Maps, Hierarchical and K-Means techniques have narrower gaps of the overall accuracies between the training and testing sets. The best-balanced results of the overall accuracies for the training and testing sets are 82 and 43%, respectively, if one cluster of melanocytic nevi from hierarchical clustering is used, while augmentation gives 92 and 40%, respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Maron R, Weichenthal M, Utikal JS, Hekler A, Berking C, Hauschild A, Enk AH, Haferkamp S, Klode J, Schdendorf D, Jansen P, Holland-Lets T, Schilling B, Kalle CV, Fröhiling S, Gaiser MR, Hartmann D, Gesierich A, Kähler KC, Wehkamp U, Thiem A (2019) Systematic outperformance of 112 dermatologists in multiclass skin cancer image classification by convolutional neural networks. Eur J Cancer 119:57–65
Popescu D, El-Khatib M, El-Khatib H, Ichim L (2022) New trends in melanoma detection using neural networks: a systematic review. Sensors 22(2):496
Nguyen TQ, Jancic-Turner T, Kaur A, Naguib RNG, Sakim HAM (2022) Sampling methods to balance classes in dermoscopic skin lesion images
Harangi B (2018) Skin lesion classification with ensembles of deep convolutional neural networks. J Biomed Inform 86:25–32
Subasi A (2020) Machine learning techniques, practical machine learning for data analysis using Python. Academic Press, New York, pp 91–202
Leevy JL, Khoshgoftaar TM, Bauder RA, Seliya N (2018) A survey on addressing high-class imbalance in big data. J Big Data 5(1):42
Fu’adah YN, Pratiwi NKC, Pramudito MA, Ibrahim N (2020) Convolutional neural network (CNN) for automatic skin cancer classification system. IOP Confer Ser 982(1):012005
ISIC Challenge Datasets (2018) ISIC challenge page. https://challenge.isic-archive.com/data/#2018. Accessed 25 Feb 2023
ISIC Challenge Datasets (2019) ISIC Challenge page. https://challenge.isic-archive.com/data/#2019. Accessed 25 Feb 2023
Acknowledgements
We sincerely appreciate the time that Dr. Nathan Jones spent on reviewing this manuscript and the insightful feedback that Prof. Raouf Naguib provided.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kaur, A., Jancic-Turner, T., Nguyen, Q.T., Vatts, S., Sakim, H.A.M. (2024). Unsupervised Clustering to Reduce Overfitting Issues in Ensemble Deep Learning Models for Skin Lesion Classifications. In: Ahmad, N.S., Mohamad-Saleh, J., Teh, J. (eds) Proceedings of the 12th International Conference on Robotics, Vision, Signal Processing and Power Applications. RoViSP 2021. Lecture Notes in Electrical Engineering, vol 1123. Springer, Singapore. https://doi.org/10.1007/978-981-99-9005-4_52
Download citation
DOI: https://doi.org/10.1007/978-981-99-9005-4_52
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9004-7
Online ISBN: 978-981-99-9005-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)