Skip to main content

Unsupervised Clustering to Reduce Overfitting Issues in Ensemble Deep Learning Models for Skin Lesion Classifications

  • Conference paper
  • First Online:
Proceedings of the 12th International Conference on Robotics, Vision, Signal Processing and Power Applications (RoViSP 2021)

Abstract

Class imbalance in skin lesion cancer is pronounced, which in turn impacts the performance of deep learning models for classification tasks. In the case of using dermoscopic images, training a convolutional neural network-based ensemble deep learning model on 7 classes from the International Skin Imaging Collaboration (ISIC) dataset 2018 with the 2 minor classes as malignant yields decent performance in classifying skin lesions. However, the predictive ability of the model on unseen data, ISIC 2019 is unsatisfactory. To narrow the gap of overall accuracy between the training and testing tasks, Part 1 of this study conducts 2 different sampling techniques: augmentation and clustering algorithms. The results of this study—Part 2, show that 4 samples taken from Self-Organizing Maps, Hierarchical and K-Means techniques have narrower gaps of the overall accuracies between the training and testing sets. The best-balanced results of the overall accuracies for the training and testing sets are 82 and 43%, respectively, if one cluster of melanocytic nevi from hierarchical clustering is used, while augmentation gives 92 and 40%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Maron R, Weichenthal M, Utikal JS, Hekler A, Berking C, Hauschild A, Enk AH, Haferkamp S, Klode J, Schdendorf D, Jansen P, Holland-Lets T, Schilling B, Kalle CV, Fröhiling S, Gaiser MR, Hartmann D, Gesierich A, Kähler KC, Wehkamp U, Thiem A (2019) Systematic outperformance of 112 dermatologists in multiclass skin cancer image classification by convolutional neural networks. Eur J Cancer 119:57–65

    Article  Google Scholar 

  2. Popescu D, El-Khatib M, El-Khatib H, Ichim L (2022) New trends in melanoma detection using neural networks: a systematic review. Sensors 22(2):496

    Article  Google Scholar 

  3. Nguyen TQ, Jancic-Turner T, Kaur A, Naguib RNG, Sakim HAM (2022) Sampling methods to balance classes in dermoscopic skin lesion images

    Google Scholar 

  4. Harangi B (2018) Skin lesion classification with ensembles of deep convolutional neural networks. J Biomed Inform 86:25–32

    Article  Google Scholar 

  5. Subasi A (2020) Machine learning techniques, practical machine learning for data analysis using Python. Academic Press, New York, pp 91–202

    Google Scholar 

  6. Leevy JL, Khoshgoftaar TM, Bauder RA, Seliya N (2018) A survey on addressing high-class imbalance in big data. J Big Data 5(1):42

    Article  Google Scholar 

  7. Fu’adah YN, Pratiwi NKC, Pramudito MA, Ibrahim N (2020) Convolutional neural network (CNN) for automatic skin cancer classification system. IOP Confer Ser 982(1):012005

    Article  Google Scholar 

  8. ISIC Challenge Datasets (2018) ISIC challenge page. https://challenge.isic-archive.com/data/#2018. Accessed 25 Feb 2023

  9. ISIC Challenge Datasets (2019) ISIC Challenge page. https://challenge.isic-archive.com/data/#2019. Accessed 25 Feb 2023

Download references

Acknowledgements

We sincerely appreciate the time that Dr. Nathan Jones spent on reviewing this manuscript and the insightful feedback that Prof. Raouf Naguib provided.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Quynh T. Nguyen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kaur, A., Jancic-Turner, T., Nguyen, Q.T., Vatts, S., Sakim, H.A.M. (2024). Unsupervised Clustering to Reduce Overfitting Issues in Ensemble Deep Learning Models for Skin Lesion Classifications. In: Ahmad, N.S., Mohamad-Saleh, J., Teh, J. (eds) Proceedings of the 12th International Conference on Robotics, Vision, Signal Processing and Power Applications. RoViSP 2021. Lecture Notes in Electrical Engineering, vol 1123. Springer, Singapore. https://doi.org/10.1007/978-981-99-9005-4_52

Download citation

Publish with us

Policies and ethics