Abstract
Breast cancer is one among the most frequent reasons of women’s death worldwide. Nowadays, healthcare informatics is mainly focussing on the classification of breast cancer images, due to the lethal nature of this cancer. There are chances of inter- and intra-observer variability that may lead to misdiagnosis in the detection of cancer. This study proposed an automatic breast cancer classification system that uses support vector machine (SVM) classifier based on integrated features (texture, geometrical, and color). The University of California Santa Barbara (UCSB) dataset and BreakHis dataset, which are available in public domain, were used. A classification comparison module which involves SVM, k-nearest neighbor (k-NN), random forest (RF), and artificial neural network (ANN) was also proposed to determine the classifier that best suits for the application of breast cancer detection from histopathology images. The performance of these classifiers was analyzed against metrics like accuracy, specificity, sensitivity, balanced accuracy, and F-score. Results showed that among the classifiers, the SVM classifier performed better with a test accuracy of approximately 90% on both the datasets. Additionally, the significance of the proposed integrated SVM model was statistically analyzed against other classifier models.
Graphical abstract
Similar content being viewed by others
References
Abiodun OI, Jantan A, Omolara AE, Dada KV, Mohamed NA, Arshad H (2018) State-of-the-art in artificial neural network applications: a survey. Heliyon 4(11):e00938
Acs B, Rantalainen M, Hartman J (2020) Artificial intelligence as the next step towards precision pathology. J Intern Med 288(1):62–81
Al-Sudani S, Palaniappan R (2019) Predicting students’ final degree classification using an extended profile. Educ Inf Technol 24:2357–2369
American Cancer Society, Cancer facts and figures 2018, ACS, Atlanta; 2018. https://www.cancer.org/research/cancer-facts-statistics/all-cancer-facts-figures/cancer-facts-figures-2018.html
Anuranjeeta A, Shukla KK, Tiwari A, Sharma S (2017) Classification of histopathological images of breast cancerous and non-cancerous cells based on morphological features. Biomed Pharmacol J 10(1):353–366
Asri H, Mousannif H, Al M, Noel T (2016) Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Comput Sci 8(3):1064–1069
Aswathy MA, Jagannath M (2020) Performance analysis of segmentation algorithms for the detection of breast cancer. Procedia Comput Sci 167:666–676
Bayramoglu N, Tiulpin A, Hirvasniemi J, Nieminen MT, Saarakkala S (2020) Adaptive segmentation of knee radiographs for selecting the optimal ROI in texture analysis. Osteoarthr Cartil 28(7):941–952
Cheng I, Sun X, Alsufyani N, Xiong Z, Major P, Basu A (2015) Ground truth delineation for medical image segmentation based on local consistency and distribution map analysis. In: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS), Milan, Italy, pp 3073–3076
Dhahri H, Al Maghayreh E, Mahmood A, Elkilani W, Nagi MF (2019) Automated breast cancer diagnosis based on machine learning algorithms. J Healthc Eng 2019:1–11
Diamond J, Anderson NH, Bartels PH, Montironi R, Hamilton PW (2004) The use of morphological characteristics and texture analysis in the identification of tissue composition in prostatic neoplasia. Hum Pathol 35(9):1121–1131
Doyle S, Hwang M, Shah K, Madabhushi A, Feldman M, Tomaszewski J (2007) Automated grading of prostate cancer using architectural and textural image features. In: 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, USA, pp 1284–1287
Duggirala RJ (2019) Segmenting images using hybridization of k-means and fuzzy c-means algorithms. In: Sud K, Erdogmus P, Kadry S (eds) Introduction to Data Science and Machine Learning. IntechOpen, India, pp 1–27
Ebrahim Ali EE, Feng WZ (2016) Breast cancer classification using support vector machine and neural network. Int J Sci Res 5(3):1–6
Freedman RA, Keating NL, Pace LE, Lii J, McCarthy EP, Schonberg MA (2017) Use of surveillance mammography among older breast cancer survivors by life expectancy. J Clin Oncol 35(27):3123–3130
Gelasca E, Jiyun B, Boguslaw O, Fedorov D, Kvilekval K, Manjunath BS (2008) Evaluation and benchmark for biological image segmentation. In: IEEE International Conference on Image Processing, USA, pp 1816–1819
Gupta V, Chaurasia V, Shandilya M (2015) Random-valued impulse noise removal using adaptive dual threshold median filter. J Vis Commun Image Represent 26:296–304
Gurcan MN, Boucheron LE, Can A, Madabhushi A, Rajpoot NM, Yener B (2009) Histopathological image analysis: a review. IEEE Rev Biomed Eng 2:147–171
Hornik K (1991) Approximation capabilities of multilayer feedforward networks. Neural Netw 4(2):251–257
Kalyani C, Ramudu K, Reddy RG (2018) A review on optimized k-means and FCM clustering techniques for biomedical image segmentation using level set formulation. Biomed Res 29(20):3660–3668
Leena JS, Baskaran S, Govardhan A (2013) A robust approach to classify microcalcification in digital mammograms using contourlet transform and support vector machine. Am J Eng Appl Sci 6(1):57–68
Li X, Plataniotis KN (2015) Color model comparative analysis for breast cancer diagnosis using H and E stained images. In: Gurcan MN, Madabhushi A (eds) Medical Imaging 2015: Digital Pathology, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, 9420, pp 94200L-1–94200L-6
Madabhushi A (2009) Digital pathology image analysis: opportunities and challenges. Imaging Med 1(1):7–10
Medina EA, Paredes JIP (2009) Artificial neural network modeling techniques applied to the hydrodesulfurization process. Math Comp Model 49:207–214
Melgani F, Lorenzo B (2004) Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans Geosci Remote Sens 42(8):1778–1790
Mohanty AK, Swain SK, Champati PK, Lenka SK (2011) Image mining for mammogram classification by association rule using statistical and GLCM features. International Journal of Computer Science Issues 8(5):309–318
Nam S, Chong Y, Jung CK, Kwak T, Lee JY, Park J, Rho MJ, Go H (2020) Introduction to digital pathology and computer-aided pathology. J Pathol Transl Med 54(2):125–134
Khan I, Luo Z, Shaikh AK, Hedjam R (2021) Ensemble clustering using extended fuzzy k-means for cancer data analysis. Expert Syst Appl 172:114622
Moteghaed NY, Tabatabaeefar M, Mostaar A (2020) Biomedical image denoising based on hybrid optimization algorithm and sequential filters. J Biomed Phys Eng 10(1):83–92
Ojala T, Pietikäinen M, Mäenpää T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987
Orlov N, Shamir L, Macura T, Johnston J, Eckley DM, Goldberg IG (2008) WND-CHARM: Multi-purpose image classification using compound image transforms. Pattern Recognit Lett 29(11):1684–1693
Pisano ED, Zong S, Hemminger BM, DeLuca M, Johnston RE, Muller K, Braeuning MP, Pizer SM (1998) Contrast limited adaptive histogram equalization image processing to improve the detection of simulated speculations in dense mammograms. J Digit Imaging 11(4):193–200
Shanmugapriya S, Valarmathi A (2018) Efficient fuzzy c-means based multilevel image segmentation for brain tumor detection in MR images. Des Autom Embed Syst 22(1–2):81–93
Qing L, Zhigang L, Shenghui Y, Kun J, Navid R (2020) Computer-aided breast cancer diagnosis based on image segmentation and interval analysis. Automatika 61(3):496–506
Rodriguez MZ, Comin CH, Casanova D, Bruno OM, Amancio DR, Costa LdF, Rodrigues FA (2019) Clustering algorithms: a comparative approach. PLoS One 14(1):e0210236
Roque AC, Andre TC (2002) Mammography and computerized decision systems: a review. Ann N Y Acad Sci 980:83–94
Shan P (2018) Image segmentation method based on K-mean algorithm. EURASIP J Image Video Proc 81(2018):1–9
Song Y, Li Q, Huang H, Feng D, Chen M, Cai W (2016) Histopathology image categorization with discriminative dimension reduction of fisher vectors. In: Hua G, Jégou H (eds) ECCV Workshops, Part I, Lecture Notes in Computer Science 9913, pp 306–317
Spanhol FA, Oliveira LS, Petitjean C, Heutte L (2016) A dataset for breast cancer histopathological image classification. IEEE Trans Biomed Eng 63(7): 1455–1462. https://web.inf.ufpr.br/vri/databases/breast-cancer-histopathological-database-breakhis/
Spanhol FA, Oliveira LS, Petitjean C, Heutte L (2016) Breast cancer histopathological image classification using convolutional neural networks. In: International Joint Conference on Neural Networks (IJCNN), Vancouver, pp. 2560–2567
Turkki R, Byckhov D, Lundin M et al (2019) Breast cancer outcome prediction with tumour tissue images and machine learning. Breast Cancer Res Treat 177(1):41–52
Zorluoglu G, Agaoglu M (2015) Diagnosis of breast cancer using ensemble of data mining classification methods. Int J Bioinform Biomed Eng 1(3):318–322
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Aswathy, M.A., Jagannath, M. An SVM approach towards breast cancer classification from H&E-stained histopathology images based on integrated features. Med Biol Eng Comput 59, 1773–1783 (2021). https://doi.org/10.1007/s11517-021-02403-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11517-021-02403-0