Abstract
Breast cancer is one of the most common cancers among females. Patients with breast cancer are regularly rising. The survival of patients can be improved by early diagnosis and treatment. Because of its success, machine learning is commonly used in most fields. In this paper, numerous methods for early detection of this disease are employed for machine learning. Here, we consider C 5.0, Naive Bayes, logistic regression, random forest, ctree, KNN, K-Mean, GBM, adaBoost, decision tree model to classify the breast cancer tumor and evaluate their performances based on Wisconsin and SEER datasets. The demonstrations of the classifiers were assessed using accuracy, precision, recall, and F1 measure. We also predict whether the tumor is dead or alive, considering the tumor size, the various cancer stages, and months’ survival. The general research will increase people’s understanding of breast cancer and reduce tumor fears.















Similar content being viewed by others
References
Abdar M, Zomorodi-Moghadam M, Zhou X, Gururajan R, Tao X, Barua PD, Gururajan R (2020) A new nested ensemble technique for automated diagnosis of breast cancer. Pattern Recognit Lett 132:123–131
Araújo T, Aresta G, Castro E, Rouco J, Aguiar P, Eloy C, Polónia A, Campilho A (2017) Classification of breast cancer histology images using convolutional neural networks. PloS One 12(6):e0177544
Asri H, Mousannif H, Al Moatassime H, Noel T (2016) Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Comput Sci 83:1064–1069
Bayramoglu N, Kannala J, Heikkilä J (2016) Deep learning for magnification independent breast cancer histopathology image classification. In: 2016 23rd International conference on pattern recognition (icpr), pp 2440–2445
Chou Y-H, Tiu C-M, Hung G-S, Wu S-C, Chang TY, Chiang HK (2001) Stepwise logistic regression analysis of tumor contour features for breast ultrasound diagnosis. Ultrasound Med Bio 27(11):1493–1498
Drukker K, Giger ML, Vyborny CJ, Mendelson EB (2004) Computerized detection and classification of cancer on breast ultrasound1. Acad Radiology 11(5):526–535
Dumitru D (2009) Prediction of recurrent events in breast cancer using the naive bayesian classification. Annals Univ Craiova-Math Comput Sci Series 36 (2):92–96
Dwivedi AK (2018) Analysis of computational intelligence techniques for diabetes mellitus prediction. Neural Comput Appl 30(12):3837–3845
Dwivedi AK, Chouhan U (2018) Multilayer perceptron and evolutionary radial basis function neural network models for discrimination of hiv-1 genomes. Current Sci 115(11):2063
Díaz-Uriarte R, De Andres SA (2006) Gene selection and classification of microarray data using random forest. BMC Bioinforma 7(1):3
Han SS, Kim MS, Lim W, Park GH, Park I, Chang SE (2018) Classification of the clinical images for benign and malignant cutaneous tumors using a deep learning algorithm. J Investigative Derma 138(7):1529–1538
Huang Q, Chen Y, Liu L, Tao D, Li X (2019) On combining biclustering mining and adaboost for breast tumor classification. IEEE Trans Knowl Data Eng 32(4):728–738
Jerez JM, Franco L, Alba E, Llombart-Cussac A, Lluch A, Ribelles N, Munarriz B, Martin M (2005) Improvement of breast cancer relapse prediction in high risk intervals using artificial neural networks. Breast Cancer Res Treatment 94(3):265–272
Jerez-Aragonés JM, Gómez-Ruiz JA, Ramos-Jiménez G, Muñoz-Pérez J, Alba-Conejo E (2003) A combined neural network and decision trees model for prognosis of breast cancer relapse. Artif Intell Med 27(1):45–63
Kim W, Kim KS, Lee JE, Noh D-Y, Kim S-W, Jung YS, Park MY, Park RW (2012) Development of novel breast cancer recurrence prediction model using support vector machine. J Breast Cancer 15(2):230–238
Kumar GR, Ramachandra GA, Nagamani K (2013) An efficient prediction of breast cancer data using data mining techniques. Int J Innovations Eng Technol (IJIET) 2(4):139
Kuo W-J, Chang R-F, Chen D-R, Lee CC (2001) Data mining with decision trees for diagnosis of breast tumor in medical ultrasonic images. Breast Cancer Res Treatment 66(1):51–57
Lawrence RL, Wright A (2001) Rule-based classification systems using classification and regression tree (cart) analysis. Photogrammetric Eng Remote Sensing 67(10):1137–1142
Liu C-Y, Hsu C-Y, Chou Y-H, Chen C-M (2014) A multi-scale tumor detection algorithm in whole breast sonography incorporating breast anatomy and tissue morphological information. In: 2014 Ieee healthcare innovation conference (hic), pp 193–196
Polat K, Güneş S (2007) Breast cancer diagnosis using least square support vector machine. Digital Signal Process 17(4):694–701
Punitha S, Amuthan A, Joseph KS (2018) Benign and malignant breast cancer segmentation using optimized region growing technique. Future Comput Inform J 3(2):348–358
Ridgeway G (2007) Generalized boosted models: a guide to the gbm package. Upyear 1(1):2007
Spanhol FA, Oliveira LS, Petitjean C, Heutte L (2015) A dataset for breast cancer histopathological image classification. IEEE Trans Biomed Eng 63 (7):1455–1462
Subashini TS, Ramalingam V, Palanivel S (2009) Breast mass classification based on cytological patterns using rbfnn and svm. Expert Syst Appl 36 (3):5284–5290
Varma C, Sawant O (2018) An alternative approach to detect breast cancer using digital image processing techniques. In: 2018 International conference on communication and signal processing (iccsp), pp 0134–0137
Yousefi M, Krzyżak A, Suen CY (2018) Mass detection in digital breast tomosynthesis data using convolutional neural networks and multiple instance learning. Comput Biology Med 96:283–293
Zhou L, Zhang Z, Chen Y-C, Zhao Z-Y, Yin X-D, Jiang H-B (2019) A deep learning-based radiomics model for differentiating benign and malignant renal tumors. Transl Oncol 12(2):292–300
Funding
Did not get any kind of funding for this work.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
There is no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Singh, D., Nigam, R., Mittal, R. et al. Information retrieval using machine learning from breast cancer diagnosis. Multimed Tools Appl 82, 8581–8602 (2023). https://doi.org/10.1007/s11042-022-13550-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13550-3