Abstract
This paper deals with breast cancer diagnostic and prognostic estimations employing neural networks over the Wisconsin Breast Cancer datasets, which consist of measurements taken from breast cancer microscopic instances. A probabilistic approach is dedicated to solve the diagnosis problem, detecting malignancy among instances derived from the Fine Needle Aspirate test, while regression algorithms estimate the time interval that possibly correspond to the right end-point of the patients’ disease-free survival time or the time where the tumour recurs (time-to-recur). For the diagnosis problem, the accuracy of the neural network in terms of sensitivity and specificity was measured at 98.6 and 97.5% respectively, using the leave-one-out test method. As far as the prognosis problem is concerned, the accuracy of the neural network was measured through a stratified tenfold cross-validation approach. Sensitivity ranged between 80.5 and 91.8%, while specificity ranged between 91.9 and 97.9%, depending on the tested fold and the partition of the predicted period. The prognostic recurrence predictions were then further evaluated using survival analysis and compared with other techniques found in literature.
Similar content being viewed by others
References
Abbass HA (2002) An evolutionary artificial neural networks approach for breast cancer diagnosis. Artif Intell Med 25(3):265–281
Abbass HA, Towsey M, Finn GD (2001) C-net, a method for generating non-deterministic and dynamic multivariate decision trees. Knowl Inf Syst 3:184–197
Adamczak R, Duch W (1997) New developments in the feature space mapping model. In: 3rd Conference on neural networks and their applications, October 14–18
Bennett KP, Blue J (1997) A support vector machine approach to decision trees. In: R.P.I Math Report No. 97–100. Rensselaer Polytechnic Institute, Troy
Burke HB, Goodman PH (1997) Artificial neural networks improve the accuracy of cancer survival prediction. Cancer 79:857–862
Chen D, Chang RF, Huang YL (2000) Breast cancer diagnosis using self-organizing map for sonography. Ultrasound Med Biol 26:405–11
Cheng HD, Lui YM, Freimanis RI (1998) IEEE Trans Med Imaging 17(3):442–450
Choong PL, deSilva CJS (1996) Entropy maximization networks: an application to breast cancer prognosis. IEEE Trans Neural Netw 7(3):568–577
Choong PL, deSilva CJS (1998) Maximum entropy estimation vs. multivariate logistic regression: which should be used for the analysis of small binary outcome data sets? In: Proceedings of the 20th annual international conference of the IEEE Engineering in Medicine and Biology Society, vol 3, pp 1602–1605
Duch W, Adamczak R, Grabczewski K (2000) A new methodology of extraction, optimization and application of crisp and fuzzy logical rules. IEEE Trans Neural Netw 11(2):1–31
Emmanouilidis C, Hunter A, MacIntyre J, Cox C (1999) Multiple-criteria genetic algorithms for feature selection in neurofuzzy modelling. In: Proceedings of the IEEE international joint conference on neural networks (IJCNN’99), Washington, pp 4387–4392
Fonseca CM, Fleming PJ (1998) Multiobjective optimization and multiple constraint handling with evolutionary algorithms—part I a unified formulation. IEEE Trans Syst Man Cybern A Syst Hum 28(1):26–37
Garg AX, Adhikari NK, McDonald H et al (2005) Effects of computerized clinical decision support systems on practitioner performance and patient outcomes: a systematic review. JAMA 293(10):1223–1238
Giger M, Huo Z, Kupinski M, Vyborny C (2000) Computer-aided diagnosis in mammography. In: Sonka M, Fitzpatrick J (eds) Handbook of medical imaging, medical image processing and analysis, vol 2. SPIE Press, pp 917–986
Hamilton HJ, Shan N, Cercone N (1996) RIAC: a rule induction algorithm based on approximate classification. Technical report CS 96-06, Regina University
Horn J, Nafpliotis N, Goldberg DE (1994) A niched Pareto genetic algorithm for multiobjective optimisation. In: Proceedings of the IEEE conference on evolutionary computation, ICEC’94, vol 1, pp 82–87
Hoya T, Chambers JA (2001) Heuristic pattern correction scheme using adaptively trained generalized regression neural networks. IEEE Trans Neural Netw 12(1):91–100
Huo Z, Giger M, Vyborny C, Wolverton D, Schmidt R, Doi K (1998) Automated computerized classification of malignant and benign mass lesions on digital mammograms. Acad Radiol 5:155–168
Jankowski N, Kadirkamanathan V (1997) Statistical control of RBF-like networks for classification. In: Proceedings of the 7th international conference on artificial neural networks, Lausanne, pp 385–390
Jiang Y, Nishikawa R, Wolverton D, Metz C, Giger ML, Schmidt R, Doi K (1996) Automated feature analysis and classification of malignant and benign microcalcifications. Radiology 198:671–678
Kaban A, Girolami M (2000) Initialized and guided EM-clustering of sparse binary data with application to text based documents. In: 15th International conference on pattern recognition, vol 2, pp 744–747
Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Wermter S, Riloff E, Scheler G (eds) 14th International joint conference on artificial intelligence (IJCAI). Morgan Kaufman, San Francisco, pp 1137–1145
Liu B, Abbass HA, McKay B (2004) Classification rule discovery with ant colony optimisation. IEEE Comput Intell Bull 3(1):31–35
Mangasarian OL, Street WN, Wolberg WH (1995) Breast cancer diagnosis and prognosis via linear programming, Operat Res 43(4):570–577
Masters T (1995) Advanced algorithms for neural networks. Wiley, New York
Pendharkar PC, Rodger JA, Yaverbaum GJ, Herman N, Benner M (1999) Association, statistical, mathematical and neural approaches for mining breast cancer patterns. Exp Syst Appl 17:223–232
Seker H, Odetayo M, Petrovic D, Naguib RNG, Bartoli C, Alasio L, Lakshmi MS, Sherbet GV (2000) A fuzzy measurement-based assessment of breast cancer prognostic markers. In: Proceedings of the 2000 IEEE EMBS international conference on information technology applications in biomedicine, pp 174–178
Setiono R (2000) Generating concise and accurate classification rules for breast cancer diagnosis. Artif Intell Med 18:205–219
Shang N, Breiman L (1996) Distribution based trees are more accurate. In: Proceedings of the international conference on neural information processing, Hong Kong, vol 1, pp 133–138
Specht DF (1990) Probabilistic neural networks. Neural Netw 3(1):109–118
Specht DF (1996) Probabilistic neural networks and general regression neural networks. In: Chen CH (eds) Fuzzy logic and neural network handbook. McGraw-Hill, New York
Ster B, Dobnikar A (1996) Neural networks in medical diagnosis: Comparison with other methods. In: Bulsari A et al (eds) Proceedings of the international conference EANN ‘96, pp 427–430
Street WN (1998) A neural network model for prognostic prediction. In: Proceedings of the 15th international conference on machine learning. Morgan Kaufmann Publishers Inc., San Francisco, pp 540–546
Street WN, Mangasarian OL, Wolberg WH (1995) An inductive learning approach to prognostic prediction. In: 12th International conference on machine learning, pp 522–530
Taylor P, Fox J, Todd-Pokropek A (1998) Evaluation of a decision aid for the classification of microcalcifications. Digital mammography. Kluwer, Nijmegen, pp 237–244
Tourassi GD, Markey MK, Lo JY, Floyd CE Jr (2001) A neural network approach to breast cancer diagnosis as a constraint satisfaction problem. Med Phys 28:804–811
Wang TC, Karayiannis NB (1998) Detection of microcalcifications in digital mammograms using wavelets. IEEE Trans Med Imaging 17(4):498–509
Wojnarski M (2003) LTF-C, architecture, training algorithm and applications of new neural classifier. Fundam Informaticae 54(1):89–105
Wolberg WH, Street WN, Mangasarian OL (1994) Machine learning techniques to diagnose breast cancer from fine-needle aspirates, Cancer Lett 77:163–171
Wolberg WH, Street WN, Mangasarian OL (1995) Image analysis and machine learning applied to breast cancer diagnosis and prognosis. Anal Quant Cytol Histol 17(2):77–87
Wolberg WH, Street WN, Heisey DM, Mangasarian OL (1995) Computer-derived nuclear features distinguish malignant from benign breast cytology. Hum Pathol 26:792–796
http://www.seer.cancer.gov/cgi-bin/csr/1975_2001/search.pl#results. Estimated new cancer cases and deaths for 2004
http://www.cancernet.nci.nih.gov/. U.S. National Institutes of Health, National Cancer Institute
http://www.seer.cancer.gov. United States National Cancer Institute, Surveillance, Epidemiology and End Results (SEER) program
http://www.ftp.ics.uci.edu/pub/machine-learning-databases/breast-cancer-wisconsin/. Wisconsin Diagnostic Breast Cancer (WDBC) Dataset and Wisconsin Prognostic Breast Cancer (WPBC) Dataset
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Anagnostopoulos, I., Maglogiannis, I. Neural network-based diagnostic and prognostic estimations in breast cancer microscopic instances. Med Bio Eng Comput 44, 773–784 (2006). https://doi.org/10.1007/s11517-006-0079-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11517-006-0079-4