Abstract
In this paper, we applied Bayesian multi-layer perceptrons (MLP) using the evidence procedure to predict malignancy of ovarian masses in a large (n = 1,066) multi-centre data set. Automatic relevance determination (ARD) was used to select the most relevant inputs. Fivefold cross-validation (5CV) and repeated 5CV was used to select the optimal combination of input set and number of hidden neurons. Results indicate good performance of the models with area under the receiver operating characteristic curve values of 0.93–0.94 on independent data. Comparison with a linear benchmark model and a previously developed logistic regression model shows that the present problem is very well linearly separable. A resampling analysis further shows that the number of hidden neurons specified in the ARD analyses for input selection may influence model performance. This paper shows that Bayesian MLPs, although not frequently used, are a useful tool for detecting malignant ovarian tumours.






Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Jemal A, Siegel R, Ward E, Murray T, Xu J, Smigal C, Thun MJ (2007) Cancer statistics, 2007. CA Cancer J Clin 57:43–66
Vergote I, De Brabanter J, Fyles A, Bertelsen K, Einhorn N, Sevelda P, Gore ME, Karn J, Verrelst H, Sjovall K, Timmerman D, Vandewalle J, Van Gramberen M, Trope CG (2001) Prognostic factors in 1,545 patients with stage I invasive epithelial ovarium carcinoma: importance of degree of differential and cyst rupture in predicting relapse. Lancet 357:176–182
Jacobs I, Oram D, Fairbanks J, Turner J, Frost C, Grudzinskas JG (1990) A risk of malignancy index incorporating CA 125, ultrasound and menopausal status for the accurate preoperative diagnosis of ovarian cancer. BJOG 97:922–929
Tingulstad S, Hagen B, Skjeldestad FE, Onsrud M, Kiserud T, Halvorsen T, Nustad K (1996) Evaluation of a risk of malignancy index based on serum CA125, ultrasound findings and menopausal status in the pre-operative diagnosis of pelvic masses. BJOG 103:826–831
Alcázar JL, Mercé LT, Laparte C, Jurado M, López-García G (2003) A new scoring system to differentiate benign from malignant adnexal masses. Am J Obstet Gynecol 188:685–692
Sassone M, Timor-Tritsch IE, Artner A, Westhoff C, Warren W (1991) Transvaginal sonographic characterization of ovarian disease: evaluation of a new scoring system to predict ovarian malignancy. Obstet Gynecol 78:70–76
Lerner JP, Timor-Tritsch IE, Federman A, Abramovich G (1994) Transvaginal ultrasonographic characterization of ovarian masses with an improved, weighted scoring system. Am J Obstet Gynecol 170:81–85
Tailor A, Jurkovic D, Bourne TH, Collins WP, Campbell S (1997) Sonographic prediction of malignancy in adnexal masses using multivariate logistic regression analysis. Ultrasound Obstet Gynecol 10:41–47
Timmerman D, Bourne TH, Tailor A, Collins WP, Verrelst H, Vandenberghe K, Vergote I (1999) A comparison of methods for preoperative discrimination between malignant and benign adnexal masses: the development of a new logistic regression model. Am J Obstet Gynecol 181:57–65
Alcázar JL, Jurado M (1998) Using a logistic model to predict malignancy of adnexal masses based on menopausal status, ultrasound morphology, and color Doppler findings. Gynecol Oncol 69:146–150
Tailor A, Jurkovic D, Bourne TH, Collins WP, Campbell S (1999) Sonographic prediction of malignancy in adnexal masses using an artificial neural network. BJOG 106:21–30
Timmerman D, Verrelst H, Bourne TH, De Moor B, Collins WP, Vergote I, Vandewalle J (1999) Artificial neural network models for the preoperative discrimination between malignant and benign adnexal masses. Ultrasound Obstet Gynecol 13:17–25
Biagiotti R, Desii C, Vanzi E, Gacci G (1999) Predicting ovarian malignancy: application of artificial neural networks to transvaginal and color Doppler flow US. Radiology 210:399–403
Clayton RD, Snowden S, Weston MJ, Mogensen O, Eastaugh J, Lane G (1999) Neural networks in the diagnosis of malignant ovarian tumours. BJOG 106:1078–1082
Lu C, Van Gestel T, Suykens JAK, Van Huffel S, Vergote I, Timmerman D (2003) Preoperative prediction of malignancy of ovarian tumors using least squares support vector machines. Artif Intell Med 28:281–306
Timmerman D (2000) Lack of standardization in gynecological ultrasonography (editorial). Ultrasound Obstet Gynecol 16:395–398
Mol BWJ, Boll D, De Kanter M, Heintz APM, Sijmons EA, Oei SG, Bal H, Brölmann HAM (2001) Distinguishing the benign and malignant adnexal mass: an external validation of prognostic models. Gynecol Oncol 80:162–167
Aslam N, Banerjee S, Carr JV, Savvas M, Hooper R, Jurkovic D (2000) Prospective evaluation of logistic regression models for the diagnosis of ovarian cancer. Obstet Gynecol 96:75–80
Valentin L, Hagen B, Tingulstad S, Eik-Nes S (2001) Comparison of ‘pattern recognition’ and logistic regression models for discrimination between benign and malignant pelvic masses: a prospective cross validation. Ultrasound Obstet Gynecol 18:357–365
Timmerman D, Valentin L, Bourne TH, Collins WP, Verrelst H, Vergote I (2000) Terms, definitions and measurements to describe the sonographic features of adnexal tumors: a consensus opinion from the International Ovarian Tumor Analysis (IOTA) group. Ultrasound Obstet Gynecol 16:500–505
Timmerman D, Testa AC, Bourne T, Ferrazzi E, Ameye L, Konstantinovic ML, Van Calster B, Collins WP, Vergote I, Van Huffel S, Valentin L (2005) Logistic regression model to distinguish between the benign and malignant adnexal mass before surgery: a multicenter study by the International Ovarian Tumor Analysis Group. J Clin Oncol 23:8794–8801
Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press, Oxford
MacKay DJC (1995) Probable networks and plausible predictions—a review of practical Bayesian methods for supervised neural networks. Netw-Comput Neural Syst 6:469–505
MacKay DJC (1992) The evidence framework applied to classification networks. Neural Comput 4:698–714
Nabney IT, Evans DJ, Brulé Y, Gordon C (2005) Assessing the effectiveness of Bayesian feature selection. In: Husmeier D et al (eds) Probabilistic modelling in medical informatics and bioinformatics. Springer, London, pp 371–390
Thodberg HH (1996) A review of Bayesian neural networks with an application to near infrared spectroscopy. IEEE Trans Neural Netw 7:56–72
Berger JO (1985) Statistical design theory and Bayesian analysis. Springer, Berlin
Baum EB, Wilczek F (1988) Supervised learning of probability distributions by neural networks. In: Anderson D (ed) Advances in neural information processing systems, American Institute of Physics, New York, pp 52–61
Vellido A, Lisboa PJG (2001) An electronic commerce application of the Bayesian framework for MLPs: the effect of marginalisation and ARD. Neural Comput Appl 10:3–11
Penny WD, Roberts SJ (1999) Bayesian neural networks for classification: how useful is the evidence framework? Neural Netw 12:877–892
Marion G, Saad D (1995) A statistical mechanical analysis of a Bayesian inference scheme for an unrealizable rule. J Phys A-Math Gen 28:2159–2171
Neal RM (1996) Bayesian learning for neural networks. Springer, New York
Vivarelli F, Williams CKI (1997) Comparing Bayesian neural network algorithms for classifying segmented outdoor images. Neural Netw 14:427–437
Lu C (2005) Probabilistic machine learning approaches to medical classification problems. PhD thesis, Department of Electrical Engineering, Katholieke Universiteit Leuven, Leuven
Nabney IT (2002) NETLAB. Algorithms for pattern recognition. Springer, London
Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the 14th international joint conference on artificial intelligence, Morgan Kaufman, San Francisco, pp 1137–1145
Hanley JA, McNeil B (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36
Hastie TJ, Tibshirani RJ (1990) Generalized additive models. Chapman & Hall/CRC, Boca Raton
Dietterich TG (1998) Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput 10:1895–1923
Kirk RE (2007) Effect magnitude: a different focus. J Stat Plan Inference 137:1634–1646
Bouckaert RR (2003) Choosing between two learning algorithms based on calibrated tests. In: Proceedings of the 20th international conference on machine learning, AAAI Press, Menlo Park, pp 51–58
Nadeau I, Bengio Y (2000) Inference for the generalization error. In: Solla S et al (eds) Advances in neural information processing systems, vol 12, MIT Press, Cambridge, pp 307–313
Hand DJ (2006) Classifier technology and the illusion of progress. Stat Sci 21:1–15
Foresee FD, Hagan MT (1997) Gauss–Newton approximation to Bayesian learning. In proceedings of the 1997 international joint conference on neural networks, pp 1930–1935
Etchells TA, Lisboa PJG (2006) Orthogonal search-based rule extraction (OSRE) for trained neural networks: a practical and efficient approach. IEEE Trans Neural Netw 17:374–384
Acknowledgments
This research was supported by the Research Council KUL: GOA-AMBioRICS, CoE EF/05/006 Optimization in Engineering, several PhD/postdoc and fellow grants, Flemish Government: FWO [PhD/postdoc grants, projects, G.0407.02 (support vector machines), G.0360.05 (EEG, Epileptic), G.0519.06 (Noninvasive brain oxygenation), FWO-G.0321.06 (tensors/spectral analysis), research communities (ICCoS, ANMMM)]; IWT (PhD grants), Belgian Federal Science Policy Office IUAP P5/22 (`Dynamical Systems and Control: Computation, Identification and Modelling’), EU: BIOPATTERN (FP6-2002-IST 508803), ETUMOUR (FP6-2002-LIFESCIHEALTH 503094), Healthagents (IST 2004–27214), and by research grants from the Swedish Medical Research Council (grants nos. K98-17X-11605-03A, K2001-72X-11605-06A and K2002-72X-11605-07B), by funds administered by Malmö University Hospital, Allmänna Sjukhusets i Malmö Stiftelse för bekämpande av cancer (the Malmö General Hospital Foundation for fighting against cancer) and ALF-medel (a Swedish governmental grant from the region of Scania).
Author information
Authors and Affiliations
Corresponding author
Additional information
Parts of this work were originally presented at the 28th annual international conference of the IEEE Engineering in Medicine and Biology Society, 31 August–3 September 2006, New York City (NY), USA.
Rights and permissions
About this article
Cite this article
Van Calster, B., Timmerman, D., Nabney, I.T. et al. Using Bayesian neural networks with ARD input selection to detect malignant ovarian masses prior to surgery. Neural Comput & Applic 17, 489–500 (2008). https://doi.org/10.1007/s00521-007-0147-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-007-0147-1