Abstract
There is much empirical evidence about the success of naive Bayesian classification (NBC) in medical applications of attribute-based machine learning. NBC assumes conditional independence between attributes. In classification, such classifiers sum up the pieces of class-related evidence from individual attributes, independently of other attributes. The performance, however, deteriorates significantly when the “interactions” between attributes become critical. We propose an approach to handling attribute interactions within the framework of “voting” classifiers, such as NBC. We propose an operational test for detecting interactions in learning data and a procedure that takes the detected interactions into account while learning. This approach induces a structuring of the domain of attributes, it may lead to improved classifier’s performance and may provide useful novel information for the domain expert when interpreting the results of learning. We report on its application in data analysis and model construction for the prediction of clinical outcome in hip arthroplasty.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shapiro, A.D.: Structured induction in expert systems. Turing Institute Press in association with Addison-Wesley Publishing Company (1987)
Michie, D.: Problem decomposition and the learning of skills. In: Lavrač, N., Wrobel, S. (eds.) ECML 1995. LNCS, vol. 912, pp. 17–31. Springer, Heidelberg (1995)
Zupan, B., Bohanec, M., Demšar, J., Bratko, I.: Learning by discovering concept hierarchies. Artificial Intelligence 109, 211–242 (1999)
Harris, W.H.: Traumatic arthritis of the hip after dislocation and acetabular fractures: Treatment by mold arthroplasty: end result study using a new method of result evaluation. J. Bone Joint. Surg. 51-A, 737–755 (1969)
Zupan, B., Demšar, J., Smrke, D., Božikov, K., Stankovski, V., Bratko, I., Beck, J.R.: Predicting patient’s long term clinical status after hip arthroplasty using hierarchical decision modeling and data mining. Methods of Information in Medicine 40, 25–31 (2001)
Jakulin, A.: Attribute interactions in machine learning. Master’s thesis, University of Ljubljana, Faculty of Computer and Information Science (2003)
McGill, W.J.: Multivariate information transmission. Psychometrika 19, 97–116 (1954)
Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Machine Learning 29, 131–163 (1997)
Struyf, A., Hubert, M., Rousseeuw, P.J.: Integrating robust clustering techniques in S-PLUS. Computational Statistics and Data Analysis 26, 17–37 (1997)
Kononenko, I.: Semi-naive Bayesian classifier. In: Kodratoff, Y. (ed.) EWSL 1991. Lecture Notes in Computer Science (LNAI), vol. 482. Springer, Heidelberg (1991)
Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss. Machine Learning 29, 103–130 (1997)
Rish, I., Hellerstein, J., Jayram, T.: An analysis of data characteristics that affect naive Bayes performance. Technical Report RC21993, IBM (2001)
Demšar, J., Zupan, B.: Orange: a data mining framework. (2002), http://magix.fri.unilj.si/orange
Brier, G.W.: Verification of forecasts expressed in terms of probability. Weather Rev. 78, 1–3 (1950)
Margolis, D.J., Halpern, A.C., Rebbeck, T., et al.: Validation of a melanoma prognostic model. Arch. Dermatol. 134, 1597–1601 (1998)
Myllymaki, P., Silander, T., Tirri, H., Uronen, P.: B-Course: A web-based tool for Bayesian and causal data analysis. International Journal on Artificial Intelligence Tools 11, 369–387 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jakulin, A., Bratko, I., Smrke, D., Demšar, J., Zupan, B. (2003). Attribute Interactions in Medical Data Analysis. In: Dojat, M., Keravnou, E.T., Barahona, P. (eds) Artificial Intelligence in Medicine. AIME 2003. Lecture Notes in Computer Science(), vol 2780. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39907-0_32
Download citation
DOI: https://doi.org/10.1007/978-3-540-39907-0_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20129-8
Online ISBN: 978-3-540-39907-0
eBook Packages: Springer Book Archive