Machine learning for medical diagnosis: history, state of the art and perspective

https://doi.org/10.1016/S0933-3657(01)00077-XGet rights and content

Abstract

The paper provides an overview of the development of intelligent data analysis in medicine from a machine learning perspective: a historical view, a state-of-the-art view, and a view on some future trends in this subfield of applied artificial intelligence. The paper is not intended to provide a comprehensive overview but rather describes some subareas and directions which from my personal point of view seem to be important for applying machine learning in medical diagnosis. In the historical overview, I emphasize the naive Bayesian classifier, neural networks and decision trees. I present a comparison of some state-of-the-art systems, representatives from each branch of machine learning, when applied to several medical diagnostic tasks. The future trends are illustrated by two case studies. The first describes a recently developed method for dealing with reliability of decisions of classifiers, which seems to be promising for intelligent data analysis in medicine. The second describes an approach to using machine learning in order to verify some unexplained phenomena from complementary medicine, which is not (yet) approved by the orthodox medical community but could in the future play an important role in overall medical diagnosis and treatment.

Introduction

Artificial intelligence is a part of computer science that tries to make computers more intelligent. One of the basic requirements for any intelligent behavior is learning. Most of the researchers today agree that there is no intelligence without learning. Therefore, machine learning [1], [2], [3], [4] is one of major branches of artificial intelligence and, indeed, it is one of the most rapidly developing subfields of AI research.

Machine learning algorithms were from the very beginning designed and used to analyze medical datasets. Today, machine learning provides several indispensible tools for intelligent data analysis. Especially in the last few years, the digital revolution provided relatively inexpensive and available means to collect and store the data. Modern hospitals are well equipped with monitoring and other data collection devices, and data is gathered and shared in large information systems. Machine learning technology is currently well suited for analyzing medical data, and in particular there is a lot of work done in medical diagnosis in small specialized diagnostic problems.

Data about correct diagnoses are often available in the form of medical records in specialized hospitals or their departments. All that has to be done is to input the patient records with known correct diagnosis into a computer program to run a learning algorithm. This is of course an oversimplification, but in principle, the medical diagnostic knowledge can be automatically derived from the description of cases solved in the past. The derived classifier can then be used either to assist the physician when diagnosing new patients in order to improve the diagnostic speed, accuracy and/or reliability, or to train students or physicians non-specialists to diagnose patients in a special diagnostic problem.

The aim of this paper is to provide an overview of the development of the intelligent data analysis in medicine from a machine learning perspective: a historical view, a state-of-the-art view and a view on some future trends in this subfield of applied artificial intelligence, which are, respectively, described in 2 Historical overview, 3 State of the art, 4 Future trends — two case studies. None of the three sections is intended to provide a comprehensive overview, but rather describe some subeareas and directions which from my personal point of view seem to be important for medical diagnosis. In the historical overview, I emphasize the naive Bayesian classifier, neural networks, and decision trees. Section 3 presents a comparison of some state-of-the-art systems, one or two representatives from each branch of machine learning, when applied to several medical diagnostic tasks. The future trends are illustrated by two case studies. Section 4.1 describes a recently developed method for dealing with reliability of decisions of classifiers, which seems to be promising for intelligent data analysis in medicine. Section 4.2 describes an approach to using machine learning in order to verify some unexplained phenomena from complementary medicine, which is not (yet) approved by the orthodox medical community, but could in the future play an important role in overall medical diagnosis and treatment.

Section snippets

Historical overview

As soon as electronic computers came into use in the 1950s and 1960s, the algorithms were developed that enabled modeling and analyzing large sets of data. From the very beginning, three major branches of machine learning emerged. Classical work in symbolic learning is described by Hunt et al. [5], in statistical methods by Nilsson [6], and in neural networks by Rosenblatt [7]. Through the years, all three branches developed advanced methods [2]: statistical or pattern recognition methods, such

State of the art

In this section, we give a description of specific requirements that any machine learning system has to satisfy in order to be used in the development of applications in medical diagnosis. Several learning algorithms are briefly described. We compared the performance of all the algorithms on several medical diagnostic and prognostic problems, and their appropriateness for applications in medical diagnosis is discussed.

Future trends — two case studies

There are many directions in which future development of machine learning in medical diagnosis may take place. Some may rely on new trends in computer technology or technology of medical equipment, however, probably more important is going to be the development of new machine learning algorithms and the philosophy of medical diagnosis. We do not want to speculate all possible trends. Instead, we describe two case studies that illustrate the new trends in the development of machine learning

Discussion

The historical development of machine learning and its applications in medical diagnosis shows that from simple and straightforward to use algorithms, systems and methodology have emerged that enable advanced and sophisticated data analysis. In the future, intelligent data analysis will play even a more important role due to the huge amount of information produced and stored by modern technology. Current machine learning algorithms provide tools that can significantly help medical practitioners

Acknowledgements

Special thanks to Ivan Bratko, Matjaž Kukar, and Nada Lavrač for longterm joint work on projects related to intelligent data analysis in medicine. Experiments with the Kirlian camera were done with the invaluable help and support from Matjaž Bevk, Zoran Bosnić, Tom Chalko, Minnie Hein, my wife Irena, Milan Mladženović, Barbara Novak, Petar Papuga, Vili Poznik, Bor Prihavec, Marko Robnik-Šikonja, Aleksander Sadikov, Danijel Skočaj, Slobodan Stanojević, Tatjana Zrimec, and many others. I thank

References (81)

  • M. Kukar et al.

    Machine learning in prognostics of the femoral neck fracture recovery

    Artif. Intell. Med.

    (1996)
  • M. Kukar et al.

    Analysing and improving the diagnosis of ishaemic heart disease with machine learning

    Artif. Intell. Med.

    (1999)
  • Shavlik JW, Dietterich TG, editors. Readings in machine learning. Los Altos, CA: Morgan Kaufmann,...
  • Michie D, Spiegelhalter DJ, Taylor CC, editors. Machine learning, neural and statistical classification. Chichester,...
  • Mitchell T. Machine learning. New York: McGraw-Hill,...
  • Michalski RS, Bratko I, Kubat M, editors. Machine learning, data mining and knowledge discovery: methods and...
  • Hunt E, Martin J, Stone P. Experiments in induction. New York: Academic Press,...
  • Nilsson N. Learning machines. New York: McGraw-Hill,...
  • Rosenblatt F. Principles of neurodynamics. Washington, DC: Spartan Books,...
  • Kononenko I, Bratko I, Kukar M. Application of machine learning to medical diagnosis. In: Michalski RS, Bratko I, Kubat...
  • Lavrač N, Džeroski S. Inductive logic programming. Chichester, UK: Ellis Horwood,...
  • Pompe U, Kononenko I. Probabilistic first-order classification. In: Lavrač N, Džeroski S, editors. Proceedings of the...
  • D.J. Spiegelhalter et al.

    Bayesian analysis in expert systems

    Stat. Sci.

    (1993)
  • Good IJ. Probability and the weighing of evidence. London: Charles Griffin,...
  • Good IJ. The estimation of probabilities — an essay on modern Bayesian methods. Cambridge: MIT Press,...
  • Kononenko I, Bratko I, Roškar E. Experiments in automatic learning of medical diagnostic rules. In: Proceedings of the...
  • Cestnik B, Kononenko I, Bratko I. ASSISTANT 86: a knowledge elicitation tool for sophisticated users. In: Bratko I,...
  • Kononenko I. Interpretation of neural networks decisions. In: Proceedings of the IASTED International Conference on...
  • I. Kononenko

    Inductive and Bayesian learning in medical diagnosis

    Appl. Artif. Intell.

    (1993)
  • Cestnik B. Estimating probabilities: a crucial task in machine learning. In: Proceedings of the European Conference on...
  • Kononenko I. Semi-naive Bayesian classifier. In: Kodratoff Y, editor. Proceedings of the European Working Session on...
  • I. Kononenko

    Naive Bayesian classifier and continuous attributes

    Informatica

    (1992)
  • Langley P. Induction of recursive Bayesian classifiers. In: Proceedings of the European Conference on Machine Learning,...
  • Pazzani M. Searching for dependencies in Bayesian classifiers. Artificial intelligence and statistics IV. Lecture notes...
  • Kohavi R, Becker B, Sommerfield D. Making sense of simple Bayes. Technical report, Data Mining and Visualization Group,...
  • Minsky M. In: Papert S, editor. Perceptrons. Cambridge, MA: MIT Press,...
  • J.J. Hopfield

    Neural networks and physical systems with emergent collective computational abilities

    Proc. Natl. Acad. Sci.

    (1982)
  • J.J. Hopfield

    Neurons with graded response have collective computational properties like those of two-state neurons

    Proc. Natl. Acad. Sci.

    (1984)
  • Rumelhart DE, Hinton GE, Williams RJ. Learning internal representations by error propagation. In: Rumelhart DE,...
  • Haykin S. Neural networks: a comprehensive foundation. New York: Macmillan,...
  • I. Kononenko

    Bayesian neural networks

    Biol. Cybernetics J.

    (1989)
  • Quinlan JR. Discovering rules from large collections of examples. In: Michie D, editor. Expert systems in the...
  • R.S. Michalski et al.

    Learning by being told and learning from examples: an experimental comparison of the two methods of knowledge acquisition in the context of developing an expert system for soybean disease diagnosis

    Int. J. Policy Anal. Inform. Syst.

    (1980)
  • I. Bratko et al.

    An experiment in automatic learning of diagnostic rules

    Informatica (Ljubljana)

    (1980)
  • Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and regression trees. Belmont, CA: Wadsworth,...
  • Elomaa T, Holsti N. An experimental comparison of inducing decision trees and decision lists in noisy domains. In:...
  • Lesmo L, Saitta L, Torasso P. Learning of fuzzy production rules for medical diagnoses. In: Gupta MM, Sanchez E,...
  • K.A. Horn et al.

    An expert system for the interpretation of thyroid assays in a clinical laboratory

    Aust. Comput. J.

    (1985)
  • Hojker S, Kononenko I, Jauk A, Fidler V, Porenta M. Expert system’s development in the management of thyroid diseases....
  • Quinlan R, Compton P, Horn KA, Lazarus L. Inductive knowledge acquisition: a case study. In: Quinlan JR, editor....
  • Cited by (1219)

    • Novel three-way decision model in medical diagnosis based on inexact reasoning

      2024, Engineering Applications of Artificial Intelligence
    View all citing articles on Scopus
    View full text