Machine learning for medical diagnosis: history, state of the art and perspective

The paper provides an overview of the development of intelligent data analysis in medicine from a machine learning perspective: a historical view, a state-of-the-art view, and a view on some future trends in this subfield of applied artificial intelligence. The paper is not intended to provide a comprehensive overview but rather describes some subareas and directions which from my personal point of view seem to be important for applying machine learning in medical diagnosis. In the historical overview, I emphasize the naive Bayesian classifier, neural networks and decision trees. I present a comparison of some state-of-the-art systems, representatives from each branch of machine learning, when applied to several medical diagnostic tasks. The future trends are illustrated by two case studies. The first describes a recently developed method for dealing with reliability of decisions of classifiers, which seems to be promising for intelligent data analysis in medicine. The second describes an approach to using machine learning in order to verify some unexplained phenomena from complementary medicine, which is not (yet) approved by the orthodox medical community but could in the future play an important role in overall medical diagnosis and treatment.

[1]  Jason Catlett,et al.  On Changing Continuous Attributes into Ordered Discrete Attributes , 1991, EWSL.

[2]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[3]  Igor Kononenko,et al.  Analysing and improving the diagnosis of ischaemic heart disease with machine learning , 1999, Artif. Intell. Medicine.

[4]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[5]  David E. Rumelhart,et al.  Predicting the Future: a Connectionist Approach , 1990, Int. J. Neural Syst..

[6]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[7]  David J. Spiegelhalter,et al.  Bayesian analysis in expert systems , 1993 .

[8]  Huan Liu,et al.  Book review: Machine Learning, Neural and Statistical Classification Edited by D. Michie, D.J. Spiegelhalter and C.C. Taylor (Ellis Horwood Limited, 1994) , 1996, SGAR.

[9]  I. Bratko,et al.  Information-based evaluation criterion for classifier's performance , 2004, Machine Learning.

[10]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[11]  Philip J. Stone,et al.  Experiments in induction , 1966 .

[12]  Marko Robnik-Sikonja,et al.  An adaptation of Relief for attribute estimation in regression , 1997, ICML.

[13]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[14]  Paul Compton,et al.  Inductive knowledge acquisition: a case study , 1987 .

[15]  Brian R. Gaines,et al.  Current Trends in Knowledge Acquisition , 1990 .

[16]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[17]  Thomas G. Dietterich,et al.  Readings in Machine Learning , 1991 .

[18]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[19]  Con Connell,et al.  Applications of Expert Systems , 1989 .

[20]  Jude W. Shavlik,et al.  Learning Symbolic Rules Using Artificial Neural Networks , 1993, ICML.

[21]  J J Hopfield,et al.  Neurons with graded response have collective computational properties like those of two-state neurons. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.

[23]  G. Diamond,et al.  Analysis of probability as an aid in the clinical diagnosis of coronary-artery disease. , 1979, The New England journal of medicine.

[24]  Ivan Bratko,et al.  ASSISTANT 86: A Knowledge-Elicitation Tool for Sophisticated Users , 1987, EWSL.

[25]  Matjaz Kukar,et al.  Machine Learning in Stepwise Diagnostic Process , 1999, AIMDM.

[26]  J. Ross Quinlan,et al.  An Expert System for the Interpretation of Thyroid Assays in a Clinical Laboratory , 1985, Aust. Comput. J..

[27]  Aleksander Sadikov,et al.  GDV images: Current research and results , 2000 .

[28]  A. Hasman Kardio. A study in deep and qualitative knowledge for expert systems , 1991 .

[29]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[30]  I. Good,et al.  The Estimation of Probabilities: An Essay on Modern Bayesian Methods. , 1967 .

[31]  Bojan Cestnik,et al.  Estimating Probabilities: A Crucial Task in Machine Learning , 1990, ECAI.

[32]  Igor Kononenko,et al.  Probabilistic First-Order Classification , 1997, ILP.

[33]  Paul W. Baim A Method for Attribute Selection in Inductive Learning Systems , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  Pietro Torasso,et al.  LEARNING OF FUZZY PRODUCTION RULES FOR MEDICAL DIAGNOSIS , 1993 .

[35]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Igor Kononenko,et al.  Inductive and Bayesian learning in medical diagnosis , 1993, Appl. Artif. Intell..

[37]  Michael J. Pazzani,et al.  Searching for Dependencies in Bayesian Classifiers , 1995, AISTATS.

[38]  Aleksander Sadikov,et al.  Machine learning and GDV images: current research and results , 1999 .

[39]  Igor Kononenko,et al.  Machine learning in prognosis of the femoral neck fracture recovery , 1996, Artif. Intell. Medicine.

[40]  Stephen Muggleton,et al.  Inductive acquisition of expert knowledge , 1986 .

[41]  Ivan Bratko,et al.  Experiments in automatic learning of medical diagnostic rules , 1984 .

[42]  Larry A. Rendell,et al.  Lookahead Feature Construction for Learning Hard Concepts , 1993, International Conference on Machine Learning.

[43]  Igor Kononenko,et al.  Semi-Naive Bayesian Classifier , 1991, EWSL.

[44]  L. J. Savage,et al.  Probability and the weighing of evidence , 1951 .

[45]  Madan M. Gupta,et al.  Approximate reasoning in decision analysis , 1982 .

[46]  Pat Langley,et al.  Induction of Recursive Bayesian Classifiers , 1993, ECML.