Machine Learning for Data Mining in Medicine

Large collections of medical data are a valuable resource from which potentially new and useful knowledge can be discovered through data mining. This paper gives an overview of machine learning approaches used in mining of medical data, distinguishing between symbolic and sub-symbolic data mining methods, and giving references to applications of these methods in medicine. In addition, the paper presents selected measures for performance evaluation used in medical prediction and classification problems, proposing also some alternative measures for rule evaluation that can be used in ranking and filtering of induced rule sets.

[1]  W. Baxt Application of artificial neural networks to clinical medicine , 1995, The Lancet.

[2]  Haku Ishida,et al.  Applying a Neural Network to Prostate Cancer Survival Data , 1997 .

[3]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory, Third Edition , 1989, Springer Series in Information Sciences.

[4]  I. Bratko,et al.  Learning decision rules in noisy domains , 1987 .

[5]  Tsau Young Lin,et al.  Introducing the book , 2000 .

[6]  Nada Lavrac,et al.  Selected techniques for data mining in medicine , 1999, Artif. Intell. Medicine.

[7]  Nada Lavrac,et al.  The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains , 1986, AAAI.

[8]  S Dzeroski,et al.  Rule induction and instance-based learning applied in medical diagnosis. , 1996, Technology and health care : official journal of the European Society for Engineering and Medicine.

[9]  J. Ross Quinlan,et al.  An Expert System for the Interpretation of Thyroid Assays in a Clinical Laboratory , 1985, Aust. Comput. J..

[10]  David McSherry,et al.  Hypothesist: A Development Environment for Intelligent Diagnostic Systems , 1997, AIME.

[11]  Bojan Cestnik,et al.  Estimating Probabilities: A Crucial Task in Machine Learning , 1990, ECAI.

[12]  Ryszard S. Michalski,et al.  A Theory and Methodology of Inductive Learning , 1983, Artificial Intelligence.

[13]  Luc De Raedt,et al.  Clausal Discovery , 1997, Machine Learning.

[14]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[15]  Jan Komorowski,et al.  Modelling prognostic power of cardiac tests using rough sets , 1999, Artif. Intell. Medicine.

[16]  Warren T. Jones,et al.  Research Paper: Association Rules and Data Mining in Hospital Infection Control and Public Health Surveillance , 1998, J. Am. Medical Informatics Assoc..

[17]  Saso Dzeroski,et al.  The utility of background knowledge in learning medical diagnostic rules , 1993, Appl. Artif. Intell..

[18]  F.M. Ham,et al.  Classification of cardiac arrhythmias using fuzzy ARTMAP , 1996, IEEE Transactions on Biomedical Engineering.

[19]  Agnar Aamodt,et al.  Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches , 1994, AI Commun..

[20]  Robert T. Macura,et al.  Case-based reasoning: opportunities and applications in health care , 1997, Artif. Intell. Medicine.

[21]  Yan Zhu,et al.  Computerized tumor boundary detection using a Hopfield neural network , 1997, IEEE Transactions on Medical Imaging.

[22]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[23]  K. Liestøl,et al.  Survival analysis and neural nets. , 1994, Statistics in medicine.

[24]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[25]  J Zeleznikow,et al.  FLORENCE: synthesis of case-based and model-based reasoning in a nursing care planning system. , 1993, Computers in nursing.

[26]  Saso Dzeroski,et al.  Inductive Logic Programming: Techniques and Applications , 1993 .

[27]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[28]  Sholom M. Weiss,et al.  Computer Systems That Learn , 1990 .

[29]  Igor Kononenko,et al.  Inductive and Bayesian learning in medical diagnosis , 1993, Appl. Artif. Intell..

[30]  Laurene V. Fausett,et al.  Fundamentals Of Neural Networks , 1994 .

[31]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[32]  Elpida T. Keravnou,et al.  Intelligent Data Analysis for Medical Diagnosis: Using Machine Learning and Temporal Abstraction , 1998, AI Commun..

[33]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[34]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[35]  Robert F. Harrison,et al.  Application of the fuzzy ARTMAP neural network model to medical pattern classification tasks , 1996, Artif. Intell. Medicine.

[36]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[37]  Zdzislaw Pawlak,et al.  Information systems theoretical foundations , 1981, Inf. Syst..

[38]  G Edwards,et al.  Peirs: A pathologist‐maintained expert system for the interpretation of chemical pathology reports , 1993, Pathology.

[39]  J. L. Hodges,et al.  Discriminatory Analysis - Nonparametric Discrimination: Consistency Properties , 1989 .

[40]  Z. Pawlak,et al.  Reasoning about Knowledge , 1991 .

[41]  Shusaku Tsumoto,et al.  Modelling Medical Diagnostic Rules Based on Rough Sets , 1998, Rough Sets and Current Trends in Computing.

[42]  David McSherry,et al.  Avoiding premature closure in sequential diagnosis , 1997, Artif. Intell. Medicine.

[43]  Paul Compton,et al.  Knowledge in Context: A Strategy for Expert System Maintenance , 1990, Australian Joint Conference on Artificial Intelligence.

[44]  N. Lavrac,et al.  Intelligent Data Analysis in Medicine and Pharmacology , 1997 .

[45]  Hannu Toivonen,et al.  Finding Frequent Substructures in Chemical Compounds , 1998, KDD.

[46]  Rudy Setiono,et al.  Extracting rules from pruned networks for breast cancer diagnosis , 1996, Artif. Intell. Medicine.

[47]  C E Kahn,et al.  Case-Based Reasoning and Imaging Procedure Selection , 1994, Investigative radiology.

[48]  P W Hamilton,et al.  Quantitative study of ductal breast cancer--patient targeted prognosis: an exploration of case base reasoning. , 1997, Pathology, research and practice.

[49]  Peter Clark,et al.  Rule Induction with CN2: Some Recent Improvements , 1991, EWSL.

[50]  Stephen Muggleton,et al.  Carcinogenesis Predictions Using Inductive Logic Programming , 1997 .

[51]  Ah-Hwee Tan,et al.  Rule Extraction, Fuzzy ARTMAP, and Medical Databases , 1993 .

[52]  Cathy H. Wu Artificial Neural Networks for Molecular Sequence Analysis , 1997, Comput. Chem..

[53]  Igor Kononenko,et al.  Semi-Naive Bayesian Classifier , 1991, EWSL.

[54]  Peter A. Flach,et al.  Rule Evaluation Measures: A Unifying View , 1999, ILP.

[55]  Tom M. Mitchell,et al.  Using the Future to Sort Out the Present: Rankprop and Multitask Learning for Medical Risk Evaluation , 1995, NIPS.

[56]  P. Wilding,et al.  The application of backpropagation neural networks to problems in pathology and laboratory medicine. , 1992, Archives of pathology & laboratory medicine.

[57]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[58]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[59]  Thomas G. Dietterich,et al.  A study of distance-based machine learning algorithms , 1994 .