Data mining techniques applied to medical information

Knowledge discovery from the dramatically increased data of an auto-stored medical information system is still in its infancy. The purpose of this study is to use widely available and easily operated techniques that can satisfy general users in extracting specific knowledge to make the medical information system more functional. Data mining techniques, including data visualisation, correlation analysis, discriminant analysis, and neural networks supervised classification, were applied to heart disease databases. These techniques can help to identify high risk patients, define the most important factors (variables) in heart disease, and build a multivariate relationship model to show the relationship between any two variables in a way that such relationships are easy to view. Simple visualization techniques were utilised to construct this model, which corresponds with current medical knowledge. Two nonparametric (distribution assumption free) classification tools were employed to identify high risk heart disease patients. Both the neural networks supervised classification methods and the discriminant analysis method produced reliable classification rates for heart disease patients. However, neural networks yielded a higher percentage of correct classifications (averaging 89%) than discriminant analysis (79%). Data visualisation and correlation analysis resulted in similar conclusions regarding the most important factors in heart disease. These data mining tools provide simple and effective methods of extracting knowledge from general medical information. The treatment of missing data is also discussed.

[1]  R. Detrano,et al.  International application of a new probability algorithm for the diagnosis of coronary artery disease. , 1989, The American journal of cardiology.

[2]  H ShortliffeEdward The adolescence of AI in Medicine , 1993 .

[3]  Y Reisman,et al.  Computer-based clinical decision aids. A review of methods and assessment of systems. , 1996, Medical informatics = Medecine et informatique.

[4]  J C Lindsey,et al.  Interpreting treatment differences when patients drop out of a clinical trial. , 1998, AIDS patient care and STDs.

[5]  R. W. Dobbins,et al.  Computational intelligence PC tools , 1996 .

[6]  伏信 進矢 アイオワで computational な夏 , 2007 .

[7]  Pat Langley,et al.  Models of Incremental Concept Formation , 1990, Artif. Intell..

[8]  Nicholas V. Findler An Artificial Intelligence Technique for Information and Fact Retrieval: An Application in Medical Knowledge Processing , 1991 .

[9]  C. Kastner,et al.  MAREG and WinMAREG , 1996 .

[10]  Laurene V. Fausett,et al.  Fundamentals Of Neural Networks , 1994 .

[11]  E. B. Steen,et al.  The Computer-Based Patient Record: An Essential Technology for Health Care , 1992, Annals of Internal Medicine.

[12]  J Rashbass Guide to Medical Informatics, the Internet and Telemedicine , 1998 .

[13]  L Gierl,et al.  Multiparametric time course prognoses by means of case-based reasoning and abstractions of data and time. , 1997, Medical informatics = Medecine et informatique.

[14]  Edward H. Shortliffe,et al.  Computer-based medical consultations, MYCIN , 1976 .

[15]  Edward H. Shortliffe,et al.  The adolescence of AI in medicine: will the field come of age in the '90s? , 1993, Artif. Intell. Medicine.

[16]  J. Evans,et al.  The General Practice Morbidity Database Project Wales--a methodology for primary care data extraction. , 1997, Medical informatics = Medecine et informatique.

[17]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[18]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[19]  Jon Doyle,et al.  Guardian Angel: Patient-Centered Health Information Systems , 1994 .