Medical Diagnosis Data Mining Based on Improved Apriori Algorithm

With the wide application of computer science and technology, the amount of data generated by various disciplines increased rapidly. In order to discover valuable knowledge in these databases, people use data mining methods to solve this problem. The application of association rule mining is an important research topic in data mining. As the association rule technology becomes more mature, it is a new research that how to use this method to find out the intrinsic association rules from a large number of medical data, providing an effective basis for clinical disease surveillance, evaluation of drug treatment and disease prevention. This paper uses Apriori, the classic algorithm of association rule, for data mining analysis of medical data. According to the characteristics of medical data, it improved the Apriori algorithm. Using the improved Apriori algorithm, it finds frequent item sets in a database of medical diagnosis, and generates strong association rules, in order to find out the useful association relationship or pattern between the large data item sets. The results show that, the improved Apriori algorithm can dig out association rule models about the properties and nature of the disease from a medical database, which can assist doctors in medical diagnosis. Therefore, it is a worthy research direction that using data mining method to process and analyze the data of disease prevention and drug treatment in the field of medicine

[1]  Krzysztof J. Cios,et al.  Bayesian learning for cardiac SPECT image interpretation , 2002, Artif. Intell. Medicine.

[2]  Lin Feng,et al.  AT-Mine: An Efficient Algorithm of Frequent Itemset Mining on Uncertain Dataset , 2013, J. Comput..

[3]  Yu Chang Application of data mining in medical field , 2003 .

[4]  Huiyong Wang,et al.  Research on Frequent Itemsets Mining Algorithm based on Relational Database , 2013, J. Softw..

[5]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[6]  Ye Wang,et al.  Data Process of Diagnose Expert System based on Neural Network , 2013, J. Networks.

[7]  F. Gilliland,et al.  Ethnic differences in the prevalence of nonmalignant respiratory disease among uranium miners. , 1997, American journal of public health.

[8]  Susan P. Imberman,et al.  Using dependency/association rules to find indications for computed tomography in a head trauma dataset , 2002, Artif. Intell. Medicine.

[9]  Dafei Wu,et al.  Research on Patient Privacy Protection for Medical Data in Cloud Computing , 2013, J. Networks.

[10]  Kristian Kersting,et al.  Analysis of respiratory pressure-volume curves in intensive care medicine using inductive machine learning , 2002, Artif. Intell. Medicine.

[11]  Junping Wang,et al.  Data Cleaning of Medical Data for Knowledge Mining , 2013, J. Networks.

[12]  R. Nishikawa,et al.  The use of a priori information in the detection of mammographic microcalcifications to improve their classification. , 2003, Medical physics.

[13]  Ping Wang,et al.  Design and Implementation of a General Purpose 2D CAD System , 2012, J. Comput..

[14]  Chen Xue Establishment of an analysis system based on clinical database of malignant blood-diseases , 2005 .

[15]  Wang Jian-wen Data mining and its application , 2005 .

[16]  Bin Dong,et al.  The automatic diagnosis system of breast cancer based on the improved Apriori algorithm , 2012, 2012 International Conference on Machine Learning and Cybernetics.

[17]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[18]  Sen Gan Application of Data Mining in Medical Field , 2007 .

[19]  Qu Ai The research of data mining and knowledge discovery in computer aided medical diagnosing system , 2002 .

[20]  H T Lynch,et al.  Automated detection of hereditary syndromes using data mining. , 1997, Computers and biomedical research, an international journal.

[21]  Zhang Yong Data Mining and Its Application in Medical Science , 2005 .

[22]  Ling Chen,et al.  A Fast and Efficient Algorithm for Finding Frequent Items over Data Stream , 2012, J. Comput..

[23]  Sruti Gan Chaudhuri,et al.  Design and implementation of a , 2012 .

[24]  Yao Wei Mining decision rules based on the improved Apriori algorithm , 2013 .

[25]  Hong Chen,et al.  An Empirical Study of the Critical Factors Influencing Learner Satisfaction and Effectiveness: A 3D CAD System's Perspective , 2013, J. Softw..