Feature selection and syndrome prediction for liver cirrhosis in traditional Chinese medicine

Traditional Chinese medicine (TCM) treatment is one of the safe and effective methods for liver cirrhosis. In the process of its treatment, a very important step, syndrome prediction is generally performed by physicians at present, which actually hinders the application prospects of TCM. Based on the data mining algorithm, a novel method called TCMSP (traditional Chinese medicine syndrome prediction) is proposed, which consists of two phases. In the first phase, based on an improved information gain method in multi-view, the critical features are filtered from the original features. In the second phase, the class label of a new case is predicted automatically based on accuracy-weighted majority voting. The proposed method is evaluated by the liver cirrhosis dataset, 20 critical features are selected from original 105 features and the corresponding syndromes of 138 new cases are identified respectively. The critical features are in sound agreement with those used by the physicians in making their clinical decisions. Finally, this new method is also demonstrated on three standard datasets (SPECT Heart, Lung Cancer and Iris) and the results are compared with some other methods. The experimental results show that TCMSP method performs well in the field of TCM diagnosis.

[1]  Ping Liu,et al.  A self-learning expert system for diagnosis in traditional Chinese medicine , 2004, Expert Syst. Appl..

[2]  Zhang Bin,et al.  Data Mining Application to Syndrome Differentiation in Traditional Chinese Medicine , 2006, 2006 Seventh International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'06).

[3]  Duan Shu-min Support Vector Machine and its Application in the Syndrome Diagnosis of Traditional Chinese Medicine , 2007 .

[4]  Qiao Qiao,et al.  Study of sEGF level in chronic atrophic gastritis with either Chinese traditional medicine or western medicine , 2002 .

[5]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[6]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[7]  Qu Hai Method for Self-Extracting Diagnostic Rules of Blood Stasis Syndrome Based on Decision Tree , 2005 .

[8]  Ma Lizhuang,et al.  Correlation between Child-Pugh Degree and the Four Examinations of Traditional Chinese Medicine (TCM) with Liver Cirrhosis , 2008, BMEI 2008.

[9]  Tao Chen,et al.  Latent tree models and diagnosis in traditional Chinese medicine , 2008, Artif. Intell. Medicine.

[10]  Changle Zhou,et al.  An Approach to Syndrome Differentiation in Traditional Chinese Medicine based on Neural Network , 2007, Third International Conference on Natural Computation (ICNC 2007).

[11]  Ian Witten,et al.  Data Mining , 2000 .

[12]  Chen Hui Study on the Characteristics of Traditional Chinese Medical Syndrome of Hepatocirrhosis Relation with Liver Function Disorder , 2004 .

[13]  Ricardo Vilalta,et al.  A Perspective View and Survey of Meta-Learning , 2002, Artificial Intelligence Review.

[14]  He Guan Exploration on Relationship between Cirrhosis of liver's Traditional Chinese Medical Syndrome Differentiation Typing and Child-pugh degree,Complication , 2002 .

[15]  Zhaohui Wu,et al.  TCM-SIRD: an integrated aided system for traditional Chinese medicine Sizheng , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[16]  Sanjoy Kumar Pal,et al.  Complementary and alternative medicine: An overview , 2002 .

[17]  Zhaohui Wu,et al.  Knowledge discovery in traditional Chinese medicine: State of the art and perspectives , 2006, Artif. Intell. Medicine.

[18]  Siu Cheung Hui,et al.  Computational methods for Traditional Chinese Medicine: A survey , 2007, Comput. Methods Programs Biomed..

[19]  Norman D. Black,et al.  Feature selection and classification model construction on type 2 diabetic patients' data , 2007, Artif. Intell. Medicine.

[20]  Gong Yan-bing Modern methodology of TCM syndrome study(I):Data mining technology of TCM syndrome , 2006 .

[21]  Alexey Tsymbal,et al.  Learning feature selection for medical databases , 1999, Proceedings 12th IEEE Symposium on Computer-Based Medical Systems (Cat. No.99CB36365).

[22]  Chen Huifen Multi-analysis: Characteristics of Traditional Chinese Medical Syndrome of Hepatocirrhosis , 2003 .