Modelling of inquiry diagnosis for coronary heart disease in traditional Chinese medicine by using multi-label learning

Background: Coronary heart disease (CHD) is a common cardiovascular disease that is extremely harmful to humans. Traditional Chinese Medicine (TCM) has a long history of, and ample experience in, diagnosing and treating CHD. However, non-standardized inquiry information affects diagnosis and treatment in TCM to a certain extent. In this paper, we study the standardization of inquiry information in the diagnosis of CHD and design a diagnostic model to provide a methodological reference for the quantitative diagnosis of CHD syndromes. In the TCM diagnosis of CHD, one patient may present several syndrome patterns at once, whereas conventional single-label data mining techniques can build only one model at a time. Here a novel multi-label learning (MLL) technique is explored to solve this problem.

Methods: A standardized scale for inquiry diagnosis of CHD in TCM is designed, and the inquiry diagnostic model is constructed from the collected data using MLL techniques. One popular MLL algorithm, ML-kNN, is compared with two other MLL algorithms, RankSVM and BPMLL, as well as with one commonly used single-label learning algorithm, the k-nearest neighbour (kNN) algorithm. Furthermore, the influence of symptom selection on the diagnostic model is investigated: symptoms are removed in order of frequency, from low to high, and the diagnostic models are constructed on the remaining symptom subsets.

Results: A total of 555 cases are collected for the modelling of inquiry diagnosis of CHD. The patients are diagnosed clinically by combining inspection, pulse feeling, palpation and the standardized inquiry information. Models of six syndromes are constructed by ML-kNN, RankSVM, BPMLL and kNN, with mean diagnostic accuracies of 77%, 71%, 75% and 74%, respectively. After low-frequency symptoms are removed, the mean accuracies of the models built by ML-kNN, RankSVM, BPMLL and kNN reach 78%, 73%, 75% and 76%, respectively, when 52 symptoms remain.

Conclusions: The novel MLL techniques facilitate building standardized inquiry models for CHD diagnosis and offer a practical approach to labelling multiple syndromes simultaneously.
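The symptom-selection and nearest-neighbour steps described in the Methods can be illustrated with a minimal sketch. It assumes each case is a binary symptom vector and each diagnosis is a binary vector with one entry per syndrome. All function names and the toy data are hypothetical, and the classifier is a plain per-label majority-vote kNN with Hamming distance, not the ML-kNN algorithm itself, which estimates each label by a maximum a posteriori rule over neighbour label counts.

```python
def keep_frequent_symptoms(cases, n_keep):
    """Return the column indices of the n_keep most frequent symptoms."""
    n_symptoms = len(cases[0])
    freq = [sum(case[j] for case in cases) for j in range(n_symptoms)]
    # Stable sort: ties keep their original column order.
    ranked = sorted(range(n_symptoms), key=lambda j: -freq[j])
    return sorted(ranked[:n_keep])


def project(cases, columns):
    """Restrict every case to the selected symptom columns."""
    return [[case[j] for j in columns] for case in cases]


def hamming(a, b):
    """Number of symptom positions where two cases differ."""
    return sum(x != y for x, y in zip(a, b))


def knn_multilabel_predict(train_x, train_y, query, k=3):
    """Predict each syndrome by majority vote among the k nearest cases."""
    order = sorted(range(len(train_x)), key=lambda i: hamming(train_x[i], query))
    neighbours = order[:k]
    n_labels = len(train_y[0])
    return [
        int(2 * sum(train_y[i][label] for i in neighbours) > k)
        for label in range(n_labels)
    ]


def mean_label_accuracy(y_true, y_pred):
    """Average, over syndromes, of the fraction of cases labelled correctly."""
    n_labels = len(y_true[0])
    per_label = [
        sum(t[label] == p[label] for t, p in zip(y_true, y_pred)) / len(y_true)
        for label in range(n_labels)
    ]
    return sum(per_label) / n_labels


# Toy data (invented for illustration): 5 cases, 4 symptoms, 2 syndromes.
train_x = [[1, 1, 0, 0], [1, 1, 0, 1], [0, 0, 1, 0], [0, 0, 1, 1], [1, 0, 0, 0]]
train_y = [[1, 0], [1, 0], [0, 1], [0, 1], [1, 1]]

columns = keep_frequent_symptoms(train_x, n_keep=3)
reduced_x = project(train_x, columns)
prediction = knn_multilabel_predict(reduced_x, train_y, query=[1, 1, 0], k=3)
print(columns, prediction)
```

Because every syndrome is predicted from the same set of neighbours, one pass over the data yields all labels for a patient at once, which is the property the abstract contrasts with building one single-label model at a time.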
