Multi-label classification methods for improving comorbidities identification

The medical diagnostic process may be supported by computational classification techniques. In many cases, patients are affected by multiple illnesses, and more than one classification label is required to improve medical decision-making. In this paper, we consider a multi-perspective classification problem for medical diagnostics, where cases are described by labels from separate sets. We attempt to improve the identification of comorbidities using multi-label classification techniques. Several investigated methods, which provide label dependencies, are analysed and evaluated. The methods' performances are verified by experiments conducted on four sets of medical data from subject patients. The results were evaluated using several metrics and were statistically verified. We compare the effects of the techniques that do and do not consider label correlations. We demonstrate that multi-label classification methods from the first group outperform the techniques from the second one.

[1]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[2]  Zhang Zhigan Parallel coordinates visualization based on image morphing , 2015 .

[3]  Hui Wu,et al.  Local analgesia adverse effects prediction using multi-label classification , 2012, Neurocomputing.

[4]  Cristina V. Lopes,et al.  Multi-Label Classification of Short Text: A Study on Wikipedia Barnstars , 2011, Analyzing Microtext.

[5]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[6]  Fei Wang,et al.  Supervised patient similarity measure of heterogeneous patient records , 2012, SKDD.

[7]  Andrew McCallum,et al.  Collective multi-label classification , 2005, CIKM '05.

[8]  Amanda Clare,et al.  Knowledge Discovery in Multi-label Phenotype Data , 2001, PKDD.

[9]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[10]  Tao Li,et al.  Content-based music similarity search and emotion detection , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Stefano Bromuri,et al.  Multi-label classification of chronically ill patients with bag of words and supervised dimensionality reduction algorithms , 2014, J. Biomed. Informatics.

[12]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[13]  Bertjan Broeksema,et al.  Big Data Visual Analytics with Parallel Coordinates , 2015, 2015 Big Data Visual Analytics (BDVA).

[14]  Rui Guo,et al.  A multi-instance multi-label learning approach to objective auscultation analysis of traditional Chinese medicine , 2011, 2011 4th International Conference on Biomedical Engineering and Informatics (BMEI).

[15]  Sunita Sarawagi,et al.  Discriminative Methods for Multi-labeled Classification , 2004, PAKDD.

[16]  Hui Zhang,et al.  Multi-label classification with Bayes' theorem , 2011, 2011 4th International Conference on Biomedical Engineering and Informatics (BMEI).

[17]  Eyke Hüllermeier,et al.  Multilabel classification via calibrated label ranking , 2008, Machine Learning.

[18]  Alfred Inselberg,et al.  Parallel Coordinates: Visualization, Exploration and Classification of High-Dimensional Data , 2008 .

[19]  L. M. Patnaik,et al.  Prominent label identification and multi-label classification for cancer prognosis prediction , 2012, TENCON 2012 IEEE Region 10 Conference.

[20]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[21]  J. Koenderink Q… , 2014, Les noms officiels des communes de Wallonie, de Bruxelles-Capitale et de la communaute germanophone.

[22]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[23]  Nathalie Japkowicz,et al.  Multi-label Classification of Anemia Patients , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[24]  Lina Yao,et al.  Diagnosis Code Assignment Using Sparsity-Based Disease Correlation Embedding , 2016, IEEE Transactions on Knowledge and Data Engineering.

[25]  Archana Bhattarai,et al.  Classification of Clinical Conditions: A Case Study on Prediction of Obesity and Its Co-morbidities , 2009 .

[26]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[27]  Edward Puchala,et al.  Comparison of Multi-label and Multi-perspective Classifiers in Multi-task Pattern Recognition Problems , 2015, CORES.

[28]  Abbas Z. Kouzani,et al.  Comparative evaluation of multi-label classification methods , 2012, 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery.

[29]  Mei-Ling Huang,et al.  Fetal distress prediction using discriminant analysis, decision tree, and artificial neural network , 2012 .

[30]  Eyke Hüllermeier,et al.  Bayes Optimal Multilabel Classification via Probabilistic Classifier Chains , 2010, ICML.

[31]  Rolf Ingold,et al.  Performance comparison of multi-label learning algorithms on clinical data for chronic diseases , 2015, Comput. Biol. Medicine.

[32]  Ronaldo C. Prati,et al.  Fuzzy rule classifiers for multi-label classification , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[33]  Yue Peng,et al.  Entropy chain multi-label classifiers for Traditional Medicine diagnosing Parkinson's disease , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[34]  Danuta Zakrzewska,et al.  Effective Multi-label Classification Method for Multidimensional Datasets , 2015, FQAS.

[35]  Guo-Zheng Li,et al.  Clinical multi-label free text classification by exploiting disease label relation , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[36]  Agnieszka Wosiak,et al.  Intra-uterine growth restriction as a risk factor for hypertension in children six to 10 years old , 2014, Cardiovascular journal of Africa.

[37]  Przemyslaw Kazienko,et al.  Multi-label classification using error correcting output codes , 2012, Int. J. Appl. Math. Comput. Sci..

[38]  Feng-Feng Shao,et al.  Patient classification of hypertension in Traditional Chinese Medicine using multi-label learning techniques , 2015, BMC Medical Genomics.

[39]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[40]  Jimeng Sun,et al.  Localized Supervised Metric Learning on Temporal Physiological Data , 2010, 2010 20th International Conference on Pattern Recognition.

[41]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[42]  Ramakanth Kavuluru,et al.  Supervised Extraction of Diagnosis Codes from EMRs: Role of Feature Selection, Data Selection, and Probabilistic Thresholding , 2013, 2013 IEEE International Conference on Healthcare Informatics.

[43]  M.W. Kurzynski,et al.  Multiperspective recognition applied to the computer-aided medical diagnosis - a comparative study of methods , 2001, 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[44]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[45]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[46]  Rui Guo,et al.  Classifying syndromes in Chinese medicine using multi-label learning algorithm with relevant features for each label , 2016, Chinese Journal of Integrative Medicine.

[47]  Jun Suzuki,et al.  Multi-label Text Categorization with Model Combination based on F1-score Maximization , 2008, IJCNLP.

[48]  Agnieszka Wosiak,et al.  Improving Children Diagnostics by Efficient Multi-label Classification Method , 2016, ITIB.