Performance comparison of multi-label learning algorithms on clinical data for chronic diseases

We are motivated by the issue of classifying diseases of chronically ill patients to assist physicians in their everyday work. Our goal is to provide a performance comparison of state-of-the-art multi-label learning algorithms for the analysis of multivariate sequential clinical data from medical records of patients affected by chronic diseases. As a matter of fact, the multi-label learning approach appears to be a good candidate for modeling overlapped medical conditions, specific to chronically ill patients. With the availability of such comparison study, the evaluation of new algorithms should be enhanced. According to the method, we choose a summary statistics approach for the processing of the sequential clinical data, so that the extracted features maintain an interpretable link to their corresponding medical records. The publicly available MIMIC-II dataset, which contains more than 19,000 patients with chronic diseases, is used in this study. For the comparison we selected the following multi-label algorithms: ML-kNN, AdaBoostMH, binary relevance, classifier chains, HOMER and RAkEL. Regarding the results, binary relevance approaches, despite their elementary design and their independence assumption concerning the chronic illnesses, perform optimally in most scenarios, in particular for the detection of relevant diseases. In addition, binary relevance approaches scale up to large dataset and are easy to learn. However, the RAkEL algorithm, despite its scalability problems when it is confronted to large dataset, performs well in the scenario which consists of the ranking of the labels according to the dominant disease of the patient.

[1]  A. Alwan Global status report on noncommunicable diseases 2010. , 2011 .

[2]  Piotr Synak,et al.  Multi-Label Classification of Emotions in Music , 2006, Intelligent Information Systems.

[3]  ZhouZhi-Hua,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006 .

[4]  Grigorios Tsoumakas,et al.  Multi-Label Classification , 2009, Database Technologies: Concepts, Methodologies, Tools, and Applications.

[5]  Saeid Nahavandi,et al.  Bag-of-words representation for biomedical time series classification , 2012, Biomed. Signal Process. Control..

[6]  Grigorios Tsoumakas,et al.  Multi-Label Classification of Music into Emotions , 2008, ISMIR.

[7]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[8]  C. Mathers Global Burden of Disease , 2008 .

[9]  M. Ezzati,et al.  National, regional, and global trends in fasting plasma glucose and diabetes prevalence since 1980: systematic analysis of health examination surveys and epidemiological studies with 370 country-years and 2·7 million participants , 2011, The Lancet.

[10]  Robert E. Schapire,et al.  Hierarchical multi-label prediction of gene function , 2006, Bioinform..

[11]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[12]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[13]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[14]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[15]  Damien Zufferey,et al.  Probabilistic Multi-Label Learning for Medical Data , 2014, IEEE Intell. Informatics Bull..

[16]  Saso Dzeroski,et al.  Ensembles of Multi-Objective Decision Trees , 2007, ECML.

[17]  Jesse Read,et al.  A Pruned Problem Transformation Method for Multi-label Classification , 2008 .

[18]  B Jennett,et al.  Adding up the Glasgow Coma Score. , 1979, Acta neurochirurgica. Supplementum.

[19]  Stefano Bromuri,et al.  Multi-label classification of chronically ill patients with bag of words and supervised dimensionality reduction algorithms , 2014, J. Biomed. Informatics.

[20]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[21]  D. Scott,et al.  ICU admission characteristics and mortality rates among elderly and very elderly patients , 2012, Intensive Care Medicine.

[22]  P. Zimmet,et al.  Definition, diagnosis and classification of diabetes mellitus and its complications. Part 1: diagnosis and classification of diabetes mellitus. Provisional report of a WHO Consultation , 1998, Diabetic medicine : a journal of the British Diabetic Association.

[23]  G. Fraser,et al.  Prospective evaluation of the Sedation-Agitation Scale for adult critically ill patients. , 1999, Critical care medicine.

[24]  E. Barrett-Connor,et al.  Diabetes and hypertension in a community of older adults. , 1981, American journal of epidemiology.

[25]  Fei Wang,et al.  Supervised patient similarity measure of heterogeneous patient records , 2012, SKDD.

[26]  Mu-Yen Chen,et al.  Integrating data mining with case-based reasoning for chronic diseases prognosis and diagnosis , 2007, Expert Syst. Appl..

[27]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[28]  Amanda Clare,et al.  Knowledge Discovery in Multi-label Phenotype Data , 2001, PKDD.

[29]  M. Saeed Multiparameter Intelligent Monitoring in Intensive Care II ( MIMIC-II ) : A public-access intensive care unit database , 2011 .

[30]  Vimla L. Patel,et al.  Understanding the nature of information seeking behavior in critical care: Implications for the design of health information technology , 2013, Artif. Intell. Medicine.

[31]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[32]  T. H. Kyaw,et al.  Multiparameter Intelligent Monitoring in Intensive Care II: A public-access intensive care unit database* , 2011, Critical care medicine.

[33]  K. Chou,et al.  iLoc-Virus: a multi-label learning classifier for identifying the subcellular localization of virus proteins with both single and multiple sites. , 2011, Journal of theoretical biology.

[34]  J. Vincent,et al.  The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure , 1996, Intensive Care Medicine.

[35]  Luc De Raedt,et al.  Top-Down Induction of Clustering Trees , 1998, ICML.

[36]  Dan Jackson,et al.  How much can we learn about missing data?: an exploration of a clinical trial in psychiatry , 2010, Journal of the Royal Statistical Society. Series A,.

[37]  Esther Rodríguez-Villegas,et al.  COMMODITY12: A smart e-health environment for diabetes management , 2013, J. Ambient Intell. Smart Environ..

[38]  Johannes Fürnkranz,et al.  Efficient Pairwise Multilabel Classification for Large-Scale Problems in the Legal Domain , 2008, ECML/PKDD.

[39]  VlahavasIoannis,et al.  Random k-Labelsets for Multilabel Classification , 2011 .

[40]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[41]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[42]  Grigorios Tsoumakas,et al.  Effective and Efficient Multilabel Classification in Domains with Large Number of Labels , 2008 .

[43]  W. Cullen,et al.  Research confuses me: what is the difference between case-control and cohort studies in quantitative research? , 2013, Irish medical journal.

[44]  R. Little,et al.  The prevention and treatment of missing data in clinical trials. , 2012, The New England journal of medicine.

[45]  Alan N Peiris,et al.  Endocrinology in crisis? , 2013, Southern medical journal.

[46]  N. Bergstrom,et al.  The Braden Scale for Predicting Pressure Sore Risk , 1987, Nursing research.

[47]  Alan D. Lopez,et al.  A comparative risk assessment of burden of disease and injury attributable to 67 risk factors and risk factor clusters in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010 , 2012, The Lancet.

[48]  George Miller,et al.  National health spending by medical condition, 1996-2005. , 2009, Health affairs.

[49]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[50]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[51]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[52]  Lori A. Post,et al.  Strategies for Dealing with Missing Data in Clinical Trials: From Design to Analysis , 2013, The Yale journal of biology and medicine.

[53]  L PatelVimla,et al.  Understanding the nature of information seeking behavior in critical care , 2013 .

[54]  Jorge Luís Machado do Amaral,et al.  Machine learning algorithms and forced oscillation measurements applied to the automatic identification of chronic obstructive pulmonary disease , 2012, Comput. Methods Programs Biomed..

[55]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[56]  Juan José del Coz,et al.  Binary relevance efficacy for multilabel classification , 2012, Progress in Artificial Intelligence.

[57]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[58]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..