Combine multi-valued attribute decomposition with multi-label learning

Multi-valued and multi-labeled learning is concerned with samples associated with a set of values both with label and attribute. This paper proposes a new learning framework, which combines multi-valued attribute decomposition with multi-label learning. To deal with multi-valued attribute, we present five methods which differ in strategies with the correlations of multi values. After data transformation, three classic multi-label algorithms are employed for learning. Experimental results demonstrate that most combined methods significantly outperform the existing decision tree based algorithms. Furthermore, exploring the advantages and limitations of each combined method, we find the optimal combination corresponding to different types of datasets.

[1]  E. F. Codd,et al.  Further Normalization of the Data Base Relational Model , 1971, Research Report / RJ / IBM / San Jose, California.

[2]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[3]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[4]  Geoff Holmes,et al.  Multi-label Classification Using Ensembles of Pruned Sets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[5]  Amanda Clare,et al.  Knowledge Discovery in Multi-label Phenotype Data , 2001, PKDD.

[6]  Grigorios Tsoumakas,et al.  Random k -Labelsets: An Ensemble Method for Multilabel Classification , 2007, ECML.

[7]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[8]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[9]  Shihchieh Chou,et al.  MMDT: a multi-valued and multi-labeled decision tree classifier for data mining , 2005, Expert Syst. Appl..

[10]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[11]  Yihong Gong,et al.  Multi-labelled classification using maximum entropy method , 2005, SIGIR '05.

[12]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[13]  Yen-Liang Chen,et al.  Constructing a multi-valued and multi-labeled decision tree , 2003, Expert Syst. Appl..