Multi-Label Classification Using Dependent and Independent Dual Space Reduction

While multi-label classification can be widely applied for problems where multiple classes can be assigned to an object, its effectiveness may be sacrificed due to curse of dimensionality in the feature space and sparseness of dimensionality in the label space. As a solution, this paper presents two alternative methods, namely Dependent Dual Space Reduction and Independent Dual Space Reduction, to reduce dimensions in the dual spaces, i.e., the feature and label spaces, using Singular Value Decomposition (SVD). The first approach constructs the cross-covariance matrix to represent dependency between the features and labels, projects both of them into a single reduced space, and then performs prediction on the reduced space. On the other hand, the second approach handles the feature space and the label space separately by constructing a covariance matrix for each space to represent feature dependency and label dependency before performing SVD on dependency profile of each space to reduce dimension and for noise elimination and then predicting classes using their reduced dimensions. A number of experiments show that prediction on the reduced spaces for both dependent and independent reduction approaches can obtain better classification accuracy as well as faster computation, compared to the prediction using the original spaces. Finally, the results of our methods are also compared to the prediction using other types of dimensionality reduction, with consideration of the factors of covariance and threshold selection.

[1]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[2]  Robert E. Schapire,et al.  Hierarchical multi-label prediction of gene function , 2006, Bioinform..

[3]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[4]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[5]  David A. Forsyth,et al.  Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary , 2002, ECCV.

[6]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[7]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[8]  Grigorios Tsoumakas,et al.  Multi-Label Classification of Music into Emotions , 2008, ISMIR.

[9]  James T. Kwok,et al.  MultiLabel Classification on Tree- and DAG-Structured Hierarchies , 2011, ICML.

[10]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[11]  Hsuan-Tien Lin,et al.  Multilabel Classification with Principal Label Space Transformation , 2012, Neural Computation.

[12]  Amanda Clare,et al.  Knowledge Discovery in Multi-label Phenotype Data , 2001, PKDD.

[13]  Chih-Jen Lin,et al.  A Study on Threshold Selection for Multi-label Classification , 2007 .

[14]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[15]  Thanaruk Theeramunkong,et al.  An Empirical Study on Multi-dimensional Sentiment Analysis from User Service Reviews , 2012, 2012 Seventh International Conference on Knowledge, Information and Creativity Support Systems.

[16]  Eyke Hüllermeier,et al.  On label dependence in multilabel classification , 2010, ICML 2010.

[17]  Zhi-Hua Zhou,et al.  Multilabel dimensionality reduction via dependence maximization , 2008, TKDD.

[18]  Yuhong Guo,et al.  Multi-Label Classification Using Conditional Dependency Networks , 2011, IJCAI.

[19]  John Langford,et al.  Multi-Label Prediction via Compressed Sensing , 2009, NIPS.

[20]  Víctor Robles,et al.  Feature selection for multi-label naive Bayes classification , 2009, Inf. Sci..

[21]  Kun Zhang,et al.  Multi-label learning by exploiting label dependency , 2010, KDD.

[22]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[23]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[24]  Geoff Holmes,et al.  Multi-label Classification Using Ensembles of Pruned Sets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[25]  Grigorios Tsoumakas,et al.  Multilabel Text Classification for Automated Tag Suggestion , 2008 .

[26]  Yueting Zhuang,et al.  Multi-Task Sparse Discriminant Analysis (MtSDA) with Overlapping Categories , 2010, AAAI.

[27]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[28]  Everton Alvares Cherman,et al.  Incorporating label dependency into the binary relevance framework for multi-label classification , 2012, Expert Syst. Appl..

[29]  Volker Tresp,et al.  Multi-label informed latent semantic indexing , 2005, SIGIR '05.

[30]  Grigorios Tsoumakas,et al.  A Triple-Random Ensemble Classification Method for Mining Multi-label Data , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[31]  C. Q. Lee,et al.  The Computer Journal , 1958, Nature.

[32]  Sunita Sarawagi,et al.  Discriminative Methods for Multi-labeled Classification , 2004, PAKDD.