Neighbor selection for multilabel classification

Abstract k NN is extensively studied for multilabel classification in the literature. Several k NN-based multilabel learning algorithms have been witnessed during the past years. They usually take k NN as their base classifiers to construct classification models, and then predict the class labels by virtue of Bayesian or majority rules. In this paper, a nearest neighbor selection for multilabel classification is proposed. Specifically, the target labels of new data are predicted with the help of those relevant and reliable data, which explored by the concept of shelly nearest neighbor. For effectiveness, the certainty factor is further adopted to well address the problem of unbalanced and uncertain data. The comparison experiments with eleven popular multilabel classifiers are conducted on ten benchmark data sets. The experimental results show that the performance of the proposed method is competitive and outperforms the popular multilabel classifiers in most cases.

[1]  Honggang Zhang,et al.  Multi-label learning with prior knowledge for facial expression analysis , 2015, Neurocomputing.

[2]  Hui Xiong,et al.  Capturing correlations of multiple labels: A generative probabilistic model for multi-label learning , 2012, Neurocomputing.

[3]  Eyke Hüllermeier,et al.  Dependent binary relevance models for multi-label classification , 2014, Pattern Recognit..

[4]  Shiliang Sun,et al.  An adaptive k-nearest neighbor algorithm , 2010, 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery.

[5]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[6]  Eyke Hüllermeier,et al.  Combining Instance-Based Learning and Logistic Regression for Multilabel Classification , 2009, ECML/PKDD.

[7]  Xiao Li,et al.  Multi-label learning with discriminative features for each label , 2015, Neurocomputing.

[8]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[9]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[10]  Francisco Charte,et al.  Addressing imbalance in multilabel classification: Measures and random resampling algorithms , 2015, Neurocomputing.

[11]  Juan José Rodríguez Diez,et al.  A weighted voting framework for classifiers ensembles , 2012, Knowledge and Information Systems.

[12]  Edward H. Shortliffe,et al.  A model of inexact reasoning in medicine , 1990 .

[13]  Craig A. Knoblock,et al.  A Survey of Digital Map Processing Techniques , 2014, ACM Comput. Surv..

[14]  Grigorios Tsoumakas,et al.  Dynamic ensemble pruning based on multi-label classification , 2015, Neurocomputing.

[15]  Shichao Zhang,et al.  Penalized partial least square discriminant analysis with l1 for multi-label data , 2015, Pattern Recognit..

[16]  Liang Tao,et al.  A least squares formulation of multi-label linear discriminant analysis , 2015, Neurocomputing.

[17]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[18]  Shichao Zhang,et al.  Shell-neighbor method and its application in missing data imputation , 2011, Applied Intelligence.

[19]  Juan Ramón Rico-Juan,et al.  Improving kNN multi-label classification in Prototype Selection scenarios using class proposals , 2015, Pattern Recognit..

[20]  Eyke Hüllermeier,et al.  Label ranking by learning pairwise preferences , 2008, Artif. Intell..

[21]  Xindong Wu,et al.  MLSLR: Multilabel Learning via Sparse Logistic Regression , 2014, Inf. Sci..

[22]  Jianhua Xu,et al.  Multi-label core vector machine with a zero label , 2014, Pattern Recognit..

[23]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[24]  Xiaoling Li,et al.  A survey of queries over uncertain data , 2013, Knowledge and Information Systems.

[25]  Francisco Charte,et al.  LI-MLC: A Label Inference Methodology for Addressing High Dimensionality in the Label Space for Multilabel Classification , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[26]  David G. Stork,et al.  Pattern Classification , 1973 .

[27]  Shiliang Sun,et al.  Local within-class accuracies for weighting individual outputs in multiple classifier systems , 2010, Pattern Recognit. Lett..

[28]  Sebastián Ventura,et al.  A Tutorial on Multilabel Learning , 2015, ACM Comput. Surv..

[29]  Jun Wang,et al.  Enhancing multi-label classification by modeling dependencies among labels , 2014, Pattern Recognit..

[30]  Eyke Hüllermeier,et al.  Multilabel classification via calibrated label ranking , 2008, Machine Learning.

[31]  Michel Verleysen,et al.  Mutual information-based feature selection for multilabel classification , 2013, Neurocomputing.

[32]  Amanda Clare,et al.  Knowledge Discovery in Multi-label Phenotype Data , 2001, PKDD.

[33]  Min Wu,et al.  Multi-label ensemble based on variable pairwise constraint projection , 2013, Inf. Sci..

[34]  Grigorios Tsoumakas,et al.  Correlation-Based Pruning of Stacked Binary Relevance Models for Multi-Label Learning , 2009 .

[35]  Fernando Benites,et al.  Multi-label classification and extracting predicted class hierarchies , 2011, Pattern Recognit..

[36]  Zhihua Wei,et al.  Semi-supervised multi-label image classification based on nearest neighbor editing , 2013, Neurocomputing.

[37]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[38]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[39]  Sebastián Ventura,et al.  Scalable extensions of the ReliefF algorithm for weighting and selecting features on the multi-label learning context , 2015, Neurocomputing.