A New Multi-label Learning Algorithm Using Shelly Neighbors

Since multi-label data is ubiquitous in reality, a promising study in data mining is multi-label learning. Facing with the multi-label data, traditional single-label learning methods are not competent for the classification tasks. This paper proposes a new lazy learning algorithm for the multi-label classification. The characteristic of our method is that it takes both binary relevance and shelly neighbors into account. Unlike k nearest neighbors, the shelly neighbors form a shell to surround a given instance. As a result, our method not only identifies more helpful neighbors for classification, but also exempts from the perplexity of choosing an optimal value for k in the lazy learning methods. The experiments carried out on five benchmark datasets demonstrate that the proposed approach outperforms standard lazy multi-label classification in most cases.

[1]  David G. Stork,et al.  Pattern Classification , 1973 .

[2]  Robert Meersman,et al.  On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE , 2003, Lecture Notes in Computer Science.

[3]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[4]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[5]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[6]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[7]  Eyke Hüllermeier,et al.  Multilabel classification via calibrated label ranking , 2008, Machine Learning.

[8]  Shichao Zhang,et al.  Shell-neighbor method and its application in missing data imputation , 2011, Applied Intelligence.

[9]  Yaxin Bi,et al.  KNN Model-Based Approach in Classification , 2003, OTM.

[10]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[11]  Lior Rokach,et al.  Data Mining and Knowledge Discovery Handbook, 2nd ed , 2010, Data Mining and Knowledge Discovery Handbook, 2nd ed..

[12]  Grigorios Tsoumakas,et al.  Correlation-Based Pruning of Stacked Binary Relevance Models for Multi-Label Learning , 2009 .

[13]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[14]  Eyke Hüllermeier,et al.  Label ranking by learning pairwise preferences , 2008, Artif. Intell..

[15]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[16]  Juho Rousu,et al.  Kernel-Based Learning of Hierarchical Multilabel Classification Models , 2006, J. Mach. Learn. Res..

[17]  Grigorios Tsoumakas,et al.  Multi-Label Classification of Music into Emotions , 2008, ISMIR.

[18]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[19]  George A. Vouros,et al.  Artificial Intelligence: Theories, Models and Applications, 5th Hellenic Conference on AI, SETN 2008, Syros, Greece, October 2-4, 2008. Proceedings , 2008, SETN.

[20]  Grigorios Tsoumakas,et al.  An Empirical Study of Lazy Multilabel Classification Algorithms , 2008, SETN.