论文信息 - A Novel Uncertainty Sampling Algorithm for Cost-Sensitive Multiclass Active Learning

A Novel Uncertainty Sampling Algorithm for Cost-Sensitive Multiclass Active Learning

Active learning is a setup that allows the learning algorithm to iteratively and strategically query the labels of some instances for reducing human labeling efforts. One fundamental strategy, called uncertainty sampling, measures the uncertainty of each instance when making querying decisions. Traditional active learning algorithms focus on binary or multiclass classification, but few works have studied active learning for cost-sensitive multiclass classification (CSMCC), which allows charging different costs for different types of misclassification errors. The few works are generally based on calculating the uncertainty of each instance by probability estimation, and can suffer from the inaccuracy of the estimation. In this paper, we propose a novel active learning algorithm that relies on a different way of calculating the uncertainty. The algorithm is based on our newly-proposed cost embedding approach (CE) for CSMCC. CE embeds the cost information in the distance measure of a special hidden space with non-metric multidimensional scaling, and deals with both symmetric and asymmetric cost information by our carefully designed mirroring trick. The embedding allows the proposed algorithm, active learning with cost embedding (ALCE), to define a cost-sensitive uncertainty measure from the distance in the hidden space. Extensive experimental results demonstrate that ALCE selects more useful instances by taking the cost information into account through the embedding and is superior to existing cost-sensitive active learning algorithms.

Hsuan-Tien Lin | Kuan-Hao Huang

[1] Bo Zhang,et al. Entropy-based active learning with support vector machines for content-based image retrieval , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[2] Xiaowei Xu,et al. Representative Sampling for Text Classification Using Support Vector Machines , 2003, ECIR.

[3] Burr Settles,et al. Active Learning Literature Survey , 2009 .

[4] Thomas P. Hayes,et al. Error limiting reductions between classification tasks , 2005, ICML.

[5] Daphne Koller,et al. Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[6] Hsuan-Tien Lin,et al. Active Learning for Multiclass Cost-Sensitive Classification Using Probabilistic Models , 2013, 2013 Conference on Technologies and Applications of Artificial Intelligence.

[7] Rong Jin,et al. Active Learning by Querying Informative and Representative Examples , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8] Pedro M. Domingos. MetaCost: a general method for making classifiers cost-sensitive , 1999, KDD '99.

[9] Mark Craven,et al. An Analysis of Active Learning Strategies for Sequence Labeling Tasks , 2008, EMNLP.

[10] William A. Gale,et al. A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[11] W. Torgerson. Multidimensional scaling: I. Theory and method , 1952 .

[12] Hsuan-Tien Lin,et al. One-sided Support Vector Regression for Multiclass Cost-sensitive Classification , 2010, ICML.

[13] J. Kruskal. Nonmetric multidimensional scaling: A numerical method , 1964 .

[14] Chih-Jen Lin,et al. Probability Estimates for Multi-class Classification by Pairwise Coupling , 2003, J. Mach. Learn. Res..

[15] Chun-Liang Li,et al. Active Learning Using Hint Information , 2015, Neural Computation.

[16] Andrew McCallum,et al. Reducing Labeling Effort for Structured Prediction Tasks , 2005, AAAI.

[17] Alekh Agarwal,et al. Selective sampling algorithms for cost-sensitive multiclass prediction , 2013, ICML.

[18] John Langford,et al. An iterative method for multi-class cost-sensitive learning , 2004, KDD.

[19] Stefan Wrobel,et al. Active Hidden Markov Models for Information Extraction , 2001, IDA.

[20] Vladimir Vapnik,et al. Statistical learning theory , 1998 .

[21] J. Leeuw. Applications of Convex Analysis to Multidimensional Scaling , 2000 .

[22] Sanjoy Dasgupta,et al. Hierarchical sampling for active learning , 2008, ICML '08.

[23] John Langford,et al. Error-Correcting Tournaments , 2009, ALT.