Uncertainty sampling methods for one-class classifiers

Selective sampling, a form of active learning, reduces the cost of labeling supplementary training data by requesting labels only for the most informative unlabeled examples. This additional information, added to an initial, randomly chosen training set, is expected to improve the generalization performance of a learning machine. We investigate methods for selecting the most informative examples in the context of one-class classification (OCC) problems, i.e. problems where only (or nearly only) examples of the so-called target class are available. We applied selective sampling algorithms to a variety of domains, including real-world problems: mine detection and texture segmentation. The goal of this paper is to show why the best or most commonly used selective sampling methods for two- or multi-class problems are not necessarily the best ones for one-class classification. By modifying the sampling methods, we present a way of selecting a small subset of the unlabeled data to be presented to an expert for labeling, such that the performance of the retrained one-class classifier is significantly improved.
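The querying step the abstract describes — scoring unlabeled pool examples and asking an expert to label the most informative ones — can be sketched as an uncertainty-sampling loop for a one-class classifier. This is a minimal illustration, not the paper's method: the `OneClassSVM` model, the synthetic Gaussian data, and the batch size are all assumptions made for the example, and "uncertainty" is taken to be closeness to the one-class decision boundary.

```python
import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)

# Illustrative data: a small labeled target-class training set and a
# larger unlabeled pool (both synthetic, not from the paper).
X_train = rng.normal(0.0, 1.0, size=(20, 2))
X_pool = rng.normal(0.0, 1.0, size=(200, 2))

# Fit a one-class classifier on target-class examples only.
clf = OneClassSVM(kernel="rbf", gamma=0.5, nu=0.1).fit(X_train)

# Uncertainty sampling adapted to OCC: query the pool points whose
# decision-function value is closest to the boundary (zero).
uncertainty = np.abs(clf.decision_function(X_pool))
query_idx = np.argsort(uncertainty)[:5]  # batch of 5 examples to label

print(query_idx)  # indices to present to the expert for labeling
```

In a full selective-sampling loop, the expert's labels for these queried examples would be added to the training set and the one-class classifier retrained, repeating until the labeling budget is exhausted.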