论文信息 - Semi-Supervised Self-Training of Object Detection Models

Semi-Supervised Self-Training of Object Detection Models

The construction of appearance-based object detection systems is time-consuming and difficult because a large number of training examples must be collected and manually labeled in order to capture variations in object appearance. Semi-supervised training is a means for reducing the effort needed to prepare the training set by training the model with a small number of fully labeled examples and an additional set of unlabeled or weakly labeled examples. In this work we present a semi-supervised approach to training object detection systems based on self-training. We implement our approach as a wrapper around the training process of an existing object detector and present empirical results. The key contributions of this empirical study is to demonstrate that a model trained in this manner can achieve results comparable to a model trained in the traditional manner using a much larger set of fully labeled data, and that a training data selection metric that is defined independently of the detector greatly outperforms a selection metric based on the detection confidence generated by the detector.

[1] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[2] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[3] Sebastian Thrun,et al. Learning to Classify Text from Labeled and Unlabeled Documents , 1998, AAAI/IAAI.

[4] Shumeet Baluja,et al. Probabilistic Modeling for Face Orientation Discrimination: Learning from Labeled and Unlabeled Data , 1998, NIPS.

[5] Thorsten Joachims,et al. Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[6] Pietro Perona,et al. Unsupervised Learning of Models for Recognition , 2000, ECCV.

[7] Rayid Ghani,et al. Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[8] Andrew McCallum,et al. Efficient clustering of high-dimensional data sets with application to reference matching , 2000, KDD '00.

[9] Avrim Blum,et al. Learning from Labeled and Unlabeled Data using Graph Mincuts , 2001, ICML.

[10] Paul A. Viola,et al. Robust Real-time Object Detection , 2001 .

[11] Tom M. Mitchell,et al. Using unlabeled data to improve text classification , 2001 .

[12] Tommi S. Jaakkola,et al. Partially labeled classification with Markov random walks , 2001, NIPS.

[13] Andrea Salgian,et al. Minimally supervised acquisition of 3D recognition models from cluttered images , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[14] Tommi S. Jaakkola,et al. Information Regularization with Partially Labeled Data , 2002, NIPS.

[15] Zoubin Ghahramani,et al. Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[16] Paul A. Viola,et al. Unsupervised improvement of visual detectors using cotraining , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[17] Pietro Perona,et al. Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[18] Adrian Corduneanu,et al. On Information Regularization , 2002, UAI.

[19] Pietro Perona,et al. A Bayesian approach to unsupervised one-shot learning of object categories , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[20] Martial Hebert,et al. Semi-supervised training of models for appearance-based statistical object detection methods , 2004 .

[21] Henry Schneiderman,et al. Learning a restricted Bayesian network for object detection , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[22] H. Schneiderman. Feature-centric evaluation for efficient cascaded object detection , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[23] Paul A. Viola,et al. Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[24] Bernt Schiele,et al. Recognition without Correspondence using Multidimensional Receptive Field Histograms , 2004, International Journal of Computer Vision.