Unsupervised Detector Adaptation by Joint Dataset Feature Learning

Object detection is an important step in automated scene understanding. Training state-of-the-art object detectors typically require manual annotation of training data which can be labor-intensive. In this paper, we propose a novel algorithm to automatically adapt a pedestrian detector trained on a generic image dataset to a video in an unsupervised way using joint dataset deep feature learning. Our approach does not require any background subtraction or tracking in the video. Experiments on two challenging video datasets show that our algorithm is effective and outperforms the state-of-the-art approach.

[1]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[2]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3]  Yuan Shi,et al.  Geodesic flow kernel for unsupervised domain adaptation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  James Martens,et al.  Deep learning via Hessian-free optimization , 2010, ICML.

[5]  Alexei A. Efros,et al.  Unbiased look at dataset bias , 2011, CVPR 2011.

[6]  Peter J. Bickel,et al.  Maximum Likelihood Estimation of Intrinsic Dimension , 2004, NIPS.

[7]  Fei-Fei Li,et al.  Shifting Weights: Adapting Object Detectors from Image to Video , 2012, NIPS.

[8]  Meng Wang,et al.  Transferring a generic pedestrian detector towards specific scenes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[10]  Quoc V. Le,et al.  On optimization methods for deep learning , 2011, ICML.

[11]  Rama Chellappa,et al.  Domain adaptation for object recognition: An unsupervised approach , 2011, 2011 International Conference on Computer Vision.