Small Target Detection combining Foreground and Background Manifolds

This paper focuses on the detection of small objects (e.g. vehicules in aerial images) on complex back-grounds (e.g. natural backgrounds). A key contribution of the paper is to show that, in such situations, learning a target model and a background model separately is better than training a unique discriminative model. This contrasts with standard object detection approaches for which objects vs. background classi fiers use the same types of visual features/models for both. The second contribution lies in the use of manifold learning approaches to build these models. The proposed detection algorithm is validated on the publicly available OIRDS dataset, on which we obtain state-of-the-art results.

[1]  Larry S. Davis,et al.  Vehicle Detection Using Partial Least Squares , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Andrew Zisserman,et al.  Sparse kernel approximations for efficient classification and detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[4]  M GavrilaDariu,et al.  Monocular Pedestrian Detection , 2009 .

[5]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[6]  Lawrence K. Saul,et al.  Think Globally, Fit Locally: Unsupervised Learning of Low Dimensional Manifold , 2003, J. Mach. Learn. Res..

[7]  Christoph H. Lampert,et al.  Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Yongsheng Gao,et al.  Parametric Manifold of an Object under Different Viewing Directions , 2012, ECCV.

[10]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[11]  Dariu Gavrila,et al.  An Experimental Study on Pedestrian Classification , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Thorsten Joachims,et al.  Making large-scale support vector machine learning practical , 1999 .

[13]  Peyman Milanfar,et al.  Visual saliency for automatic target detection, boundary detection, and image quality assessment , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Alex Pentland,et al.  Probabilistic Visual Learning for Object Representation , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[17]  M. Naderi Think globally... , 2004, HIV prevention plus!.

[18]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[19]  Raphaël Féraud,et al.  A Fast and Accurate Face Detector Based on Neural Networks , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  George D. C. Cavalcanti,et al.  A weighted image reconstruction based on PCA for pedestrian detection , 2011, The 2011 International Joint Conference on Neural Networks.

[22]  Veronica Carlan,et al.  Overhead imagery research data set — an annotated data library & tools to aid in the development of computer vision algorithms , 2009, 2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009).