Generalizing Pedestrian Detector to Low Resolution Images by Integrating High and Low Resolution Features

Detecting low resolution (low-res) pedestrians is rather important for automotive driving systems as it can provide enough time for decision making. However, many detectors perform quite well on high resolution (high-res) pedestrians, while perform poorly on low resolution ones. In this paper, to generalize the high performance of high-res pedestrians detectors to low-res pedestrians ones, we propose to integrate the high and low resolution features, the resolution-aware transformations are learned simultaneously to map the high and low resolution pedestrians to a common space and the shared detector in this space can be learned based on the linear SVM model. Meanwhile, a Maximum Mean Discrepancy (MMD) is considered to minimize the distribution mismatch of the transformed high and low resolution features. Then the shared detector and the transformations are learned jointly by minimizing both the distribution mismatch between the mapped high and low resolution samples and structural risk functional. In addition, we show that our proposed learning framework can be converted to a standard Multiple Kernel Learning (MKL) problem, which is convex and hence the global solution can be guaranteed. Experiments on the Caltech Pedestrian Benchmark show that our method is effective for the low resolution pedestrian detection tasks.

[1]  Subhransu Maji,et al.  Classification using intersection kernel support vector machines is efficient , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Piotr Dollár,et al.  Crosstalk Cascades for Frame-Rate Pedestrian Detection , 2012, ECCV.

[3]  Zaïd Harchaoui,et al.  Lifted coordinate descent for learning with trace-norm regularization , 2012, AISTATS.

[4]  Yves Grandvalet,et al.  Y.: SimpleMKL , 2008 .

[5]  Bernt Schiele,et al.  New features and insights for pedestrian detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Sebastian Nowozin,et al.  Infinite Kernel Learning , 2008, NIPS 2008.

[7]  Ivor W. Tsang,et al.  Learning With Augmented Features for Supervised and Semi-Supervised Heterogeneous Domain Adaptation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Luc Van Gool,et al.  Pedestrian detection at 100 frames per second , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Derek Hoiem,et al.  Diagnosing Error in Object Detectors , 2012, ECCV.

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[13]  Shengcai Liao,et al.  Robust Multi-resolution Pedestrian Detection in Traffic Scenes , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[16]  M. Kloft,et al.  l p -Norm Multiple Kernel Learning , 2011 .

[17]  Gene H. Golub,et al.  An Inverse Free Preconditioned Krylov Subspace Method for Symmetric Generalized Eigenvalue Problems , 2002, SIAM J. Sci. Comput..

[18]  Patrick J. Flynn,et al.  Multidimensional Scaling for Matching Low-Resolution Face Images , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Ramakant Nevatia,et al.  High performance object detection by collaborative learning of Joint Ranking of Granules features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  Larry S. Davis,et al.  Human detection using partial least squares analysis , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[21]  Charless C. Fowlkes,et al.  Multiresolution Models for Object Detection , 2010, ECCV.

[22]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Hans-Peter Kriegel,et al.  Integrating structured biological data by Kernel Maximum Mean Discrepancy , 2006, ISMB.

[24]  Alexander Zien,et al.  lp-Norm Multiple Kernel Learning , 2011, J. Mach. Learn. Res..

[25]  Pietro Perona,et al.  The Fastest Pedestrian Detector in the West , 2010, BMVC.

[26]  David A. McAllester,et al.  Cascade object detection with deformable part models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[27]  Shiguang Shan,et al.  Low-Resolution Face Recognition via Coupled Locality Preserving Mappings , 2010, IEEE Signal Processing Letters.

[28]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.