SVM-based fast pedestrian recognition using a hierarchical codebook of local features

The performance of an object recognition system depends on both object representation and classification algorithms. On the one hand, Object representation by using local descriptors have become a very powerful representation of images. On the other hand, SVM has shown impressive learning and recognition performances. In this paper, we present a method for fast pedestrian classification by combining a SVM with a hierarchical codebook of local features augmented with reliable global features. When compared to SVM based on local matching kernels, our method provides significant improvement of recognition performances with a speed up in learning and classification time. We evaluate our approach on a set of far-infrared images where pedestrians occur at different scales and in difficult recognition situations. The experiment shows that our method performs a fast and reliable pedestrian recognition system.

[1]  Bernt Schiele,et al.  Learning semantic object parts for object categorization , 2008, Image Vis. Comput..

[2]  Bernt Schiele,et al.  Multiple Object Class Detection with a Generative Model , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[5]  Barbara Caputo,et al.  Recognition with local features: the kernel recipe , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[6]  Alexandrina Rogozan,et al.  SVM-based obstacle classification in visible and infrared images , 2009, 2009 17th European Signal Processing Conference.

[7]  D.J. Miller,et al.  Combined Generative-Discriminative Learning for Object Recognition using Local Image Descriptors , 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[8]  Bernt Schiele,et al.  Integrating representative and discriminant models for object category detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[9]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[10]  B. Schiele,et al.  Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[11]  Trevor Darrell,et al.  The Pyramid Match Kernel: Efficient Learning with Sets of Features , 2007, J. Mach. Learn. Res..

[12]  Christoph H. Lampert,et al.  Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[14]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[15]  Michael Arens,et al.  Feature based person detection beyond the visible spectrum , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[16]  Massimo Bertozzi,et al.  Pedestrian detection by means of far-infrared stereo vision , 2007, Comput. Vis. Image Underst..

[17]  Zehang Sun,et al.  Monocular precrash vehicle detection: features and classifiers , 2006, IEEE Transactions on Image Processing.