Robust Person Detection by Classifier Cubes and Local Verification ∗

Classifier grids have shown to be an alternative to sliding window approaches for object detection from static cameras. However, existing approaches neglected two essential points: (a) temporal information is not used and (b) a standard non-maxima suppression is applied as post-processing step. Thus, the contribution of this paper is twofold. First, we introduce classifier cubes, which exploit the available temporal information within a classifier grid by adapting the local detection likelihood based on preceded detections. Second, we introduce a more sophisticated post-processing step to verify detection hypotheses by comparing a local figure/ground segmentation to a provided prototype model. Experiments on publicly available data demonstrate that both extensions improve the detection performance.

[1]  Horst Bischof,et al.  On robustness of on-line boosting - a competitive study , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[2]  Toby Sharp,et al.  Image segmentation with a bounding box prior , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[3]  H. Grabner,et al.  Classifier grids for robust adaptive object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Luc Van Gool,et al.  Exploring context to learn scene specific object detectors , 2009 .

[5]  Gang Song,et al.  Object Detection Combining Recognition and Segmentation , 2007, ACCV.

[6]  Deva Ramanan,et al.  Using Segmentation to Verify Object Hypotheses , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Shai Avidan,et al.  Ensemble Tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  H. Grabner,et al.  Is Pedestrian Detection Really a Hard Task ? ∗ , 2007 .

[9]  Ramakant Nevatia,et al.  Detection and Tracking of Multiple, Partially Occluded Humans by Bayesian Combination of Edgelet based Part Detectors , 2007, International Journal of Computer Vision.

[10]  Pushmeet Kohli,et al.  OBJCUT for Face Detection , 2006, ICVGIP.

[11]  Horst Bischof,et al.  On-line Boosting and Vision , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Larry S. Davis,et al.  Closely coupled object detection and segmentation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[13]  Jianbo Shi,et al.  Spectral segmentation with multiscale graph decomposition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Chunming Li,et al.  Level set evolution without re-initialization: a new variational formulation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Zhuowen Tu,et al.  Image Parsing: Unifying Segmentation, Detection, and Recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[16]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..