Extended Joint Deep Learning for Pedestrian Detection

In this paper, we propose an extended version of Unified Deep Network (UDN). The Extended UDN (EUDN) uses multiple deformation models that operate independently of each other and mixture of the responses of the models to estimate the detection label. The deformation models of the EUDN jointly learned in order to complement each other through penalized in-diversity loss measured from the average correlation between the models. In our experiments, we show that combining independently the deformation models (which are even if worse than existing one) reduces the error in the manner similar to the ensemble learning, and considering diversity of the individual models is more effective without considering diversity. Our approach is evaluated on the Caltech datasets and achieves better performance than the UDN.

[1]  A. Sharkey Linear and Order Statistics Combiners for Pattern Classification , 1999 .

[2]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Xiaogang Wang,et al.  Joint Deep Learning for Pedestrian Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[4]  Yann LeCun,et al.  Pedestrian Detection with Unsupervised Multi-stage Feature Learning , 2012, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.