Improving Classification Accuracy by Comparing Local Features through Canonical Correlations

Classifying images using features extracted from densely sampled local patches has enjoyed significant success in many detection and recognition tasks. It is also well known that generally more than one type of feature is needed to achieve robust classification performance. Previous works using multiple features have addressed this issue either through simple concatenation of feature vectors or through combining feature specific kernels at the classifier level. In this work we introduce a novel approach for combining features at the feature level by projecting two types of features onto two respective subspaces in which they are maximally correlated. We use their correlation as an augmented feature and demonstrate improvement in classification accuracy over simple combination through concatenation in a pedestrian detection framework.

[1]  Christoph H. Lampert,et al.  Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Ankita Kumar,et al.  Support Kernel Machines for Object Recognition , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[3]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[5]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Pietro Perona,et al.  Multiple Component Learning for Object Detection , 2008, ECCV.

[8]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[9]  Yun Fu,et al.  Multiple feature fusion by subspace learning , 2008, CIVR '08.

[10]  Fatih Murat Porikli,et al.  Pedestrian Detection via Classification on Riemannian Manifolds , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.