Paper Information - Discriminative Learning and Recognition of Image Set Classes Using Canonical Correlations

We address the problem of comparing sets of images for object recognition, where the sets may represent variations in an object's appearance due to changing camera pose and lighting conditions. Canonical correlations (also known as principal or canonical angles), which can be thought of as the angles between two d-dimensional subspaces, have recently attracted attention for image set matching. Canonical correlations offer many benefits in accuracy, efficiency, and robustness compared to the two main classical methods: parametric distribution-based and nonparametric sample-based matching of sets. Here, this is first demonstrated experimentally for reasonably sized data sets using existing methods that exploit canonical correlations. Motivated by their proven effectiveness, a novel discriminative learning method over sets is proposed for set classification. Specifically, inspired by classical linear discriminant analysis (LDA), we develop a linear discriminant function that maximizes the canonical correlations of within-class sets and minimizes the canonical correlations of between-class sets. Image sets transformed by the discriminant function are then compared by their canonical correlations. The classical orthogonal subspace method (OSM) is also investigated for a similar purpose and compared with the proposed method. The proposed method is evaluated on various object recognition problems using face image sets with arbitrary motion captured under different illuminations and image sets of 500 general objects taken at different views. The method is also applied to object category recognition using the ETH-80 database. The proposed method is shown to outperform the state-of-the-art methods in terms of accuracy and efficiency.
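As a rough illustration of the quantity the abstract builds on, the sketch below computes canonical correlations between two image sets as the singular values of the product of their orthonormal subspace bases (the cosines of the principal angles). It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `orthonormal_basis` and `canonical_correlations`, the choice of one vectorized image per column, the absence of mean-centering, and the subspace dimension `d` are all assumptions made for this example. The discriminant transformation proposed in the paper is not shown.

```python
import numpy as np


def orthonormal_basis(X, d):
    """Return a (D, d) orthonormal basis for the top-d principal directions of X.

    X is assumed to hold one vectorized image per column, shape (D, n).
    No mean-centering is applied (an assumption of this sketch).
    """
    U, _, _ = np.linalg.svd(X, full_matrices=False)
    return U[:, :d]


def canonical_correlations(X1, X2, d):
    """Cosines of the principal angles between the d-dimensional subspaces of two sets.

    The values are the singular values of P1^T P2, returned in descending order.
    """
    P1 = orthonormal_basis(X1, d)
    P2 = orthonormal_basis(X2, d)
    s = np.linalg.svd(P1.T @ P2, compute_uv=False)
    # Guard against tiny numerical overshoot outside [0, 1].
    return np.clip(s, 0.0, 1.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two hypothetical image sets: 30 images each, 100 pixels per image.
    set_a = rng.standard_normal((100, 30))
    set_b = rng.standard_normal((100, 30))
    print(canonical_correlations(set_a, set_b, d=5))
```

Values close to 1 indicate that the two sets span nearly the same directions, while values close to 0 indicate nearly orthogonal subspaces. A set-to-set similarity can then be formed from these values, for example by summing or averaging them.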
