Multiview emotion recognition via multi-set locality preserving canonical correlation analysis

In this paper, we propose a novel Multi-set Locality-Preserving Canonical Correlation Analysis (MLPCCA) for multi-view learning and fusion. The proposed MLPCCA captures the intrinsic structure of data while it learns the optimum basis for maximizing the correlation among different sets of data. To verify the effectiveness of the proposed technique, the proposed MLPC A has been applied in audio-based emotion recognition and visual-based emotion recognition, respectively. The experimental results demonstrated that the proposed MLPCCA can achieve a higher recognition accuracy compared to the existing methods including CCA, LPCCA, and MCCA.

[1]  David G. Stork,et al.  Pattern Classification , 1973 .

[2]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[3]  Colin Fyfe,et al.  Kernel and Nonlinear Canonical Correlation Analysis , 2000, IJCNN.

[4]  Allan Aasbjerg Nielsen,et al.  Multiset canonical correlations analysis and multispectral, truly multitemporal remote sensing data , 2002, IEEE Trans. Image Process..

[5]  Songcan Chen,et al.  Locality preserving CCA with applications to data visualization and pose estimation , 2007, Image Vis. Comput..

[6]  S. Dharanipragada,et al.  Feature extraction for robust speech recognition , 2002, 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353).

[7]  Pheng-Ann Heng,et al.  Feature fusion method based on canonical correlation analysis and handwritten character recognition , 2004, ICARCV 2004 8th Control, Automation, Robotics and Vision Conference, 2004..

[8]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[9]  Ishwar K. Sethi,et al.  Multimedia content processing through cross-modal association , 2003, MULTIMEDIA '03.

[10]  Roman Rosipal,et al.  Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.

[11]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[12]  Yousef Saad,et al.  Orthogonal Neighborhood Preserving Projections: A Projection-Based Dimensionality Reduction Technique , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Ling Guan,et al.  Recognizing Human Emotional State From Audiovisual Signals , 2008, IEEE Transactions on Multimedia.

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.