Local Structure Preserving Using Manifold Regularization and Pairwise Constraints for Action Recognition

With the rapid development of Internet technology and smart devices, tremendous amounts of multimedia data (e.g. text, image, video, audio, etc.) are produced and uploaded online every day. Semi-supervised learning has been proved to be one effect and effective solution to manage the massive emerging multimedia, which usually leverages the performance by exploiting the local geometry of a small number of labelled and a large number of unlabeled samples. The representative local structure preserving methods include manifold regularization and pairwise constraints. In this paper, we propose a local structure preserving method that effectively integrates manifold regularization and pairwise constraints. Particularly, we construct a new graph Laplacian by combining the traditional Laplacian and pairwise constraints. The new graph Laplacian can better preserve the local geometry and then further boost the performance. Finally, we build new local structure preserving classifiers including kernel least squares and support vector machines. We conduct extensive experiments on Chinese Academy of Sciences - Yunnan University - Multimodal Human Action Database (CAS-YNU-MHAD) for action recognition, respectively. The experimental results demonstrate that the proposed algorithm outperforms the baseline algorithms.

[1]  Frédéric Jurie,et al.  PCCA: A new approach for distance learning from sparse pairwise constraints , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Tomer Hertz,et al.  Learning a Mahalanobis Metric from Equivalence Constraints , 2005, J. Mach. Learn. Res..

[3]  Yuan Yan Tang,et al.  High-Order Distance-Based Multiview Stochastic Learning in Image Classification , 2014, IEEE Transactions on Cybernetics.

[4]  Claire Cardie,et al.  Clustering with Instance-Level Constraints , 2000, AAAI/IAAI.

[5]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[6]  Daoqiang Zhang,et al.  Constraint Score: A new filter method for feature selection with pairwise constraints , 2008, Pattern Recognit..

[7]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[8]  D. Donoho,et al.  Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Hongjie Jia,et al.  Research of semi-supervised spectral clustering algorithm based on pairwise constraints , 2012, Neural Computing and Applications.

[10]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[11]  Meng Wang,et al.  Image-Based Three-Dimensional Human Pose Recovery by Multiview Locality-Sensitive Sparse Retrieval , 2015, IEEE Transactions on Industrial Electronics.

[12]  Hakan Cevikalp,et al.  Semi-Supervised Dimensionality Reduction Using Pairwise Equivalence Constraints , 2008, VISAPP.

[13]  Jun Yu,et al.  Click Prediction for Web Image Reranking Using Multimodal Sparse Coding , 2014, IEEE Transactions on Image Processing.

[14]  Weifeng Liu,et al.  Multiview Cauchy Estimator Feature Embedding for Depth and Inertial Sensor-Based Human Action Recognition , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[15]  Meng Wang,et al.  Multimodal Deep Autoencoder for Human Pose Recovery , 2015, IEEE Transactions on Image Processing.

[16]  Y. Rui,et al.  Learning to Rank Using User Clicks and Visual Features for Image Retrieval , 2015, IEEE Transactions on Cybernetics.

[17]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[18]  Jun Yu,et al.  Multi-view ensemble manifold regularization for 3D object recognition , 2015, Inf. Sci..

[19]  Jun Yu,et al.  Exploiting Click Constraints and Multi-view Features for Image Re-ranking , 2014, IEEE Transactions on Multimedia.