Image-Based Three-Dimensional Human Pose Recovery by Multiview Locality-Sensitive Sparse Retrieval

Image-based 3-D human pose recovery is usually performed by retrieving relevant poses using image features. However, this approach suffers from the high dimensionality of image features and the low efficiency of the retrieval process. For multiview data in particular, integrating different types of features is difficult. In this paper, a novel approach is proposed to recover 3-D human poses from silhouettes. The approach improves on traditional methods by adopting multiview locality-sensitive sparse coding in the retrieval process. First, it incorporates a local-similarity-preserving term into the sparse coding objective, which groups similar silhouettes and alleviates the instability of sparse codes. Second, the objective function of sparse coding is extended to integrate multiview data. Experimental results show that the retrieval error is reduced by 20% to 50%, which demonstrates the effectiveness of the proposed method.
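
The abstract does not state the objective function, so the following is only a minimal sketch of what a multiview, locality-weighted sparse code could look like, not the authors' formulation. It assumes a shared code s over a database of silhouette features ("atoms"), one dictionary per view, a view-weighted reconstruction error, and an L1 penalty whose per-atom weight grows with the atom's distance from the query. The function name, the exponential weighting, the parameters alphas, lam and sigma, and the ISTA solver are all illustrative assumptions.

```python
import numpy as np


def multiview_local_sparse_code(xs, Ds, alphas, lam=0.1, sigma=1.0, n_iter=300):
    """Hypothetical sketch: shared sparse code for one query seen in several views.

    xs     : list of query feature vectors, one per view (shape (d_v,))
    Ds     : list of dictionaries, one per view (shape (d_v, n)); column j of
             every dictionary describes the same database sample j
    alphas : list of non-negative view weights (assumed, not from the paper)

    Minimises  sum_v alphas[v] * 0.5 * ||xs[v] - Ds[v] @ s||^2
               + lam * sum_j w[j] * |s[j]|
    where w[j] penalises atoms that are far from the query (locality term).
    """
    # View-weighted distance from the query to every atom.
    dist = sum(a * np.linalg.norm(D - x[:, None], axis=0)
               for a, x, D in zip(alphas, xs, Ds))
    # Locality weights: the nearest atom gets weight 1, farther atoms more.
    w = np.exp((dist - dist.min()) / sigma)

    # Quadratic part of the objective: 0.5 * s^T H s - b^T s (+ const).
    H = sum(a * (D.T @ D) for a, D in zip(alphas, Ds))
    b = sum(a * (D.T @ x) for a, x, D in zip(alphas, xs, Ds))

    # ISTA: gradient step followed by weighted soft-thresholding.
    step = 1.0 / np.linalg.eigvalsh(H)[-1]  # 1 / largest eigenvalue of H
    s = np.zeros(H.shape[0])
    for _ in range(n_iter):
        z = s - step * (H @ s - b)
        s = np.sign(z) * np.maximum(np.abs(z) - step * lam * w, 0.0)
    return s


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two toy "views" with different feature dimensions and 8 database samples.
    Ds = []
    for d in (20, 15):
        D = rng.standard_normal((d, 8))
        Ds.append(D / np.linalg.norm(D, axis=0))  # unit-norm atoms
    xs = [D[:, 3].copy() for D in Ds]  # query identical to database sample 3
    code = multiview_local_sparse_code(xs, Ds, alphas=[0.5, 0.5])
    print(np.round(code, 3))
```

Under these assumptions, a pose estimate could then be formed from the 3-D poses attached to the atoms with nonzero coefficients, for example as a coefficient-weighted average. That retrieval step is likewise a guess at the general scheme rather than something the abstract specifies.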
