Visually Similar K-poselets Based Human Pose Recognition

In the paper, we propose the visually similar k-poselets to recognize human poses (e.g., stoop, squat) in still images. Compared with the original k-poselets that are collected according to similar keypoints configurations, we further introduce appearance similarity constraints to generate visually similar k-poselets. The number of selected visually similar k-poselets for each pose category is iteratively decreased based on discriminative criterion. The pose dictionary, constructed with learned visually similar and discriminative k-poselets of different poses, is applied in pose recognition. The experimental results on our released human pose database verify the effectiveness of the proposed visually similar k-poselets based pose recognition method.

[1]  Alexei A. Efros,et al.  Ensemble of exemplar-SVMs for object detection and beyond , 2011, 2011 International Conference on Computer Vision.

[2]  Christian Szegedy,et al.  DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Xiaogang Wang,et al.  Learning Mid-level Filters for Person Re-identification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Alexei A. Efros,et al.  Unsupervised Discovery of Mid-Level Discriminative Patches , 2012, ECCV.

[5]  Václav Hlavác,et al.  Pose primitive based human action recognition in videos or still images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Yun Fu,et al.  Exploring discriminative pose sub-patterns for effective action classification , 2013, ACM Multimedia.

[7]  Jitendra Malik,et al.  Using k-Poselets for Detecting People and Localizing Their Keypoints , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Marc Pollefeys,et al.  Foreground Consistent Human Pose Estimation Using Branch and Bound , 2014, ECCV.

[9]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  Thorsten Joachims,et al.  Cutting-plane training of structural SVMs , 2009, Machine Learning.

[11]  Kang Zheng,et al.  Combining local appearance and holistic view: Dual-Source Deep Neural Networks for human pose estimation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Jean Meunier,et al.  Human posture recognition by combining silhouette and infrared cast shadows , 2015, 2015 International Conference on Image Processing Theory, Tools and Applications (IPTA).

[13]  Derek Hoiem,et al.  Learning Collections of Part Models for Object Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Ramakant Nevatia,et al.  Action recognition in cluttered dynamic scenes using Pose-Specific Part Models , 2011, 2011 International Conference on Computer Vision.

[15]  Larry S. Davis,et al.  Representing Videos Using Mid-level Discriminative Patches , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Sei-ichiro Kamata,et al.  Learning discriminative and shareable patches for scene classification , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Yannick Benezeth,et al.  Posture Recognition Based on Fuzzy Logic for Home Monitoring of the Elderly , 2012, IEEE Transactions on Information Technology in Biomedicine.