Video pose estimation with global motion cues

This paper addresses the problem of pose estimation in video sequences in which human pose changes drastically over time. Popular strategies for video pose estimation first yield multiple pose candidates for each frame and then achieve consistent pose estimation by enforcing temporal constraints across frames. To enrich pose candidates, previous methods typically employ local motion cues to propagate pose detections to adjacent frames. Reasonable pose proposals can be achieved only when the local motion estimation is accurate and good detections exist among adjacent frames, both of which are hard to be satisfied under drastic human pose changes. In this paper, we propose to propagate pose detections to entire video sequence through global motion cues which provide a long term holistic non-rigid motion transformation for the given video. We exploit the temporal continuity of both single parts and part pairs in the inference over a spatio-temporal model to stitch the reasonable trajectory fragments for each part and obtain the final pose estimation. Experimental results demonstrate remarkable performance improvement in comparison with the state-of-the-art methods.

[1]  Deva Ramanan,et al.  N-best maximal decoders for part models , 2011, 2011 International Conference on Computer Vision.

[2]  Ben Taskar,et al.  Sidestepping Intractable Inference with Structured Ensemble Cascades , 2010, NIPS.

[3]  Varun Ramakrishna,et al.  Tracking Human Pose by Tracking Symmetric Parts , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Silvio Savarese,et al.  Breaking the Chain: Liberation from the Temporal Markov Assumption for Tracking Human Poses , 2013, 2013 IEEE International Conference on Computer Vision.

[5]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[6]  Christian Szegedy,et al.  DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Luc Van Gool,et al.  Does Human Action Recognition Benefit from Pose Estimation? , 2011, BMVC.

[8]  Alan L. Yuille,et al.  Articulated Pose Estimation by a Graphical Model with Image Dependent Pairwise Relations , 2014, NIPS.

[9]  Yuandong Tian,et al.  Exploring the Spatial Hierarchy of Mixture Models for Human Pose Estimation , 2012, ECCV.

[10]  Varun Ramakrishna,et al.  Pose Machines: Articulated Pose Estimation via Inference Machines , 2014, ECCV.

[11]  Bernt Schiele,et al.  Pictorial structures revisited: People detection and articulated pose estimation , 2009, CVPR.

[12]  Huijun Di,et al.  A Mixture of Transformed Hidden Markov Models for Elastic Motion Estimation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Andrew Zisserman,et al.  2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images , 2012, International Journal of Computer Vision.

[14]  Yao Lu,et al.  Human pose estimation with global motion cues , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[15]  Jitendra Malik,et al.  Large displacement optical flow , 2009, CVPR.

[16]  Ben Taskar,et al.  Parsing human motion with stretchable models , 2011, CVPR 2011.

[17]  Deva Ramanan,et al.  Learning to parse images of articulated bodies , 2006, NIPS.

[18]  Yi Yang,et al.  Unsupervised Video Adaptation for Parsing Human Motion , 2014, ECCV.

[19]  Jitendra Malik,et al.  Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Daniel P. Huttenlocher,et al.  Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[21]  Ben Taskar,et al.  Adaptive pose priors for pictorial structures , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Jun Yu,et al.  Human pose recovery by supervised spectral embedding , 2015, Neurocomputing.

[23]  Cordelia Schmid,et al.  Estimating Human Pose with Flowing Puppets , 2013, 2013 IEEE International Conference on Computer Vision.

[24]  Brendan J. Frey,et al.  A comparison of algorithms for inference and learning in probabilistic graphical models , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Yi Yang,et al.  Articulated Human Detection with Flexible Mixtures of Parts , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Limin Wang,et al.  Video Action Detection with Relational Dynamic-Poselets , 2014, ECCV.

[27]  Xuelong Li,et al.  Tracking Human Pose Using Max-Margin Markov Models , 2015, IEEE Transactions on Image Processing.

[28]  Sri Adi Widodo,et al.  ANALISIS FAKTOR TINGKAT KECEMASAN, MOTIVASI DAN PRESTASI BELAJAR MAHASISWA , 2017 .

[29]  Cordelia Schmid,et al.  Action recognition by dense trajectories , 2011, CVPR 2011.

[30]  Cordelia Schmid,et al.  Mixing Body-Part Sequences for Human Pose Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Hao Jiang,et al.  Human pose estimation using consistent max-covering , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[32]  Sheng Tang,et al.  Robust human body segmentation based on part appearance and spatial constraint , 2013, Neurocomputing.

[33]  Ahmed Taha,et al.  ON BEHAVIOR ANALYSIS IN VIDEO SURVEILLANCE , 2013 .

[34]  Peter V. Gehler,et al.  Strong Appearance and Expressive Spatial Models for Human Pose Estimation , 2013, 2013 IEEE International Conference on Computer Vision.

[35]  Qingmin Liao,et al.  Depth-images-based pose estimation using regression forests and graphical models , 2015, Neurocomputing.

[36]  Marc Pollefeys,et al.  Foreground Consistent Human Pose Estimation Using Branch and Bound , 2014, ECCV.

[37]  Andrew Zisserman,et al.  Progressive search space reduction for human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Jianming Zhang,et al.  Attribute-based knowledge transfer learning for human pose estimation , 2013, Neurocomputing.

[39]  Alan L. Yuille,et al.  An Approach to Pose-Based Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Alessio Del Bue,et al.  Human behavior analysis in video surveillance: A Social Signal Processing perspective , 2013, Neurocomputing.

[41]  Katerina Fragkiadaki,et al.  Pose from Flow and Flow from Pose , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[42]  Qingmin Liao,et al.  Latent variable pictorial structure for human pose estimation on depth images , 2016, Neurocomputing.

[43]  Peter V. Gehler,et al.  Human Pose Estimation with Fields of Parts , 2014, ECCV.