Paper Information - Reconstructing 3D Human Pose from 2D Image Landmarks

Reconstructing 3D Human Pose from 2D Image Landmarks

Reconstructing an arbitrary configuration of 3D points from their projection in an image is an ill-posed problem. When the points hold semantic meaning, such as anatomical landmarks on a body, human observers can often infer a plausible 3D configuration, drawing on extensive visual memory. We present an activity-independent method to recover the 3D configuration of a human figure from 2D locations of anatomical landmarks in a single image, leveraging a large motion capture corpus as a proxy for visual memory. Our method solves for anthropometrically regular body pose and explicitly estimates the camera via a matching pursuit algorithm operating on the image projections. Anthropometric regularity (i.e., that limbs obey known proportions) is a highly informative prior, but directly applying such constraints is intractable. Instead, we enforce a necessary condition on the sum of squared limb lengths that can be solved for in closed form to discourage implausible configurations in 3D. We evaluate performance on a wide variety of human poses captured from different viewpoints and show generalization to novel 3D configurations and robustness to missing data.
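The distinction between full anthropometric regularity and the relaxed condition on the sum of squared limb lengths can be illustrated with a toy example. The following is a minimal sketch, not the paper's solver: the 4-joint chain, the reference lengths, and all function names are hypothetical and chosen only for illustration. It shows that a pose with correct limb proportions always satisfies the sum condition, while a pose with distorted proportions may satisfy it too, which is why the condition is necessary but not sufficient.

```python
from math import isclose

# Hypothetical 4-joint chain (hip, knee, ankle, toe); not the paper's skeleton.
LIMBS = [(0, 1), (1, 2), (2, 3)]          # pairs of joint indices
REFERENCE_LENGTHS = [0.45, 0.45, 0.20]    # assumed limb lengths in metres


def squared_limb_lengths(pose, limbs=LIMBS):
    """Return the squared Euclidean length of each limb in a 3D pose."""
    return [
        sum((a - b) ** 2 for a, b in zip(pose[i], pose[j]))
        for i, j in limbs
    ]


def satisfies_sum_condition(pose, ref_lengths=REFERENCE_LENGTHS, tol=1e-6):
    """Relaxed check: the total squared limb length matches the reference total."""
    target = sum(length ** 2 for length in ref_lengths)
    return isclose(sum(squared_limb_lengths(pose)), target, abs_tol=tol)


def is_regular(pose, ref_lengths=REFERENCE_LENGTHS, tol=1e-6):
    """Full check: every limb individually matches its reference length."""
    return all(
        isclose(sq, length ** 2, abs_tol=tol)
        for sq, length in zip(squared_limb_lengths(pose), ref_lengths)
    )


# A straight leg with the reference proportions.
regular_pose = [(0, 0, 0), (0, -0.45, 0), (0, -0.90, 0), (0.20, -0.90, 0)]

# Stretch the thigh to 0.55 and shrink the shin so that the squared lengths
# still add up to the same total: 0.55**2 + shin**2 == 2 * 0.45**2.
shin = (2 * 0.45 ** 2 - 0.55 ** 2) ** 0.5
irregular_pose = [
    (0, 0, 0),
    (0, -0.55, 0),
    (0, -0.55 - shin, 0),
    (0.20, -0.55 - shin, 0),
]

for name, pose in [("regular", regular_pose), ("irregular", irregular_pose)]:
    print(name, satisfies_sum_condition(pose), is_regular(pose))
```

Both poses pass the sum check, but only the first passes the per-limb check. The appeal of the relaxed condition, as the abstract describes it, is that it admits a closed-form solution, whereas enforcing each limb length directly is intractable.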

T. Kanade | Yaser Sheikh | V. Ramakrishna
