论文信息 - Estimating 3D body pose using uncalibrated cameras

Estimating 3D body pose using uncalibrated cameras

An approach for estimating 3D body pose from multiple, uncalibrated views is proposed. First, a mapping from image features to 2D body joint locations is computed using a statistical framework that yields a set of several body pose hypotheses. The concept of a "virtual camera" is introduced that makes this mapping invariant to translation, image-plane rotation, and scaling of the input. As a consequence, the calibration matrices (intrinsics) of the virtual cameras can be considered completely known, and their poses are known up to a single angular displacement parameter Given pose hypotheses obtained in the multiple virtual camera views, the recovery of 3D body pose and camera relative orientations is formulated as a stochastic optimization problem. An Expectation-Maximization algorithm is derived that can obtain the locally most likely (self-consistent) combination of body pose hypotheses. Performance of the approach is evaluated with synthetic sequences as well as real video sequences of human motion.

[1] Hans-Hellmut Nagel,et al. Tracking Persons in Monocular Image Sequences , 1999, Comput. Vis. Image Underst..

[2] Camillo J. Taylor,et al. Reconstruction of Articulated Objects from Point Correspondences in a Single Uncalibrated Image , 2000, Comput. Vis. Image Underst..

[3] K. Rohr. Towards model-based recognition of human movements in image sequences , 1994 .

[4] H. Hartley. Maximum Likelihood Estimation from Incomplete Data , 1958 .

[5] Alex Pentland,et al. Recovery of non-rigid motion and structure , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6] Rómer Rosales,et al. Specialized mappings and the estimation of human body pose from a single image , 2000, Proceedings Workshop on Human Motion.

[7] Andrew Blake,et al. Articulated body motion capture by annealed particle filtering , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[8] Ioannis A. Kakadiaris,et al. Estimating anthropometry and pose from a single image , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[9] Ming-Kuei Hu,et al. Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[10] Rómer Rosales,et al. Inferring body pose without tracking body parts , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[11] Yoshiaki Shirai,et al. Three-Dimensional Computer Vision , 1987, Symbolic Computation.

[12] Matthew Brand,et al. Shadow puppetry , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[13] Jitendra Malik,et al. Tracking people with twists and exponential maps , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[14] Larry S. Davis,et al. 3-D model-based tracking of humans in action: a multi-view approach , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15] David J. Fleet,et al. Stochastic Tracking of 3D Human Figures Using 2D Image Motion , 2000, ECCV.

[16] Hsi-Jian Lee,et al. Determination of 3D human body postures from a single view , 1985, Comput. Vis. Graph. Image Process..

[17] Takeo Kanade,et al. A unified factorization algorithm for points, line segments and planes with uncertainty models , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[18] Brian Sallans,et al. A Hierarchical Community of Experts , 1999, Learning in Graphical Models.

[19] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[20] Camillo J. Taylor,et al. Reconstruction of articulated objects from point correspondences in a single uncalibrated image , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[21] J. O'Rourke,et al. Model-based image analysis of human motion using constraint propagation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22] William T. Freeman,et al. Bayesian Reconstruction of 3D Human Motion from Single-Camera Video , 1999, NIPS.

[23] Stan Sclaroff,et al. 3D hand pose reconstruction using specialized mappings , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[24] Alex Pentland,et al. Recovery of Nonrigid Motion and Structure , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[25] Takeo Kanade,et al. Model-based tracking of self-occluding articulated objects , 1995, Proceedings of IEEE International Conference on Computer Vision.

[26] Yang Song,et al. Towards detection of human motion , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[27] Geoffrey E. Hinton,et al. A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[28] R. Okafor. Maximum likelihood estimation from incomplete data , 1987 .

[29] Robert A. Jacobs,et al. Hierarchical Mixtures of Experts and the EM Algorithm , 1993, Neural Computation.

[30] J. Freidman,et al. Multivariate adaptive regression splines , 1991 .

[31] Frank Dellaert,et al. Structure from motion without correspondence , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).