论文信息 - Coupled Visual and Kinematic Manifold Models for Tracking

Coupled Visual and Kinematic Manifold Models for Tracking

In this paper, we consider modeling data lying on multiple continuous manifolds. In particular, we model the shape manifold of a person performing a motion observed from different viewpoints along a view circle at a fixed camera height. We introduce a model that ties together the body configuration (kinematics) manifold and visual (observations) manifold in a way that facilitates tracking the 3D configuration with continuous relative view variability. The model exploits the low-dimensionality nature of both the body configuration manifold and the view manifold, where each of them are represented separately. The resulting representation is used for tracking complex motions within a Bayesian framework, in which the model provides a low-dimensional state representation as well as a constrained dynamic model for both body configuration and view variations. Experimental results estimating the 3D body posture from a single camera are presented for the HUMANEVA dataset and other complex motion video sequences.

Ahmed M. Elgammal | Chan-Su Lee | A. Elgammal | Chan-Su Lee | Chan-Su Lee

[1] Joshua B. Tenenbaum,et al. Separating Style and Content with Bilinear Models , 2000, Neural Computation.

[2] Vladimir Pavlovic,et al. Impact of Dynamics on Subspace Embedding and Tracking of Sequences , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3] Demetri Terzopoulos,et al. Multilinear Analysis of Image Ensembles: TensorFaces , 2002, ECCV.

[4] David C. Hogg. Model-based vision: a program to see a walking person , 1983, Image Vis. Comput..

[5] Michael J. Black,et al. HumanEva: Synchronized Video and Motion Capture Dataset for Evaluation of Articulated Human Motion , 2006 .

[6] Trevor Darrell,et al. On modelling nonlinear shape-and-texture appearance manifolds , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7] Rui Li,et al. Articulated Pose Estimation in a Learned Smooth Space of Feasible Solutions , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[8] Cristian Sminchisescu,et al. Generative modeling for continuous non-linearly embedded visual inference , 2004, ICML.

[9] Adrian Hilton,et al. A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[10] Stan Sclaroff,et al. 3D hand pose reconstruction using specialized mappings , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[11] Ahmed M. Elgammal,et al. Homeomorphic Manifold Analysis: Learning Decomposable Generative Models for Human Motion Analysis , 2006, WDV.

[12] K. Rohr. Towards model-based recognition of human movements in image sequences , 1994 .

[13] A. Elgammal,et al. Inferring 3D body pose from silhouettes using activity manifold learning , 2004, CVPR 2004.

[14] Rui Li,et al. Simultaneous Learning of Nonlinear Manifold and Dynamical Models for High-dimensional Time Series , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[15] Hiroshi Murase,et al. Learning and recognition of 3D objects from appearance , 1993, [1993] Proceedings IEEE Workshop on Qualitative Vision.

[16] Jake K. Aggarwal,et al. Human Motion Analysis: A Review , 1999, Comput. Vis. Image Underst..

[17] Alex Pentland,et al. Space-time gestures , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[18] S T Roweis,et al. Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[19] A. Elgammal,et al. Separating style and content on a nonlinear manifold , 2004, CVPR 2004.

[20] Aaron F. Bobick,et al. Recognition of human body motion using phase space constraints , 1995, Proceedings of IEEE International Conference on Computer Vision.

[21] Narendra Ahuja,et al. Learning Nonlinear Manifolds from Time Series , 2006, ECCV.

[22] Ahmed M. Elgammal,et al. Nonlinear manifold learning for dynamic shape and dynamic appearance , 2007, Comput. Vis. Image Underst..

[23] Michael J. Black,et al. Parameterized Modeling and Recognition of Activities , 1999, Comput. Vis. Image Underst..

[24] A. Atiya,et al. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[25] Cristian Sminchisescu,et al. Discriminative density propagation for 3D human motion estimation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26] Larry S. Davis,et al. 3-D model-based tracking of humans in action: a multi-view approach , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[27] Joos Vandewalle,et al. A Multilinear Singular Value Decomposition , 2000, SIAM J. Matrix Anal. Appl..

[28] Ahmed M. Elgammal,et al. Tracking People on a Torus , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29] Trevor Darrell,et al. Inferring 3D structure with a statistical image-based shape model , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[30] F. Girosi,et al. Networks for approximation and learning , 1990, Proc. IEEE.

[31] J. Tenenbaum,et al. A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[32] J. O'Rourke,et al. Model-based image analysis of human motion using constraint propagation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33] Trevor Darrell,et al. Face Recognition from Long-Term Observations , 2002, ECCV.

[34] Ioannis A. Kakadiaris,et al. Model-based estimation of 3D human motion with occlusion based on active multi-viewpoint selection , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[35] Trevor Darrell,et al. Learning appearance manifolds from video , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[36] Dariu Gavrila,et al. The Visual Analysis of Human Movement: A Survey , 1999, Comput. Vis. Image Underst..

[37] Matthew Brand,et al. Shadow puppetry , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[38] L. Davis,et al. el-based tracking of humans in action: , 1996 .

[39] Ahmed M. Elgammal,et al. Simultaneous Inference of View and Body Pose using Torus Manifolds , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[40] David J. Fleet,et al. 3D People Tracking with Gaussian Process Dynamical Models , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[41] Trevor Darrell,et al. Fast pose estimation with parameter-sensitive hashing , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[42] Octavia I. Camps,et al. Modeling Correspondences for Multi-Camera Tracking Using Nonlinear Manifold Learning and Target Dynamics , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[43] J. Magnus,et al. Matrix Differential Calculus with Applications in Statistics and Econometrics , 1991 .

[44] Jitendra Malik,et al. Estimating Human Body Configurations Using Shape Context Matching , 2002, ECCV.

[45] David J. Fleet,et al. Priors for people tracking from small training sets , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[46] David J. Fleet,et al. Gaussian Process Dynamical Models , 2005, NIPS.

[47] Alex Pentland,et al. Recognition of Space-Time Gestures using a Distributed Representation , 1993 .

[48] Takeo Kanade,et al. Model-based tracking of self-occluding articulated objects , 1995, Proceedings of IEEE International Conference on Computer Vision.

[49] David J. Fleet,et al. Stochastic Tracking of 3D Human Figures Using 2D Image Motion , 2000, ECCV.

[50] Bernhard Schölkopf,et al. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[51] Ankur Agarwal,et al. 3D human pose from silhouettes by relevance vector regression , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[52] M. Alex O. Vasilescu. Human motion signatures: analysis, synthesis, recognition , 2002, Object recognition supported by user interaction for service robots.