Binocular Full-Body Pose Recognition and Orientation Inference Using Multilinear Analysis

In this chapter, we propose an approach to full-body pose recognition and body orientation estimation using multilinear analysis. We extract low-dimensional pose and body orientation coefficient vectors by performing tensor decomposition and projection on silhouette images obtained from wide baseline binocular cameras. The coefficient vectors are then used as feature vectors in pose recognition and body orientation estimation. To do pose recognition, pose coefficient vectors obtained from synthesized pose silhouettes are used to train a family of support vector machines as pose classifiers. Using orientation coefficient vectors, a 1-D orientation manifold is learned and further used for the estimation of body orientation. Experiment results obtained using both synthetic and real image data showed that the performance of our approach is comparable to existing pose recognition approaches, and that our approach outperformed the traditional tensor-based recognition approach in the comparative test.

[1]  Ioannis A. Kakadiaris,et al.  Model-based estimation of 3D human motion with occlusion based on active multi-viewpoint selection , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Odest Chadwicke Jenkins,et al.  Tracking human motion and actions for interactive robots , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[3]  Rómer Rosales,et al.  Learning Body Pose via Specialized Maps , 2001, NIPS.

[4]  Mohan M. Trivedi,et al.  Human Body Model Acquisition and Tracking Using Voxel Data , 2003, International Journal of Computer Vision.

[5]  Jin Kjölberg,et al.  Designing full body movement interaction using modern dance as a starting point , 2004, DIS '04.

[6]  Demetri Terzopoulos,et al.  Multilinear Analysis of Image Ensembles: TensorFaces , 2002, ECCV.

[7]  Takeo Kanade,et al.  Shape-from-silhouette of articulated objects and its use for human body kinematics estimation and motion capture , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[8]  Kee Chang Lee,et al.  Virtual Stage: A Location-Based Karaoke System , 1998, IEEE Multim..

[9]  Ying Wu,et al.  Vision-Based Gesture Recognition: A Review , 1999, Gesture Workshop.

[10]  Inderjit S. Dhillon,et al.  Clustering on the Unit Hypersphere using von Mises-Fisher Distributions , 2005, J. Mach. Learn. Res..

[11]  Isaac Cohen,et al.  Posture and Gesture Recognition using 3D Body Shapes Decomposition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[12]  S. Mitra,et al.  Gesture Recognition: A Survey , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[13]  Mun Wai Lee,et al.  Integrating component cues for human pose tracking , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[14]  Rui Li,et al.  Monocular Tracking of 3D Human Motion with a Coordinated Mixture of Factor Analyzers , 2006, ECCV.

[15]  Gang Qian,et al.  Dance posture recognition using wide-baseline orthogonal stereo cameras , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[16]  M. Alex O. Vasilescu,et al.  TensorTextures: multilinear image-based rendering , 2004, SIGGRAPH 2004.

[17]  Jwu-Sheng Hu,et al.  3-D Human Posture Recognition System Using 2-D Shape Features , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[18]  Hiroshi Mizoguchi,et al.  Human pose recognition by memory-based hierarchical feature matching , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[19]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[20]  Roberto Cipolla,et al.  Hierarchical Part-Based Human Body Pose Estimation , 2005, BMVC.

[21]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[22]  François Brémond,et al.  Applying 3D human model in a posture recognition system , 2006, Pattern Recognit. Lett..

[23]  Loren Olson,et al.  A gesture-driven multimodal interactive dance system , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[24]  Loren Olson,et al.  Movement-based interactive dance performance , 2006, MM '06.

[25]  H. Kiers An alternating least squares algorithms for PARAFAC2 and three-way DEDICOM , 1993 .

[26]  Surendra Ranganath,et al.  Automatic Sign Language Analysis: A Survey and the Future beyond Lexical Meaning , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  David J. Fleet,et al.  3D People Tracking with Gaussian Process Dynamical Models , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[28]  Nicholas R. Howe,et al.  Silhouette lookup for monocular 3D pose tracking , 2007, Image Vis. Comput..

[29]  Yung-Yaw Chen,et al.  Human Posture Recognition by Simple Rules , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[30]  A. Elgammal,et al.  Inferring 3D body pose from silhouettes using activity manifold learning , 2004, CVPR 2004.

[31]  Hanspeter Pfister,et al.  Face transfer with multilinear models , 2005, SIGGRAPH 2005.

[32]  Ahmed M. Elgammal,et al.  Modeling View and Posture Manifolds for Tracking , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[33]  Shuji Hashimoto,et al.  EyesWeb: Toward Gesture and Affect Recognition in Interactive Dance and Music Systems , 2000, Computer Music Journal.

[34]  Seong-Whan Lee Automatic gesture recognition for intelligent human-robot interaction , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[35]  Yun Yuan,et al.  Posture and Activity Recognition Using Projection Histogram and PCA Methods , 2008, 2008 Congress on Image and Signal Processing.

[36]  Thomas B. Moeslund,et al.  A Survey of Computer Vision-Based Human Motion Capture , 2001, Comput. Vis. Image Underst..

[37]  Lars Elden,et al.  Matrix methods in data mining and pattern recognition , 2007, Fundamentals of algorithms.

[38]  Tieniu Tan,et al.  Recent developments in human motion analysis , 2003, Pattern Recognit..

[39]  Feiyue Huang,et al.  Viewpoint Insensitive Posture Representation for Action Recognition , 2006, AMDO.

[40]  Joos Vandewalle,et al.  A Multilinear Singular Value Decomposition , 2000, SIAM J. Matrix Anal. Appl..