Synthesis and recognition of biological motion patterns based on linear superposition of prototypical motion sequences

The linear combination of prototypical views has been shown to provide a powerful method for the recognition and analysis of images of three-dimensional stationary objects. We present preliminary results on an extension of this idea to video sequences. For this extension, the computation of correspondences in space-time turns out to be the central theoretical problem, which we solve with a new correspondence algorithm. Using simulated images of biological motion we demonstrate the usefulness of the superposition of prototypical sequences for the synthesis of new video sequences, and for the analysis and recognition of actions. Our method permits to impose a topology over the space of video sequences of action patterns. This topology is more complicated than a linear space. We present a new method that is based on the structural risk minimization principle of statistical learning theory, which permits to exploit this knowledge about the topology of the pattern space for recognition.

[1]  J. O'Rourke,et al.  Model-based image analysis of human motion using constraint propagation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[3]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[4]  Edward H. Adelson,et al.  Analyzing and recognizing walking figures in XYT , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Alex Pentland,et al.  Task-Specific Gesture Analysis in Real-Time Using Interpolated Views , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Tomaso Poggio,et al.  Image Representations for Visual Learning , 1996, Science.

[7]  James W. Davis,et al.  The Representation and Recognition of Action Using Temporal Templates , 1997, CVPR 1997.

[8]  Tomaso A. Poggio,et al.  Linear Object Classes and Image Synthesis From a Single Example Image , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Alex Pentland,et al.  Coding, Analysis, Interpretation, and Recognition of Facial Expressions , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[11]  Tony Ezzat,et al.  MikeTalk: a talking facial display based on morphing visemes , 1998, Proceedings Computer Animation '98 (Cat. No.98EX169).

[12]  Michael J. Black,et al.  Parameterized Modeling and Recognition of Activities , 1999, Comput. Vis. Image Underst..