Nonparametric Recognition of Nonrigid Motion

The recognition of nonrigid motion, particularly that arising from human movement (and by extension from the locomotory activity of animals) has typically made use of high-level parametric models representing the various body parts (legs, arms, trunk, head, etc.) and their connections to each other. Such model-based recognition has been successful in some cases; however, the methods are often difficult to apply to real-world scenes, and are severely limited in their generalizability. The first problem arises from the difficulty of acquiring and tracking the requisite model parts, usually specific joints such as knees, elbows or ankles. This generally requires some prior high-level understanding and segmentation of the scene, or initialization by a human operator. The second problem is due to the fact that the human model is not much good for dogs or birds; for each new type of motion, a new model must be hand-crafted. In this paper, we show that the recognition of human or animal locomotion, and, in fact, any repetitive activity, can be done using low-level, non-parametric representations. Such an approach has the advantage that the same underlying representation is used for all examples, and no individual tailoring of models or prior scene understanding is required. We show in particular that repetitive motion is such a strong cue that the moving actor can be segmented, normalized spatially and temporally, and recognized by matching against a spatio-temporal template of motion features. We have implemented a real-time system that can recognize and classify repetitive motion activities in normal gray-scale image sequences. Results on a number of real-world sequences are described.

[1]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[2]  John J. Hopfield,et al.  CONCENTRATION INFORMATION IN TIME: ANALOG NEURAL NETWORKS WITH APPLICATIONS TO SPEECH RECOGNITION PROBLEMS. , 1987 .

[3]  J. Ewert Neuroethology of releasing mechanisms: Prey-catching in toads , 1987, Behavioral and Brain Sciences.

[4]  Norman I. Badler,et al.  Temporal scene analysis: conceptual descriptions of object movements. , 1975 .

[5]  William Rowan,et al.  The Study of Instinct , 1953 .

[6]  Hans-Hellmut Nagel,et al.  Algorithmic characterization of vehicle trajectories from image sequences by motion verbs , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  C. Koch,et al.  The analysis of visual motion: from computational theory to neuronal mechanisms. , 1986, Annual review of neuroscience.

[8]  Charles R. Dyer,et al.  Cyclic motion detection using spatiotemporal surfaces and curves , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[9]  E. Petajan,et al.  An improved automatic lipreading system to enhance speech recognition , 1988, CHI '88.

[10]  Thomas S. Huang,et al.  Correction to "Estimating 3-D motion parameters of a rigid planar patch,II: Singular value decomposition" , 1983 .

[11]  J. Cutting Six tenets for event perception , 1981, Cognition.

[12]  Jerome A. Feldman,et al.  Time, Space and Form in Vision , 1988 .

[13]  J. O'Rourke,et al.  Model-based image analysis of human motion using constraint propagation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  MotionSteven M. Seitzseitz,et al.  A ne Invariant Detection of Periodic Motion , 1994 .

[15]  Steven M. Seitz,et al.  Affine invariant detection of periodic motion , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Jocelyn Crane,et al.  Imaginal behavior of a Trinidad butterfly, Heliconius erato hydara Hewitson, with special reference to the social use of color , 1955, Zoologica : scientific contributions of the New York Zoological Society..

[17]  Randal C. Nelson,et al.  Recognition of motion from temporal texture , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18]  G. Johansson Visual perception of biological motion and a model for its analysis , 1973 .

[19]  Randal C. Nelson,et al.  Detecting activities , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Biing-Hwang Juang,et al.  Mixture autoregressive hidden Markov models for speech signals , 1985, IEEE Trans. Acoust. Speech Signal Process..

[21]  Allen A. Montgomery,et al.  Automatic optically-based recognition of speech , 1988, Pattern Recognit. Lett..

[22]  Mubarak Shah,et al.  The trajectory primal sketch: a multi-scale scheme for representing motion characteristics , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  R. H. Smythe M.R.C.V.S. Vision in the Animal World , 1975, Palgrave Macmillan UK.

[24]  P. J. Burt,et al.  Change Detection and Tracking Using Pyramid Transform Techniques , 1985, Other Conferences.

[25]  E. Wolf,et al.  FLICKER AND THE REACTIONS OF BEES TO FLOWERS , 1937, The Journal of general physiology.

[26]  Teuvo Kohonen,et al.  Improved versions of learning vector quantization , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[27]  Dana H. Ballard,et al.  Self-Teaching Through Correlated Input , 1993 .

[28]  Andy Hon Wai Chun,et al.  A Representation for Temporal Sequence and Duration in Massively Parallel Networks , 1986, AAAI.