Detection and Recognition of Periodic, Nonrigid Motion

The recognition of nonrigid motion, particularly that arising from human movement (and by extension from the locomotory activity of animals) has typically made use of high-level parametric models representing the various body parts (legs, arms, trunk, head etc.) and their connections to each other. Such model-based recognition has been successful in some cases; however, the methods are often difficult to apply to real-world scenes, and are severely limited in their generalizability. The first problem arises from the difficulty of acquiring and tracking the requisite model parts, usually specific joints such as knees, elbows or ankles. This generally requires some prior high-level understanding and segmentation of the scene, or initialization by a human operator. The second problem, with generalization, is due to the fact that the human model is not much good for dogs or birds, and for each new type of motion, a new model must be hand-crafted. In this paper, we show that the recognition of human or animal locomotion, and, in fact, any repetitive activity can be done using low-level, non-parametric representations. Such an approach has the advantage that the same underlying representation is used for all examples, and no individual tailoring of models or prior scene understanding is required. We show in particular, that repetitive motion is such a strong cue, that the moving actor can be segmented, normalized spatially and temporally, and recognized by matching against a spatio-temporal template of motion features. We have implemented a real-time system that can recognize and classify repetitive motion activities in normal gray-scale image sequences. Results on a number of real-world sequences are described.

[1]  Charles R. Dyer,et al.  Cyclic motion detection using spatiotemporal surfaces and curves , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[2]  Virginia R. de Sa,et al.  Unsupervised Classification Learning from Cross-Modal Environmental Structure , 1994 .

[3]  Dana H. Ballard,et al.  Self-Teaching Through Correlated Input , 1993 .

[4]  Andy Hon Wai Chun,et al.  A Representation for Temporal Sequence and Duration in Massively Parallel Networks , 1986, AAAI.

[5]  William Rowan,et al.  The Study of Instinct , 1953 .

[6]  J. Ewert Neuroethology of releasing mechanisms: Prey-catching in toads , 1987, Behavioral and Brain Sciences.

[7]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[8]  John J. Hopfield,et al.  CONCENTRATION INFORMATION IN TIME: ANALOG NEURAL NETWORKS WITH APPLICATIONS TO SPEECH RECOGNITION PROBLEMS. , 1987 .

[9]  C. Koch,et al.  The analysis of visual motion: from computational theory to neuronal mechanisms. , 1986, Annual review of neuroscience.

[10]  Mubarak Shah,et al.  The trajectory primal sketch: a multi-scale scheme for representing motion characteristics , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  R. H. Smythe M.R.C.V.S. Vision in the Animal World , 1975, Palgrave Macmillan UK.

[12]  R. F. Rashid,et al.  Towards a system for the interpretation of moving light displays , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Jerome A. Feldman,et al.  Time, Space and Form in Vision , 1988 .

[14]  Hans-Hellmut Nagel,et al.  Algorithmic characterization of vehicle trajectories from image sequences by motion verbs , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Thomas S. Huang,et al.  Correction to "Estimating 3-D motion parameters of a rigid planar patch,II: Singular value decomposition" , 1983 .

[16]  P. J. Burt,et al.  Change Detection and Tracking Using Pyramid Transform Techniques , 1985, Other Conferences.

[17]  E. Wolf,et al.  FLICKER AND THE REACTIONS OF BEES TO FLOWERS , 1937, The Journal of general physiology.

[18]  Teuvo Kohonen,et al.  Improved versions of learning vector quantization , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[19]  Biing-Hwang Juang,et al.  Mixture autoregressive hidden Markov models for speech signals , 1985, IEEE Trans. Acoust. Speech Signal Process..

[20]  Allen A. Montgomery,et al.  Automatic optically-based recognition of speech , 1988, Pattern Recognit. Lett..

[21]  Randal C. Nelson,et al.  Detecting activities , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[22]  R. Smythe,et al.  Vision in the animal world , 1975 .

[23]  J. O'Rourke,et al.  Model-based image analysis of human motion using constraint propagation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  H. Evans The Study of Instinct , 1952 .

[25]  MotionSteven M. Seitzseitz,et al.  A ne Invariant Detection of Periodic Motion , 1994 .

[26]  Norman I. Badler,et al.  Temporal scene analysis: conceptual descriptions of object movements. , 1975 .

[27]  Randal C. Nelson,et al.  Recognition of motion from temporal texture , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  G. Johansson Visual perception of biological motion and a model for its analysis , 1973 .

[29]  J. Cutting Six tenets for event perception , 1981, Cognition.

[30]  R. Oka,et al.  Recognition of dexterous manipulations from time-varying images , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[31]  E. Petajan,et al.  An improved automatic lipreading system to enhance speech recognition , 1988, CHI '88.

[32]  R. Oka,et al.  Spotting Recognition of Human Gestures from Motion Images , 1994 .

[33]  D. D. Hoffman,et al.  The interpretation of biological motion , 1982, Biological Cybernetics.

[34]  Steven M. Seitz,et al.  Affine invariant detection of periodic motion , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Randal C. Nelson Qualitative detection of motion by a moving observer , 2004, International Journal of Computer Vision.