Learning deformable models for tracking human motion

The analysis and automatic interpretation of images containing moving non-rigid objects, such as walking people, has been the subject of considerable research in the field of computer vision and pattern recognition. In order to build fast and reliable systems some kind of prior model is generally required. A model enables the system to cope with situations where there is considerable background clutter or where information is missing from the image data. This may be due to imaging errors (e.g. blurring due to motion) or due to part of an object becoming hidden from view. Conventional approaches to the problem of tracking non-rigid objects require complex hand-crafted models which are not easily adapted to different problems. A more recent approach uses training information to build models for image analysis. This thesis extends this approach by building flexible 2D models, automatically, from sequences of training images. Efficient methods are described for using the resulting models for real time contour tracking using optimal linear filtering techniques. The method is further extended by incorporating a feedback scheme to generate a more compact linear model which is shown to be more robust and accurate for tracking. Models of the shape of an object do not utilise the temporal information contained within the training sequences. A novel method is described for automatically learning a spatiotemporal, physically-based model that allows the system to accurately predict the expected change in object shape over time. This approach is shown to increase the reliability of the system, requiring only a modest increase in computational processing. The system can be automatically trained on video sequences to learn constraints on the apparent shape and motion of a particular non-rigid object in a particular environment. Results show the system is capable of tracking several walking pedestrians in real time without the use of expensive dedicated hardware. The output from this system has potential uses in the areas of surveillance, animation and gait analysis.

[1]  Christopher J. Taylor,et al.  Automatic Landmark Generation for Point Distribution Models , 1994, BMVC.

[2]  P. G. Ciarlet,et al.  Introduction to Numerical Linear Algebra and Optimisation , 1989 .

[3]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Takeo Kanade,et al.  DigitEyes: vision-based hand tracking for human-computer interaction , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[5]  Christopher J. Taylor,et al.  Model-Based Interpretation of 3D Medical Images , 1993, BMVC.

[6]  Timothy F. Cootes,et al.  Building and using flexible models incorporating grey-level information , 1993, 1993 (4th) International Conference on Computer Vision.

[7]  Geoffrey E. Hinton,et al.  Combining deformable models and neural networks for handprinted digit recognition , 1994 .

[8]  Timothy F. Cootes,et al.  A Probabilistic Fitness Measure for Deformable Template Models , 1994, BMVC.

[9]  Noel A. Murphy,et al.  Long sequence analysis of human motion using eigenvector decomposition , 1993, Other Conferences.

[10]  Charles Kervrann,et al.  Robust tracking of stochastic deformable models in long image sequences , 1994, Proceedings of 1st International Conference on Image Processing.

[11]  Richard Szeliski,et al.  Tracking with Kalman snakes , 1993 .

[12]  David C. Hogg,et al.  Building a Model of a Road Junction Using Moving Vehicle Information , 1992, BMVC.

[13]  Timothy F. Cootes,et al.  Tracking and Recognising Hand Gestures using Statistical Shape Models , 1995, BMVC.

[14]  Frederic Fol Leymarie,et al.  Tracking Deformable Objects in the Plane Using an Active Contour Model , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[16]  Andrew Blake,et al.  Dynamic contours: real-time active splines , 1993 .

[17]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[18]  Timothy F. Cootes,et al.  The Use of Active Shape Models for Locating Structures in Medical Images , 1993, IPMI.

[19]  A.H. Haddad,et al.  Applied optimal estimation , 1976, Proceedings of the IEEE.

[20]  Jitendra Malik,et al.  Robust Multiple Car Tracking with Occlusion Reasoning , 1994, ECCV.

[21]  R. Tennant Algebra , 1941, Nature.

[22]  James D. Foley,et al.  Fundamentals of interactive computer graphics , 1982 .

[23]  Stan Sclaroff,et al.  On modal modeling for medical images: underconstrained shape description and data compression , 1994, Proceedings of IEEE Workshop on Biomedical Image Analysis.

[24]  Alex Pentland,et al.  Closed-form solutions for physically-based shape modeling and recognition , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  I. G. BONNER CLAPPISON Editor , 1960, The Electric Power Engineering Handbook - Five Volume Set.

[26]  Michael Isard,et al.  3D position, attitude and shape input using video tracking of hands and lips , 1994, SIGGRAPH.

[27]  Alex Pentland,et al.  A modal framework for correspondence and description , 1993, 1993 (4th) International Conference on Computer Vision.

[28]  C. Nastar,et al.  Spatio-temporal analysis of nonrigid motion from 4D data , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[29]  T D Albright,et al.  Visual motion perception. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Hsi-Jian Lee,et al.  Knowledge-guided visual perception of 3-D human gait from a single image sequence , 1992, IEEE Trans. Syst. Man Cybern..

[31]  David C. Hogg,et al.  Extending the Point Distribution Model Using Polar Coordinates , 1995, CAIP.

[32]  Timothy F. Cootes,et al.  Training Models of Shape from Sets of Examples , 1992, BMVC.

[33]  Timothy F. Cootes,et al.  Modelling Object Appearance using The Grey-Level Surface , 1994, BMVC.

[34]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[35]  A. Jacobson,et al.  Morphometric tools for landmark data , 1993 .

[36]  E.E. Pissaloux,et al.  Image Processing , 1994, Proceedings. Second Euromicro Workshop on Parallel and Distributed Processing.

[37]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[38]  Yee-Hong Yang,et al.  First Sight: A Human Body Outline Labeling System , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Andrew Blake,et al.  Statistical Background Modelling for Tracking with a Virtual Camera , 1995, BMVC.

[40]  Alex Pentland,et al.  Recovery of non-rigid motion and structure , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[41]  Timothy F. Cootes,et al.  Active Shape Models: Evaluation of a Multi-Resolution Method for Improving Image Search , 1994, BMVC.

[42]  Timothy F. Cootes,et al.  Active Shape Models - 'smart snakes' , 1992, BMVC.

[43]  David C. Hogg Model-based vision: a program to see a walking person , 1983, Image Vis. Comput..

[44]  John C. Platt,et al.  Elastically deformable models , 1987, SIGGRAPH.

[45]  Timothy F. Cootes,et al.  Combining Point Distribution Models with Shape Models Based on Finite Element Analysis , 1994, BMVC.

[46]  J. Gower Generalized procrustes analysis , 1975 .

[47]  Zhengyou Zhang,et al.  Multi-Sensor Multi-Target Tracking - Strategies for Events that become Invisible , 1995, BMVC.

[48]  David C. Hogg,et al.  Learning the Distribution of Object Trajectories for Event Recognition , 1995, BMVC.

[49]  Karl Rohr,et al.  Incremental recognition of pedestrians from image sequences , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[50]  E. Adelson,et al.  Analyzing gait with spatiotemporal surfaces , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[51]  Jake K. Aggarwal,et al.  Segmentation through the detection of changes due to motion , 1979 .

[52]  Michael Isard,et al.  Learning to Track the Visual Motion of Contours , 1995, Artif. Intell..