Statistical models of visual shape and motion

This paper1 addresses some problems in the interpretation of visually observed shapes, both planar and three-dimensional, in motion. Mumford (1996), interpreting the Pattern Theory developed over a number of years by Grenander (1976), views images as pure patterns that have been distorted by a combination of four kinds of degradations. This view applies naturally to the analysis of static, two-dimensional images. The four degradations are given here, together with comments on how they need to be extended to take account of three-dimensional objects in motion.

[1]  R. Fisher The Advanced Theory of Statistics , 1943, Nature.

[2]  Maurice G. Kendall,et al.  The Advanced Theory of Statistics, Vol. 2: Inference and Relationship , 1979 .

[3]  N. L. Johnson,et al.  Linear Statistical Inference and Its Applications , 1966 .

[4]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[5]  Calyampudi R. Rao,et al.  Linear Statistical Inference and Its Applications. , 1975 .

[6]  A.H. Haddad,et al.  Applied optimal estimation , 1976, Proceedings of the IEEE.

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[9]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Graham C. Goodwin,et al.  Adaptive filtering prediction and control , 1984 .

[11]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[12]  A. Zinober Matrices: Methods and Applications , 1992 .

[13]  Ulf Grenander,et al.  Hands: A Pattern Theoretic Study of Biological Shapes , 1990 .

[14]  A. L. Sutherland,et al.  Finding spiral structures in images of galaxies , 1990, Philosophical Transactions of the Royal Society of London. Series A: Physical and Engineering Sciences.

[15]  J J Koenderink,et al.  Affine structure from motion. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[16]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Biing-Hwang Juang,et al.  Hidden Markov Models for Speech Recognition , 1991 .

[18]  A. B. Rami Shani,et al.  Matrices: Methods and Applications , 1992 .

[19]  C Tomasi,et al.  Shape and motion from image streams: a factorization method. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[20]  N. Gordon,et al.  Novel approach to nonlinear/non-Gaussian Bayesian state estimation , 1993 .

[21]  Kanti V. Mardia,et al.  Techniques for online gesture recognition on workstations , 1993, Image Vis. Comput..

[22]  Timothy F. Cootes,et al.  Building and using flexible models incorporating grey-level information , 1993, 1993 (4th) International Conference on Computer Vision.

[23]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[24]  Michael I. Miller,et al.  REPRESENTATIONS OF KNOWLEDGE IN COMPLEX SYSTEMS , 1994 .

[25]  Geir Storvik,et al.  A Bayesian Approach to Dynamic Contours Through Stochastic Sampling and Simulated Annealing , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Michael Isard,et al.  3D position, attitude and shape input using video tracking of hands and lips , 1994, SIGGRAPH.

[27]  David C. Hogg,et al.  Learning Flexible Models from Image Sequences , 1994, ECCV.

[28]  Anup Basu,et al.  Motion Tracking with an Active Camera , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Jitendra Malik,et al.  Robust Multiple Car Tracking with Occlusion Reasoning , 1994, ECCV.

[30]  Aaron F. Bobick,et al.  Recognition of human body motion using phase space constraints , 1995, Proceedings of IEEE International Conference on Computer Vision.

[31]  David Beymer,et al.  Face recognition from one example view , 1995, Proceedings of IEEE International Conference on Computer Vision.

[32]  Aaron F. Bobick,et al.  A state-based technique for the summarization and recognition of gesture , 1995, Proceedings of IEEE International Conference on Computer Vision.

[33]  Timothy F. Cootes,et al.  A unified approach to coding and interpreting face images , 1995, Proceedings of IEEE International Conference on Computer Vision.

[34]  David C. Hogg,et al.  An Adaptive Eigenshape Model , 1995, BMVC.

[35]  Michael Isard,et al.  Learning to Track the Visual Motion of Contours , 1995, Artif. Intell..

[36]  G. Kitagawa Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models , 1996 .

[37]  D. Mumford Perception as Bayesian Inference: Pattern theory: A unifying perspective , 1996 .

[38]  Andrew Blake,et al.  Statistical mosaics for tracking , 1996, Image Vis. Comput..

[39]  David C. Knill,et al.  Introduction: a Bayesian formulation of visual perception , 1996 .

[40]  Tomaso A. Poggio,et al.  Image Synthesis from a Single Example Image , 1996, ECCV.

[41]  David C. Hogg,et al.  Generating Spatiotemporal Models from Examples , 1995, BMVC.

[42]  Michael Isard,et al.  Contour Tracking by Stochastic Propagation of Conditional Density , 1996, ECCV.

[43]  D. Mumford Pattern theory: a unifying perspective , 1996 .

[44]  Andrew Blake,et al.  Learning Dynamics of Complex Motions from Image Sequences , 1996, ECCV.

[45]  Anthony G. Cohn,et al.  Generation of Semantic Regions from Image Sequences , 1996, ECCV.

[46]  Joshua B. Tenenbaum,et al.  Learning bilinear models for two-factor problems in vision , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[47]  Andrew Blake,et al.  Separability of pose and expression in facial tracking and animation , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[48]  Michael Isard,et al.  A mixed-state condensation tracker with automatic model-switching , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[49]  Andrew Blake,et al.  A probabilistic contour discriminant for object localisation , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[50]  Michael Isard,et al.  Active Contours , 2000, Springer London.

[51]  Andrew Blake,et al.  Using expectation-maximisation to learn dynamical models from visual data , 1999, Image Vis. Comput..