Discerning nonrigid 3D shapes from motion cues

Many organisms and objects deform nonrigidly when moving, requiring perceivers to separate shape changes from object motions. Surprisingly, the abilities of observers to correctly infer nonrigid volumetric shapes from motion cues have not been measured, and structure from motion models predominantly use variants of rigidity assumptions. We show that observers are equally sensitive at discriminating cross-sections of flexing and rigid cylinders based on motion cues, when the cylinders are rotated simultaneously around the vertical and depth axes. A computational model based on motion perspective (i.e., assuming perceived depth is inversely proportional to local velocity) predicted the psychometric curves better than shape from motion factorization models using shape or trajectory basis functions. Asymmetric percepts of symmetric cylinders, arising because of asymmetric velocity profiles, provided additional evidence for the dominant role of relative velocity in shape perception. Finally, we show that inexperienced observers are generally incapable of using motion cues to detect inflation/deflation of rigid and flexing cylinders, but this handicap can be overcome with practice for both nonrigid and rigid shapes. The empirical and computational results of this study argue against the use of rigidity assumptions in extracting 3D shape from motion and for the primacy of motion deformations computed from motion shears.

[1]  D. Proffitt,et al.  Depth perception in motion parallax and stereokinesis. , 1993, Journal of experimental psychology. Human perception and performance.

[2]  J. Allman,et al.  Stimulus specific responses from beyond the classical receptive field: neurophysiological mechanisms for local-global comparisons in visual neurons. , 1985, Annual review of neuroscience.

[3]  D D Hoffman,et al.  Inferring Structure from Motion in Two-View and Multiview Displays , 1993, Perception.

[4]  R A Andersen,et al.  The response of area MT and V1 neurons to transparent motion , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[5]  Ellen C. Hildreth,et al.  Measurement of Visual Motion , 1984 .

[6]  H. C. Longuet-Higgins,et al.  The interpretation of a moving retinal image , 1980, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[7]  R. Andersen,et al.  Response of MSTd neurons to simulated 3D orientation of rotating planes. , 2002, Journal of neurophysiology.

[8]  Tim S. Meese,et al.  Speed gradients and the perception of surface slant: Analysis is two-dimensional not one-dimensional , 1995, Vision Research.

[9]  S. Ullman The interpretation of structure from motion , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[10]  K. Nakayama,et al.  Single visual neurons code opposing motion independent of direction. , 1983, Science.

[11]  Gregory J Zelinsky,et al.  The role of "rescue saccades" in tracking objects through occlusions. , 2010, Journal of vision.

[12]  F. Domini,et al.  3-D structure perceived from dynamic information: a new theory , 2003, Trends in Cognitive Sciences.

[13]  J J Koenderink,et al.  Depth and shape from differential perspective in the presence of bending deformations. , 1986, Journal of the Optical Society of America. A, Optics and image science.

[14]  J T Todd,et al.  The perception of 3-dimensional affine structure from minimal apparent motion sequences , 1990, Perception & psychophysics.

[15]  G Sperling,et al.  Kinetic depth effect and identification of shape. , 1989, Journal of experimental psychology. Human perception and performance.

[16]  M Nawrot,et al.  A neural network model of kinetic depth , 1991, Visual Neuroscience.

[17]  Keiji Tanaka,et al.  Integration of direction signals of image motion in the superior temporal sulcus of the macaque monkey , 1986, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[18]  K. Nakayama,et al.  A Velocity Analogue of Brightness Contrast , 1973, Perception.

[19]  Hermann von Helmholtz,et al.  Treatise on Physiological Optics , 1962 .

[20]  F. Domini,et al.  Recovery of 3-D structure from motion is neither euclidean nor affine , 1998 .

[21]  A. Watson,et al.  Quest: A Bayesian adaptive psychometric method , 1983, Perception & psychophysics.

[22]  J. Gibson,et al.  Parallax and perspective during aircraft landings. , 1955, The American journal of psychology.

[23]  J. Marsh,et al.  Anisotropies in the Perception of Three-Dimensional Surfaces , .

[24]  G Sperling,et al.  How to study the kinetic depth effect experimentally. , 1990, Journal of experimental psychology. Human perception and performance.

[25]  C Caudek,et al.  Perceiving surface slant from deformation of optic flow. , 1999, Journal of experimental psychology. Human perception and performance.

[26]  Bart Farell,et al.  A new theory of structure-from-motion perception. , 2009, Journal of vision.

[27]  T. Poggio,et al.  Cognitive neuroscience: Neural mechanisms for the recognition of biological movements , 2003, Nature Reviews Neuroscience.

[28]  B. Rogers,et al.  Simultaneous and Successive Contrast Effects in the Perception of Depth from Motion-Parallax and Stereoscopic Information , 1982, Perception.

[29]  Roberto Cipolla,et al.  Robust structure from motion using motion parallax , 1993, 1993 (4th) International Conference on Computer Vision.

[30]  A. Fitzgibbon,et al.  Humans Ignore Motion and Stereo Cues in Favor of a Fictional Stable World , 2006, Current Biology.

[31]  A. Yuille,et al.  The perceived motion of a stereokinetic stimulus , 2006, Vision Research.

[32]  Norberto M. Grzywacz,et al.  A computational theory for the perception of coherent visual motion , 1988, Nature.

[33]  Takeo Watanabe,et al.  3D surface perception from motion involves a temporal–parietal network , 2009, The European journal of neuroscience.

[34]  K. Tanaka,et al.  Analysis of motion of the visual field by direction, expansion/contraction, and rotation cells clustered in the dorsal part of the medial superior temporal area of the macaque monkey. , 1989, Journal of neurophysiology.

[35]  F. Scharnowski,et al.  Long-lasting modulation of feature integration by transcranial magnetic stimulation. , 2009, Journal of vision.

[36]  J. Aloimonos,et al.  On the kinetic depth effect , 1989, Biological Cybernetics.

[37]  Jan J. Koenderink,et al.  Local structure of movement parallax of the plane , 1976 .

[38]  M. Hogervorst,et al.  The role of perspective effects and accelerations in perceived three-dimensional structure-from-motion. , 2000, Journal of experimental psychology. Human perception and performance.

[39]  Brendon O. Watson,et al.  Computing relief structure from motion with a distributed velocity and disparity representation , 2002, Vision Research.

[40]  A. T. Smith,et al.  On the sensitivity of complex cells in feline striate cortex to relative motion , 2004, Experimental Brain Research.

[41]  Q. Zaidi,et al.  The utility of shape attributes in deciphering movements of non-rigid objects. , 2010, Journal of vision.

[42]  G. Orban,et al.  Responses of macaque STS neurons to optic flow components: a comparison of areas MT and MST. , 1994, Journal of neurophysiology.

[43]  G. Mather,et al.  Gender discrimination in biological motion displays based on dynamic cues , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[44]  Matthew Brand,et al.  A direct method for 3D factorization of nonrigid motion observed in 2D , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[45]  J J Koenderink,et al.  Structure from motion: A tolerance analysis , 1996, Perception & psychophysics.

[46]  Tim S. Meese,et al.  On the relationship between deformation and perceived surface slant , 1996, Vision Research.

[47]  J T Todd,et al.  The perceptual analysis of structure from motion for rotating objects undergoing affine stretching transformations , 1993, Perception & psychophysics.

[48]  Alexander Grunewald,et al.  Neural Correlates of Structure-from-Motion Perception in Macaque V1 and MT , 2002, The Journal of Neuroscience.

[49]  Takeo Kanade,et al.  Nonrigid Structure from Motion in Trajectory Space , 2008, NIPS.

[50]  Andrea J. van Doorn,et al.  Invariant Properties of the Motion Parallax Field due to the Movement of Rigid Bodies Relative to an Observer , 1975 .

[51]  T. Meese,et al.  Computation of surface slant from optic flow: Orthogonal components of speed gradient can be combined , 1997, Vision Research.

[52]  R. M. Siegel,et al.  Three-dimensional structure-from-motion selectivity in the anterior superior temporal polysensory area, STPa, of the behaving monkey. , 2005, Cerebral cortex.

[53]  B Rogers,et al.  Motion Parallax as an Independent Cue for Depth Perception , 1979, Perception.

[54]  C Caudek,et al.  Distortions of depth-order relations and parallelism in structure from motion , 1998, Perception & psychophysics.

[55]  R. Andersen,et al.  Encoding of three-dimensional structure-from-motion by primate area MT neurons , 1998, Nature.

[56]  S. Lea,et al.  Perception of Emotion from Dynamic Point-Light Displays Represented in Dance , 1996, Perception.

[57]  Guy Marchal,et al.  Human Cortical Regions Involved in Extracting Depth from Motion , 1999, Neuron.

[58]  Aaron Hertzmann,et al.  Nonrigid Structure-from-Motion: Estimating Shape and Motion with Hierarchical Priors , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[60]  G. Johansson Visual perception of biological motion and a model for its analysis , 1973 .

[61]  Ralph M. Siegel,et al.  Optic Flow Selectivity in the Anterior Superior Temporal Polysensory Area, STPa, of the Behaving Monkey , 1999, The Journal of Neuroscience.

[62]  Tomaso Poggio,et al.  Role of learning in three-dimensional form perception , 1996, Nature.

[63]  Donald D. Hoffman,et al.  Discriminating rigid from nonrigid motion: Minimum points and views , 1990, Perception & psychophysics.

[64]  Henning Biermann,et al.  Recovering non-rigid 3D shape from image streams , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[65]  T. Albright,et al.  Contribution of area MT to perception of three-dimensional shape: a computational study , 1996, Vision Research.

[66]  D. Regan,et al.  Shape discrimination and the judgement of perfect symmetry: Dissociation of shape from size , 1992, Vision Research.

[67]  E C Hildreth,et al.  Incremental rigidity scheme for recovering structure from motion: position-based versus velocity-based formulations. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[68]  J T Todd,et al.  The perception of three-dimensional structure from rigid and nonrigid motion , 1984, Perception & psychophysics.