Encoding multielement scenes: statistical learning of visual feature hierarchies.

The authors investigated how human adults encode and remember parts of multielement scenes composed of recursively embedded visual shape combinations. They found that shape combinations that are parts of larger configurations are remembered less well than shape combinations of the same kind that are not embedded. Combined with basic mechanisms of statistical learning, this embeddedness constraint allows complex new features to be developed, so that internal representations can be acquired efficiently without the learning problem becoming computationally intractable. The resulting representations also encode parts and wholes by chunking the visual input into components according to the statistical coherence of their constituents. These results suggest that a bootstrapping approach of constrained statistical learning offers a unified framework for investigating the formation of different internal representations in pattern and scene perception.
