Ensemble summary statistics as a basis for rapid visual categorization.

Ensemble summary statistics represent multiple objects on the high level of abstraction-that is, without representing individual features and ignoring spatial organization. This makes them especially useful for the rapid visual categorization of multiple objects of different types that are intermixed in space. Rapid categorization implies our ability to judge at one brief glance whether all visible objects represent different types or just variants of one type. A framework presented here states that processes resembling statistical tests can underlie that categorization. At an early stage (primary categorization), when independent ensemble properties are distributed along a single sensory dimension, the shape of that distribution is tested in order to establish whether all features can be represented by a single or multiple peaks. When primary categories are separated, the visual system either reiterates the shape test to recognize subcategories (in-depth processing) or implements mean comparison tests to match several primary categories along a new dimension. Rapid categorization is not free from processing limitations; the role of selective attention in categorization is discussed in light of these limitations.

[1]  Jüri Allik,et al.  An almost general theory of mean size perception , 2013, Vision Research.

[2]  L. Feigenson,et al.  Multiple Spatially Overlapping Sets Can Be Enumerated in Parallel , 2006, Psychological science.

[3]  R. Rosenholtz A simple saliency model predicts a number of motion popout phenomena , 1999, Vision Research.

[4]  Sang Chul Chong,et al.  Mean Size as a Unit of Visual Working Memory , 2014, Perception.

[5]  Sung Jun Joo,et al.  Statistical processing: Not so implausible after all , 2008, Perception & Psychophysics.

[6]  Daniel J Simons,et al.  Ensemble representations: effects of set size and item heterogeneity on average size perception. , 2013, Acta psychologica.

[7]  Jason M Haberman,et al.  Correspondences Rapid extraction of mean emotion and gender from sets of faces , 2007 .

[8]  A. Treisman,et al.  Representation of statistical properties , 2003, Vision Research.

[9]  B. Bauer Does Stevens’s Power Law for Brightness Extend to Perceptual Brightness Averaging? , 2009 .

[10]  R. Watt,et al.  The computation of orientation statistics from visual texture , 1997, Vision Research.

[11]  H. Nothdurft Feature analysis and the role of similarity in preattentive vision , 1992, Perception & psychophysics.

[12]  Joshua A Solomon,et al.  Visual discrimination of orientation statistics in crowded and uncrowded arrays. , 2010, Journal of vision.

[13]  Justin Halberda,et al.  Memory for Multiple Visual Ensembles in Infancy Representing Individual Objects , 2022 .

[14]  H. Nothdurft The role of features in preattentive vision: Comparison of orientation, motion and color cues , 1993, Vision Research.

[15]  J. Fockert,et al.  Attention modulates set representation by statistical properties , 2008, Perception & psychophysics.

[16]  G. Alvarez Representing multiple objects as an ensemble enhances visual cognition , 2011, Trends in Cognitive Sciences.

[17]  D. Simons,et al.  Better than average: Alternatives to statistical summary representations for rapid judgments of average size , 2008, Perception & psychophysics.

[18]  J. Wolfe,et al.  From Perception to Consciousness: Searching with Anne Treisman , 2012 .

[19]  Cathleen M Moore,et al.  Summary statistics of size: fixed processing capacity for multiple ensembles but unlimited processing capacity for single ensembles. , 2014, Journal of experimental psychology. Human perception and performance.

[20]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[21]  Chris Oriet,et al.  Rapid averaging? Not so fast! , 2011, Psychonomic bulletin & review.

[22]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[23]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[24]  Jeremy M. Wolfe,et al.  Second-order parallel processing: visual search for the odd item in a subset. , 1995, Journal of experimental psychology. Human perception and performance.

[25]  Hee Yeon Im,et al.  The effects of sampling and internal noise on the representation of ensemble average size , 2012, Attention, Perception, & Psychophysics.

[26]  Stefan Treue,et al.  Seeing multiple directions of motion—physiology and psychophysics , 2000, Nature Neuroscience.

[27]  A. Treisman,et al.  Dividing attention across feature dimensions in statistical processing of perceptual groups , 2008, Perception & psychophysics.

[28]  Igor Utochkin,et al.  Distractor heterogeneity effects in visual search are mediated by segmentability , 2014 .

[29]  Drew H. Abney,et al.  Journal of Experimental Psychology : Human Perception and Performance Influence of Musical Groove on Postural Sway , 2015 .

[30]  D. Foster,et al.  Asymmetries in oriented-line detection indicate two orthogonal filters in early vision , 1991, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[31]  M. Bravo,et al.  The role of attention in different visual-search tasks , 1992, Perception & psychophysics.

[32]  Aude Oliva,et al.  Spatial ensemble statistics are efficient codes that can be represented with reduced attention , 2009, Proceedings of the National Academy of Sciences.

[33]  Krista A. Ehinger,et al.  Rethinking the Role of Top-Down Attention in Vision: Effects Attributable to a Lossy Representation in Peripheral Vision , 2011, Front. Psychology.

[34]  David C Burr,et al.  Subitizing but not estimation of numerosity requires attentional resources. , 2010, Journal of vision.

[35]  David Whitney,et al.  An aftereffect of adaptation to mean size , 2012, Visual cognition.

[36]  Timothy F. Brady,et al.  Hierarchical Encoding in Visual Working Memory , 2010, Psychological science.

[37]  Maria Olkkonen,et al.  The central tendency bias in color perception: effects of internal and external noise. , 2014, Journal of vision.

[38]  Derrick G. Watson,et al.  The efficiency of feature-based subitization and counting. , 2005, Journal of experimental psychology. Human perception and performance.

[39]  Ronald A. Rensink The Dynamic Representation of Scenes , 2000 .

[40]  Susan L. Franzel,et al.  Guided search: an alternative to the feature integration model for visual search. , 1989, Journal of experimental psychology. Human perception and performance.

[41]  I. Utochkin,et al.  Parallel averaging of size is possible but range-limited: a reply to Marchant, Simons, and De Fockert. , 2014, Acta psychologica.

[42]  C. Chubb,et al.  A 'dipper' function for texture discrimination based on orientation variance. , 2008, Journal of vision.

[43]  Yaoda Xu,et al.  The association of color memory and the enumeration of multiple spatially overlapping sets. , 2013, Journal of vision.

[44]  D. Ariely Seeing Sets: Representation by Statistical Properties , 2001, Psychological science.

[45]  Z W Pylyshyn,et al.  Tracking multiple independent targets: evidence for a parallel tracking mechanism. , 1988, Spatial vision.

[46]  Hee Yeon Im,et al.  Computation of mean size is based on perceived size , 2009, Attention, perception & psychophysics.

[47]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[48]  H. Bastian Sensation and Perception.—I , 1869, Nature.

[49]  Jason M Haberman,et al.  Seeing the mean: ensemble coding for sets of faces. , 2009, Journal of experimental psychology. Human perception and performance.

[50]  E. Spelke,et al.  Language and Conceptual Development series Core systems of number , 2004 .

[51]  A. Treisman Features and Objects: The Fourteenth Bartlett Memorial Lecture , 1988, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[52]  Jennifer E. Corbett,et al.  The whole is indeed more than the sum of its parts: perceptual averaging in the absence of individual item representation. , 2011, Acta psychologica.

[53]  A. Oliva,et al.  The Representation of Simple Ensemble Visual Features Outside the Focus of Attention , 2008, Psychological science.

[54]  D. Hubel,et al.  Receptive fields of single neurones in the cat's striate cortex , 1959, The Journal of physiology.

[55]  J. Wolfe,et al.  Guided Search 2.0 A revised model of visual search , 1994, Psychonomic bulletin & review.

[56]  A. Treisman,et al.  Statistical processing: computing the average size in perceptual groups , 2005, Vision Research.

[57]  Karla K Evans,et al.  Distributed versus focused attention (count vs estimate). , 2011, Wiley interdisciplinary reviews. Cognitive science.

[58]  Kevin W Eliceiri,et al.  NIH Image to ImageJ: 25 years of image analysis , 2012, Nature Methods.

[59]  H E Egeth,et al.  Local processes in preattentive feature detection. , 1991, Journal of experimental psychology. Human perception and performance.

[60]  Shaul Hochstein,et al.  Computing an Average When Part of the Population Is Not Perceived , 2015, Journal of Cognitive Neuroscience.

[61]  J. Solomon The history of dipper functions , 2009, Attention, perception & psychophysics.

[62]  K. Nakayama,et al.  Situating visual search , 2011, Vision Research.

[63]  Robert L. Goldstone,et al.  Categorical perception. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[64]  Jonathan S. Cant,et al.  Object Ensemble Processing in Human Anterior-Medial Ventral Visual Cortex , 2012, The Journal of Neuroscience.

[65]  A. Treisman How the deployment of attention determines what we see , 2006, Visual cognition.

[66]  Edward K. Vogel,et al.  The capacity of visual working memory for features and conjunctions , 1997, Nature.

[67]  J. Wolfe,et al.  Second-order parallel processing: visual search for the odd item in a subset. , 1995, Journal of experimental psychology. Human perception and performance.

[68]  N. Cowan The magical number 4 in short-term memory: A reconsideration of mental storage capacity , 2001, Behavioral and Brain Sciences.

[69]  Nicolas Robitaille,et al.  When more is less: extraction of summary statistics benefits from larger sets. , 2011, Journal of vision.

[70]  Jason M Haberman,et al.  The visual system discounts emotional deviants when extracting average expression , 2010, Attention, perception & psychophysics.

[71]  J. Wolfe,et al.  The role of categorization in visual search for orientation. , 1992, Journal of experimental psychology. Human perception and performance.

[72]  J. Lund,et al.  Compulsory averaging of crowded orientation signals in human vision , 2001, Nature Neuroscience.

[73]  Ken Nakayama,et al.  Serial and parallel processing of visual feature conjunctions , 1986, Nature.

[74]  G. Fouriezos,et al.  Visual statistical decisions , 2008, Perception & psychophysics.

[75]  David Whitney,et al.  Ensemble perception: Summarizing the scene and broadening the limits of visual processing. , 2012 .

[76]  Antonio Torralba,et al.  Depth Estimation from Image Structure , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[77]  I. Utochkin Visual search with negative slopes: the statistical power of numerosity guides attention. , 2013, Journal of vision.

[78]  P. Cavanagh,et al.  The Capacity of Visual Short-Term Memory is Set Both by Visual Information Load and by Number of Objects , 2004, Psychological science.

[79]  Daniel J. Simons,et al.  Average size perception and the allure of a new mechanism , 2008 .