Averaging facial expression over time.

The visual system groups similar features, objects, and motion (e.g., Gestalt grouping). Recent work suggests that the computation underlying perceptual grouping may be one of summary statistical representation. Summary representation occurs for low-level features, such as size, motion, and position, and even for high level stimuli, including faces; for example, observers accurately perceive the average expression in a group of faces (J. Haberman & D. Whitney, 2007, 2009). The purpose of the present experiments was to characterize the time-course of this facial integration mechanism. In a series of three experiments, we measured observers' abilities to recognize the average expression of a temporal sequence of distinct faces. Faces were presented in sets of 4, 12, or 20, at temporal frequencies ranging from 1.6 to 21.3 Hz. The results revealed that observers perceived the average expression in a temporal sequence of different faces as precisely as they perceived a single face presented repeatedly. The facial averaging was independent of temporal frequency or set size, but depended on the total duration of exposed faces, with a time constant of approximately 800 ms. These experiments provide evidence that the visual system is sensitive to the ensemble characteristics of complex objects presented over time.

[1]  Max Wertheimer,et al.  Untersuchungen zur Lehre von der Gestalt , .

[2]  M. Wertheimer Untersuchungen zur Lehre von der Gestalt. II , 1923 .

[3]  M. Posner,et al.  On the genesis of abstract ideas. , 1968, Journal of experimental psychology.

[4]  P. Ekman Pictures of Facial Affect , 1976 .

[5]  M. Potter Short-term conceptual memory for pictures. , 1976, Journal of experimental psychology. Human learning and memory.

[6]  J. Russell A circumplex model of affect. , 1980 .

[7]  R. L. Solso,et al.  Prototype formation of faces: A case of pseudo-memory , 1981 .

[8]  Robert Sekuler,et al.  Coherent global motion percepts from stochastic local motions , 1984, Vision Research.

[9]  S. McKee,et al.  Sequential recruitment in the discrimination of velocity. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[10]  O. Braddick,et al.  The combination of motion signals over time , 1989, Vision Research.

[11]  Andrew P. Duchon,et al.  The human visual system averages speed information , 1992, Vision Research.

[12]  H. Nothdurft Faces and Facial Expressions do not Pop Out , 1993, Perception.

[13]  A. Ohman,et al.  Masking the face: recognition of emotional facial expressions as a function of the parameters of backward masking. , 1993, Scandinavian journal of psychology.

[14]  Gregory Bock,et al.  Higher-order processing in the visual system , 1994 .

[15]  Kimron Shapiro,et al.  Direct measurement of attentional dwell time in human vision , 1994, Nature.

[16]  W. Newsome,et al.  Neuronal and psychophysical sensitivity to motion signals in extrastriate area MST of the macaque monkey , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[17]  H C Nothdurft,et al.  Common properties of visual segmentation. , 1994, Ciba Foundation symposium.

[18]  S. McKee,et al.  Detecting a trajectory embedded in random-direction motion noise , 1995, Vision Research.

[19]  Ronald A. Rensink,et al.  TO SEE OR NOT TO SEE: The Need for Attention to Perceive Changes in Scenes , 1997 .

[20]  J. Findlay,et al.  Face Detection in Peripheral Vision: Do Faces Pop Out? , 1997, Perception.

[21]  Elena K. Festa,et al.  Recruitment mechanisms in speed and fine-direction discrimination tasks , 1997, Vision Research.

[22]  Edward K. Vogel,et al.  The capacity of visual working memory for features and conjunctions , 1997, Nature.

[23]  David C. Burr,et al.  Seeing biological motion , 1998, Nature.

[24]  D. Simons,et al.  Failure to detect changes to people during a real-world interaction , 1998 .

[25]  K. Nakayama,et al.  Robust representations for faces: evidence from visual search. , 1999, Journal of experimental psychology. Human perception and performance.

[26]  Frans A. J. Verstraten,et al.  Limits of attentive tracking reveal temporal properties of attention , 2000, Vision Research.

[27]  P. Cavanagh Seeing the forest but not the trees , 2001, Nature Neuroscience.

[28]  D. Ariely Seeing Sets: Representation by Statistical Properties , 2001, Psychological science.

[29]  Felix Wichmann,et al.  The psychometric function: II. Bootstrap-based confidence intervals and sampling , 2001, Perception & psychophysics.

[30]  J. Lund,et al.  Compulsory averaging of crowded orientation signals in human vision , 2001, Nature Neuroscience.

[31]  F A Wichmann,et al.  Ning for Helpful Comments and Suggestions. This Paper Benefited Con- Siderably from Conscientious Peer Review, and We Thank Our Reviewers the Psychometric Function: I. Fitting, Sampling, and Goodness of Fit , 2001 .

[32]  Michael S. Landy,et al.  Visual perception of texture , 2002 .

[33]  A. Treisman,et al.  Representation of statistical properties , 2003, Vision Research.

[34]  J. Wolfe Moving towards solutions to some enduring controversies in visual search , 2003, Trends in Cognitive Sciences.

[35]  E. McKone,et al.  Isolating the special component of face recognition: peripheral identification and a Mooney face. , 2004, Journal of experimental psychology. Learning, memory, and cognition.

[36]  J. Beck Textural segmentation, second-order statistics, and textural elements , 1983, Biological Cybernetics.

[37]  L. Chalupa,et al.  The visual neurosciences , 2004 .

[38]  J. Hietanen,et al.  Positive facial expressions are recognized faster than negative facial expressions, but why? , 2004, Psychological research.

[39]  Randolph Blake,et al.  The role of temporal structure in human vision. , 2005, Behavioral and cognitive neuroscience reviews.

[40]  A. Treisman,et al.  Statistical processing: computing the average size in perceptual groups , 2005, Vision Research.

[41]  D. Pelli,et al.  Are faces processed like words? A diagnostic test for recognition by parts. , 2005, Journal of vision.

[42]  E. Louie,et al.  Holistic crowding: selective interference between configural representations of faces in crowded scenes. , 2007, Journal of vision.

[43]  Jason M Haberman,et al.  Correspondences Rapid extraction of mean emotion and gender from sets of faces , 2007 .

[44]  R. Blake,et al.  Perception of human motion. , 2007, Annual review of psychology.

[45]  Z. Kourtzi,et al.  Linking form and motion in the primate brain , 2008, Trends in Cognitive Sciences.

[46]  A. Oliva,et al.  The Representation of Simple Ensemble Visual Features Outside the Focus of Attention , 2008, Psychological science.

[47]  D. Burr,et al.  A Visual Sense of Number , 2007, Current Biology.

[48]  Jason M Haberman,et al.  Seeing the mean: ensemble coding for sets of faces. , 2009, Journal of experimental psychology. Human perception and performance.

[49]  J. D. de Fockert,et al.  Rapid extraction of mean identity from sets of faces. , 2009, Quarterly journal of experimental psychology.

[50]  Timothy D. Sweeny,et al.  Within-hemifield perceptual averaging of facial expressions predicted by neural averaging. , 2009, Journal of vision.

[51]  Brian J. Scholl,et al.  Perceptually averaging in a continuous visual world: Extracting statistical summary representations over time , 2010 .

[52]  Alice R. Albrecht,et al.  Perceptually Averaging in a Continuous Visual World , 2010, Psychological science.