Establishing and maintaining perceptual coherence: unimodal and multimodal evidence

Abstract Studies of perceptual organization of speech aim to produce a direct characterization of the fundamental aspect of speech perception by which the listener resolves the acoustically heterogeneous elements of speech into a coherent sensory stream. This primitive function determines a sensory sample fit to analyze for linguistic properties. Organizational functions that apply to speech appear to be fast, unlearned, intended, nonsymbolic and indifferent to the familiar short-term sensory qualities of vocally produced sound. Although primitive auditory organizing principles deriving from the classic Gestalt set fail to accommodate the acoustic variety of speech signals, a claim of specialization in the auditory perceptual organization of speech is not warranted. Multimodal instances of perceptual organization reveal that the detection of coherence across modalities shares formal properties with the detection of coherence within the auditory modality.

[1]  E. E. David,et al.  Human communication : a unified view , 1972 .

[2]  Daniel P. W. Ellis,et al.  The auditory organization of speech and other sources in listeners and computational models , 2001, Speech Commun..

[3]  P. Howell,et al.  Some properties of auditory memory for rapid formant transitions , 1977, Memory & cognition.

[4]  J. L. Miller,et al.  On the role of visual rate information in phonetic perception , 1985, Perception & psychophysics.

[5]  P. Gribble,et al.  Temporal constraints on the McGurk effect , 1996, Perception & psychophysics.

[6]  David B. Pisoni,et al.  Multimodal perceptual organization of speech: Evidence from tone analogs of spoken utterances , 1998, Speech Commun..

[7]  O. G. Selfridge,et al.  Pandemonium: a paradigm for learning , 1988 .

[8]  Jessica M. Foxton,et al.  Effects of attention and unilateral neglect on auditory stream segregation. , 2001, Journal of experimental psychology. Human perception and performance.

[9]  Peter D. Eimas,et al.  Organization in the Perception of Speech by Young Infants , 1992 .

[10]  D. Klatt Review of selected models of speech perception , 1989 .

[11]  J. Hochberg ORGANIZATION AND THE GESTALT TRADITION , 1974 .

[12]  G. Johansson Visual perception of biological motion and a model for its analysis , 1973 .

[13]  J. Driver Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading , 1996, Nature.

[14]  Dominic W. Massaro,et al.  SPEECH RECOGNITION AND SENSORY INTEGRATION , 1998 .

[15]  P. Kuhl,et al.  Integral processing of visual place and auditory voicing information during phonetic perception. , 1991, Journal of experimental psychology. Human perception and performance.

[16]  Zachary M. Smith,et al.  Chimaeric sounds reveal dichotomies in auditory perception , 2002, Nature.

[17]  Béatrice de Gelder,et al.  Auditory-visual interaction in voice localization and in bimodal speech recognition: the effects of desynchronization , 1997, AVSP.

[18]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[19]  Robert E. Remez,et al.  The effect of variation in naturalness on phonetic perceptual identification , 2002 .

[20]  S Rosenbloom,et al.  Space Perception in Early Infancy: Perception within a Common Auditory-Visual Space , 1971, Science.

[21]  G. Murch Visual and auditory perception , 1972 .

[22]  D. Pisoni,et al.  Reaction times to comparisons within and across phonetic categories , 1974, Perception & psychophysics.

[23]  A. Bregman,et al.  Fusion of auditory components: Effects of the frequency of amplitude modulation , 1990, Perception & psychophysics.

[24]  J. P. Egan Articulation testing methods , 1948, The Laryngoscope.

[25]  M. P. Friedman,et al.  HANDBOOK OF PERCEPTION , 1977 .

[26]  Jennifer S. Pardo,et al.  On the Bistability of Sine Wave Analogues of Speech , 2001, Psychological science.

[27]  Jennifer S. Pardo,et al.  On the perceptual organization of speech. , 1994, Psychological review.

[28]  Jon Barker,et al.  Is the sine-wave speech cocktail party worth attending? , 1999, Speech Commun..

[29]  D. Pisoni,et al.  Speech perception without traditional speech cues. , 1981, Science.

[30]  William D. Marslen-Wilson,et al.  Lexical Representation and Process , 1991 .

[31]  Mitchell Steinschneider,et al.  Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey , 2001, Hearing Research.

[32]  David B. Pisoni,et al.  Audio-Visual Perception of Sinewave Speech in an Adult Cochlear Implant User: A Case Study , 2001, Ear and hearing.

[33]  K. D. Kryter,et al.  ARTICULATION-TESTING METHODS: CONSONANTAL DIFFERENTIATION WITH A CLOSED-RESPONSE SET. , 1965, The Journal of the Acoustical Society of America.

[34]  L. Elliot Backward and Forward Masking of Probe Tones of Different Frequencies , 1962 .

[35]  Louis Goldstein,et al.  Primary auditory stream segregation of repeated consonant—vowel sequences , 1974 .

[36]  Keith Johnson,et al.  The Role of Speech Perception in Phonology , 2001 .

[37]  A. Liberman,et al.  The motor theory of speech perception revised , 1985, Cognition.

[38]  T D Carrell,et al.  The effect of amplitude comodulation on auditory object formation in sentence perception , 1992, Perception & psychophysics.