Perception of Speech and Sound

The transformation of acoustical signals into auditory sensations can be characterized by psychophysical quantities such as loudness, tonality, or perceived pitch. The resolution limits of the auditory system produce spectral and temporal masking phenomena and impose constraints on the perception of amplitude modulations. Binaural hearing (i.e., utilizing the acoustical difference across both ears) employs interaural time and intensity differences to produce localization and binaural unmasking phenomena such as the binaural intelligibility level difference, i.e., the speech reception threshold difference between listening to speech in noise monaurally versus listening with both ears.

[1]  R. Fay,et al.  Speech Processing in the Auditory System , 2010, Springer Handbook of Auditory Research.

[2]  M. Halle,et al.  Preliminaries to Speech Analysis: The Distinctive Features and Their Correlates , 1961 .

[3]  M R Schroeder,et al.  Flat-spectrum speech. , 1986, The Journal of the Acoustical Society of America.

[4]  B Hagerman,et al.  Sentences for testing speech intelligibility in noise. , 1982, Scandinavian audiology.

[5]  Manfred R. Schroeder,et al.  Computer Speech: Recognition, Compression, Synthesis , 1999 .

[6]  K. Wagener,et al.  Design, optimization and evaluation of a Danish sentence test in noise: Diseño, optimización y evaluación de la prueba Danesa de frases en ruido , 2003, International journal of audiology.

[7]  H Müsch,et al.  Using statistical decision theory to predict speech intelligibility. I. Model structure. , 2001, The Journal of the Acoustical Society of America.

[8]  Ma Conway,et al.  Handbook of perception and cognition , 1996 .

[9]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[10]  Christian Kaernbach,et al.  The memory of noise. , 2004, Experimental psychology.

[11]  B Kollmeier,et al.  Development and evaluation of a German sentence test for objective and subjective speech intelligibility assessment. , 1997, The Journal of the Acoustical Society of America.

[12]  S A Shamma,et al.  Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex. , 2001, Journal of neurophysiology.

[13]  M. P. Friedman,et al.  HANDBOOK OF PERCEPTION , 1977 .

[14]  Nathaniel I. Durlach,et al.  Chapter 11 – MODELS OF BINAURAL INTERACTION , 1978 .

[15]  Richard Lippmann,et al.  Speech recognition by machines and humans , 1997, Speech Commun..

[16]  Deborah Hayes,et al.  Handbook of Clinical Audiology (4th ed.) , 1994 .

[17]  Daniel Pressnitzer,et al.  The psychophysics and physiology of comodulation masking release , 2003, Experimental Brain Research.

[18]  T. Dau Modeling auditory processing of amplitude modulation , 1997 .

[19]  B. Moore,et al.  Auditory Frequency Selectivity , 1986, Nato ASI Series.

[20]  B Kollmeier,et al.  Speech intelligibility prediction in hearing-impaired listeners based on a psychoacoustically motivated perception model. , 1996, The Journal of the Acoustical Society of America.

[21]  B. Kollmeier,et al.  Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers. , 1997, The Journal of the Acoustical Society of America.

[22]  C. Schreiner,et al.  Periodicity coding in the inferior colliculus of the cat. II. Topographical organization. , 1988, Journal of neurophysiology.

[23]  R. Beutelmann,et al.  Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners. , 2006, The Journal of the Acoustical Society of America.

[24]  G. A. Miller,et al.  An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .

[25]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[26]  H Müsch,et al.  Using statistical decision theory to predict speech intelligibility. II. Measurement and prediction of consonant-discrimination performance. , 2001, The Journal of the Acoustical Society of America.

[27]  H. Fletcher,et al.  The Perception of Speech and Its Relation to Telephony , 1950 .

[28]  J. C. Steinberg,et al.  Factors Governing the Intelligibility of Speech Sounds , 1945 .

[29]  S. S. Stevens,et al.  Critical Band Width in Loudness Summation , 1957 .

[30]  M. D. Wang,et al.  Consonant confusions in noise: a study of perceptual features. , 1973, The Journal of the Acoustical Society of America.

[31]  Birger Kollmeier,et al.  Objective Modeling of Speech Quality with a Psychoacoustically Validated Auditory Model , 2000 .

[32]  S. Soli,et al.  Development of the Hearing in Noise Test for the measurement of speech reception thresholds in quiet and in noise. , 1994, The Journal of the Acoustical Society of America.

[33]  J Tchorz,et al.  A model of auditory perception as front end for automatic speech recognition. , 1999, The Journal of the Acoustical Society of America.

[34]  Birger Kollmeier,et al.  Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests. , 2002, The Journal of the Acoustical Society of America.

[35]  A. M. Mimpen,et al.  Improving the reliability of testing the speech reception threshold for sentences. , 1979, Audiology : official organ of the International Society of Audiology.

[36]  T. Houtgast,et al.  A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria , 1985 .