Speech Processing in the Auditory System: An Overview

[1]  Björn Lindblom,et al.  Explaining Phonetic Variation: A Sketch of the H&H Theory , 1990 .

[2]  Claude E. Shannon,et al.  A mathematical theory of communication , 1948, MOCO.

[3]  Francis M. Wiener,et al.  The Pressure Distribution in the Auditory Canal in a Progressive Sound Field , 1946 .

[4]  I. Lehiste chapter 7 – Suprasegmental Features of Speech , 1976 .

[5]  R. Plomp,et al.  Perceptual and physical space of vowel sounds. , 1969, The Journal of the Acoustical Society of America.

[6]  Roger Shattuck,et al.  The forbidden experiment : the story of the Wild Boy of Aveyron , 1980 .

[7]  P. Ladefoged Three areas of experimental phonetics , 1967 .

[8]  S. Shamma Speech processing in the auditory system. I: The representation of speech sounds in the responses of the auditory nerve. , 1985, The Journal of the Acoustical Society of America.

[9]  Peter D. Eimas,et al.  Perspectives on the study of speech , 1981 .

[10]  M B Sachs,et al.  Nonlinearities in auditory-nerve fiber responses to bandlimited noise. , 1980, The Journal of the Acoustical Society of America.

[11]  Noam Chomsky,et al.  New Horizons in the Study of Language and Mind: Naturalism and dualism in the study of language and mind , 2008 .

[12]  N. G. Zagoruyko,et al.  Automatic recognition of 200 words , 1970 .

[13]  B Blesser,et al.  Speech perception under conditions of spectral transformation. I. Phonetic characteristics. , 1972, Journal of speech and hearing research.

[14]  S. Greenberg,et al.  A specialization for speech perception? , 1989, Science.

[15]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[16]  G. W. Harding,et al.  Degeneration in the cochlea after noise damage: primary versus secondary events. , 2000, The American journal of otology.

[17]  D. Irvine The Auditory Brainstem , 1986, Progress in Sensory Physiology.

[18]  Colin Humphries,et al.  Role of left posterior superior temporal gyrus in phonological processing for speech perception and production , 2001 .

[19]  P. Lieberman Uniquely Human: The Evolution of Speech, Thought, and Selfless Behavior , 1991 .

[20]  Daniel P. W. Ellis,et al.  The auditory organization of speech and other sources in listeners and computational models , 2001, Speech Commun..

[21]  E. F. Evans,et al.  Psychophysics and Physiology of Hearing , 1979 .

[22]  B. Kollmeier,et al.  Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction. , 1994, The Journal of the Acoustical Society of America.

[23]  William A. Ainsworth,et al.  Speech Recognition by Machine , 1988 .

[24]  Richard R. Fay,et al.  Comparative Hearing: Mammals , 1994, Springer Handbook of Auditory Research.

[25]  J. Flanagan Estimates of the Maximum Precision Necessary in Quantizing Certain , 1957 .

[26]  G. A. Miller,et al.  An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .

[27]  Gerald Langner,et al.  Periodicity coding in the auditory system , 1992, Hearing Research.

[28]  F. Jelinek,et al.  Continuous speech recognition by statistical methods , 1976, Proceedings of the IEEE.

[29]  Noam Chomsky,et al.  The faculty of language: what is it, who has it, and how did it evolve? , 2002 .

[30]  C V Pavlovic,et al.  An articulation index based procedure for predicting the speech recognition performance of hearing-impaired individuals. , 1986, The Journal of the Acoustical Society of America.

[31]  J. Gibson The Senses Considered As Perceptual Systems , 1967 .

[32]  J. L. Goldstein,et al.  Auditory nerve rate-level functions for two-tone stimuli: Possible relation to basilar membrane nonlinearity , 1989, Hearing Research.

[33]  C. G. Rice Mechanisms of speech recognition , 1976 .

[34]  C. Schreiner,et al.  Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms. , 1988, Journal of neurophysiology.

[35]  A. M. Mimpen,et al.  The ear as a frequency analyzer. II. , 1964, The Journal of the Acoustical Society of America.

[36]  C. D. Geisler,et al.  Frequency selectivity of single cochlear-nerve fibers based on the temporal response pattern to two-tone signals. , 1986, The Journal of the Acoustical Society of America.

[37]  Oded Ghitza,et al.  Temporal non-place information in the auditory-nerve firing patterns as a front-end for speech recognition in a noisy environment , 1988 .

[38]  Rolf Carlson,et al.  The representation of speech in the peripheral auditory system : proceedings of the Symposium on the Representation of Speech in the peripheral Auditory System, held in Stockholm, Sweden on May 17-19, 1982 , 1982 .

[39]  H. Fletcher,et al.  The Perception of Speech and Its Relation to Telephony , 1950 .

[40]  Richard M. Stern,et al.  Chapter 10 – Models of Binaural Interaction , 1995 .

[41]  Dennis H. Klatt,et al.  Speech perception: a model of acoustic–phonetic analysis and lexical access , 1979 .

[42]  J R Dubno,et al.  Frequency selectivity and consonant recognition for hearing-impaired and normal-hearing listeners with equivalent masked thresholds. , 1995, The Journal of the Acoustical Society of America.

[43]  E. Lenneberg Understanding language without ability to speak: a case report. , 1962, Journal of abnormal and social psychology.

[44]  A. Liberman,et al.  An effect of linguistic experience: The discrimination of [r] and [l] by native speakers of Japanese and English , 1975 .

[45]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[46]  David J. Anderson,et al.  Temporal Position of Discharges in Single Auditory Nerve Fibers within the Cycle of a Sine‐Wave Stimulus: Frequency and Intensity Effects , 1971 .

[47]  Richard R. Fay,et al.  The Mammalian Auditory Pathway: Neuroanatomy , 1992, Springer Handbook of Auditory Research.

[48]  B E Walden,et al.  Spectral distribution of prosodic information. , 1996, Journal of speech and hearing research.

[49]  J. C. Steinberg,et al.  Factors Governing the Intelligibility of Speech Sounds , 1945 .

[50]  E. E. David,et al.  Human communication : a unified view , 1972 .

[51]  D D Dirks,et al.  Application of the Articulation Index and the Speech Transmission Index to the recognition of speech by normal-hearing and hearing-impaired listeners. , 1986, Journal of speech and hearing research.

[52]  C. W. Carter,et al.  The words and sounds of telephone conversations , 1930 .

[53]  D. Pisoni,et al.  Acoustic-phonetic representations in word recognition , 1987, Cognition.

[54]  H. Sussman,et al.  An investigation of locus equations as a source of relational invariance for stop place categorization , 1991 .

[55]  William A. Ainsworth Pitch change as a cue to syllabification , 1986 .

[56]  Draga Zec Sonority constraints on syllable structure , 1995, Phonology.

[57]  Keith R. Kluender,et al.  Contributions of nonhuman animal models to understanding human speech perception , 2000 .

[58]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[59]  S. Greenberg,et al.  The ear as a speech analyzer , 1988 .

[60]  Ch. von der Malsburg,et al.  A neural cocktail-party processor , 1986, Biological Cybernetics.

[61]  Robert Patuzzi,et al.  Non-Linear Aspects of Outer Hair Cell Transduction and the Temporary Threshold Shifts after Acoustic Trauma , 2002, Audiology and Neurotology.

[62]  Steven Greenberg,et al.  Speech intelligibility in the presence of cross-channel spectral asynchrony , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[63]  Claude E. Shannon,et al.  The Mathematical Theory of Communication , 1950 .

[64]  D D Dirks,et al.  Auditory filter characteristics and consonant recognition for hearing-impaired listeners. , 1989, The Journal of the Acoustical Society of America.

[65]  R. Plomp,et al.  Effect of reducing slow temporal modulations on speech reception. , 1994, The Journal of the Acoustical Society of America.

[66]  E. Zwicker “Negative Afterimage” in Hearing , 1964 .

[67]  George A. Miller,et al.  Language and Communication , 1951 .

[68]  Ma Conway,et al.  Handbook of perception and cognition , 1996 .

[69]  Paul Mermelstein,et al.  Difference limens for formant frequencies of steady‐state and consonant‐bound vowels , 1976 .

[70]  C. Darwin,et al.  The Quarterly Journal of Experimental Psychology Section a Human Experimental Psychology Perceptual Grouping of Speech Components Differing in Fundamental Frequency and Onset-time Perceptual Grouping of Speech Components Differing in Fundamental Frequency and Onset-time , 2022 .

[71]  M. Liberman,et al.  Response properties of cochlear efferent neurons: monaural vs. binaural stimulation and the effects of noise. , 1988, Journal of neurophysiology.

[72]  T. Brennen Processing the facial image , 1994 .

[73]  M. Sachs,et al.  Rate-place and temporal-place representations of vowels in the auditory nerve and anteroventral cochlear nucleus , 1988 .

[74]  Richard Lippmann,et al.  Accurate consonant perception without mid-frequency speech energy , 1996, IEEE Trans. Speech Audio Process..

[75]  E R Hafter,et al.  Just-noticeable differences of frequency for masked tones. , 1980, The Journal of the Acoustical Society of America.

[76]  K. Davis,et al.  Automatic Recognition of Spoken Digits , 1952 .

[77]  P. Ladefoged A course in phonetics , 1975 .

[78]  Steven Greenberg,et al.  ON THE ORIGINS OF SPEECH INTELLIGIBILITY IN THE REAL WORLD , 1997 .

[79]  Malcolm J. Crocker,et al.  Encyclopedia of Acoustics , 1998 .

[80]  Shihab Shamma,et al.  Auditory Representations of Timbre and Pitch , 1996 .

[81]  Steven Greenberg,et al.  Speech intelligibility derived from exceedingly sparse spectral information , 1998, ICSLP.

[82]  Wolfgang Fuerst,et al.  Introductory digital signal processing with computer applications , 1989 .

[83]  J. L. Goldstein,et al.  A central spectrum model: a synthesis of auditory-nerve timing and place cues in monaural communication of frequency spectrum. , 1983, The Journal of the Acoustical Society of America.

[84]  Steven Greenberg,et al.  The relation between speech intelligibility and the complex modulation spectrum , 2001, INTERSPEECH.

[85]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[86]  R. Klinke,et al.  HEARING — Physiological Bases and Psychophysics , 1983, Springer Berlin Heidelberg.

[87]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[88]  R. Ruben,et al.  Auditory Deprivation and Its Consequences: From Animal Models to Humans , 1996 .

[89]  Q. Summerfield,et al.  Auditory enhancement of changes in spectral amplitude. , 1987, The Journal of the Acoustical Society of America.

[90]  L. Lisker,et al.  A Cross-Language Study of Voicing in Initial Stops: Acoustical Measurements , 1964 .

[91]  A. Boothroyd,et al.  Mathematical treatment of context effects in phoneme and word recognition. , 1988, The Journal of the Acoustical Society of America.

[92]  B. C. Griffith,et al.  The discrimination of speech sounds within and across phoneme boundaries. , 1957, Journal of experimental psychology.

[93]  J. Gibson The Ecological Approach to Visual Perception , 1979 .

[94]  M. Hauser The Evolution of Communication , 1996 .

[95]  W. S. Rhode,et al.  Lateral suppression and inhibition in the cochlear nucleus of the cat. , 1994, Journal of neurophysiology.

[96]  C D Geisler,et al.  A two-stage nonlinear cochlear model possesses automatic gain control. , 1986, The Journal of the Acoustical Society of America.

[97]  T. Houtgast,et al.  A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria , 1985 .

[98]  W. Ganong Phonetic categorization in auditory word perception. , 1980, Journal of experimental psychology. Human perception and performance.

[99]  John G. Proakis,et al.  Digital Signal Processing: Principles, Algorithms, and Applications , 1992 .

[100]  W Hemmert,et al.  Resonant tectorial membrane motion in the inner ear: its crucial role in frequency tuning. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[101]  Martin Cooke,et al.  Modelling auditory processing and organisation , 1993, Distinguished dissertations in computer science.

[102]  L. A. Chistovich Central auditory processing of peripheral vowel spectra. , 1985, The Journal of the Acoustical Society of America.

[103]  A van Wieringen,et al.  Frequency and duration discrimination of short first-formant speechlike transitions. , 1994, The Journal of the Acoustical Society of America.

[104]  J D Miller,et al.  Speech perception by the chinchilla: identification function for synthetic VOT stimuli. , 1978, The Journal of the Acoustical Society of America.

[105]  H. Helmholtz,et al.  On the Sensations of Tone as a Physiological Basis for the Theory of Music , 2005 .

[106]  Björn Lindblom,et al.  Economy of Speech Gestures , 1983 .

[107]  W. Warr Organization of Olivocochlear Efferent Systems in Mammals , 1992 .

[108]  D. Buonomano,et al.  Learning and Generalization of Auditory Temporal–Interval Discrimination in Humans , 1997, The Journal of Neuroscience.

[109]  Steven Greenberg,et al.  From here to utility - melding phonetic insight with speech technology , 2001, INTERSPEECH.

[110]  E. Knudsen Instructed learning in the auditory localization pathway of the barn owl , 2002, Nature.

[111]  Steven Greenberg,et al.  Beyond the phoneme: a juncture-accent model of spoken language , 2002 .

[112]  T. Houtgast,et al.  The Modulation Transfer Function in Room Acoustics as a Predictor of Speech Intelligibility , 1973 .

[113]  Steven Greenberg,et al.  UNDERSTANDING SPEECH UNDERSTANDING: TOWARDS A UNIFIED THEORY OF SPEECH PERCEPTION , 1996 .

[114]  Keith R. Kluender,et al.  Effects of glide slope, noise intensity, and noise duration on the extrapolation of FM glides through noise , 1992, Perception & psychophysics.

[115]  Alessandra Angelucci,et al.  Induction of visual orientation modules in auditory cortex , 2000, Nature.

[117]  Jennifer S. Pardo,et al.  On the perceptual organization of speech. , 1994, Psychological review.

[118]  Ken W. Grant,et al.  Evaluating the articulation index for auditory–visual consonant recognition , 1993 .

[119]  Mark B. Gardner,et al.  The Dependence of Hearing Impairment on Sound Intensity , 1937 .

[120]  J. Flanagan A Difference Limen for Vowel Formant Frequency , 1955 .

[121]  S. S. Stevens,et al.  Critical Band Width in Loudness Summation , 1957 .

[122]  M. D. Wang,et al.  Consonant confusions in noise: a study of perceptual features. , 1973, The Journal of the Acoustical Society of America.

[123]  Hynek Hermansky,et al.  Should recognizers have ears? , 1998, Speech Commun..

[124]  I. Pollack Message Uncertainty and Message Reception , 1959 .

[125]  Philip Lieberman,et al.  The Biology and Evolution of Language , 1984 .

[126]  D J Van Tasell,et al.  Speech waveform envelope cues for consonant recognition. , 1987, The Journal of the Acoustical Society of America.

[127]  J. Licklider,et al.  A duplex theory of pitch perception , 1951, Experientia.

[128]  S. Blumstein,et al.  Invariant cues for place of articulation in stop consonants. , 1978, The Journal of the Acoustical Society of America.

[129]  B. Delgutte,et al.  Speech coding in the auditory nerve: IV. Sounds with consonant-like dynamic characteristics. , 1984, The Journal of the Acoustical Society of America.

[130]  M. Sachs,et al.  The representations of the steady-state vowel sound /e/ in the discharge patterns of cat anteroventral cochlear nucleus neurons. , 1990, Journal of neurophysiology.

[131]  Dwight L. Bolinger,et al.  Intonation and Its Uses: Melody in Grammar and Discourse , 1989 .

[132]  Vittorio Gallese,et al.  Mirror Neurons and the Evolution of Brain and Language , 2002 .

[133]  G. S. Ohm Ueber die Definition des Tones, nebst daran geknüpfter Theorie der Sirene und ähnlicher tonbildender Vorrichtungen , 1843 .

[134]  Steven Greenberg,et al.  LINGUISTIC DISSECTION OF SWITCHBOARD-CORPUS AUTOMATIC SPEECH RECOGNITION SYSTEMS , 2000 .

[135]  K. Kluender,et al.  Effects of first formant onset properties on voicing judgments result from processes not specific to humans. , 1991, The Journal of the Acoustical Society of America.

[136]  Louis C. W. Pols,et al.  Discrimination of short and rapid speechlike transitions , 1998 .

[137]  Steven Greenberg,et al.  Speaking in shorthand - A syllable-centric perspective for understanding pronunciation variation , 1999, Speech Commun..

[138]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[139]  P K Kuhl,et al.  Enhanced discriminability at the phonetic boundaries for the voicing feature in macaques , 1982, Perception & psychophysics.

[140]  Elizabeth Shriberg,et al.  Using prosodic and lexical information for speaker identification , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[141]  N. Viemeister Temporal modulation transfer functions based upon modulation thresholds. , 1979, The Journal of the Acoustical Society of America.

[142]  W. S. Rhode,et al.  Physiological study of neurons in the dorsal and posteroventral cochlear nucleus of the unanesthetized cat. , 1987, Journal of neurophysiology.

[143]  Kenneth N. Stevens,et al.  On the quantal nature of speech , 1972 .

[144]  A.R.D. Thornton,et al.  Foundations of Modern Auditory Theory , 1970 .

[145]  C. Fowler An event approach to the study of speech perception from a direct realist perspective , 1986 .

[146]  A. Liberman,et al.  Tempo of frequency change as a cue for distinguishing classes of speech sounds. , 1956, Journal of experimental psychology.

[147]  Norman J. Lass,et al.  Principles of Experimental Phonetics , 1996 .

[148]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[149]  J M Festen,et al.  Relations between auditory functions in normal hearing. , 1981, The Journal of the Acoustical Society of America.

[150]  D. D. Greenwood A cochlear frequency-position function for several species--29 years later. , 1990, The Journal of the Acoustical Society of America.

[151]  A. Liberman,et al.  The motor theory of speech perception revised , 1985, Cognition.

[152]  M. M. Gibson,et al.  Initial discharge latency and threshold considerations for some neurons in cochlear nuclear complex of the cat. , 1978, Journal of neurophysiology.

[153]  Steven Greenberg,et al.  A Composite Model of the Auditory Periphery for the Processing of Speech (Invited) , 1988 .

[154]  W. Strange,et al.  Effects of discrimination training on the perception of /r-l/ by Japanese adults learning English , 1984, Perception & psychophysics.

[155]  J. L. Mozziconacci PITCH VARIATIONS AND EMOTION IN SPEECH , 1995 .

[156]  P. D. Eimas,et al.  Selective adaptation of linguistic feature detectors , 1973 .

[157]  B. Lindblom,et al.  Numerical Simulation of Vowel Quality Systems: The Role of Perceptual Contrast , 1972 .

[158]  W. Huggins A Phase Principle for Complex‐Frequency Analysis and Its Implications in Auditory Theory , 1952 .

[159]  M. Sachs,et al.  Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers. , 1979, The Journal of the Acoustical Society of America.

[160]  R. Fay,et al.  The Mammalian auditory pathway : neurophysiology , 1992 .

[161]  R. Plomp,et al.  Effect of spectral envelope smearing on speech reception. II. , 1992, The Journal of the Acoustical Society of America.

[162]  C. Schreiner,et al.  Representation of amplitude modulation in the auditory cortex of the cat. I. The anterior auditory field (AAF) , 1986, Hearing Research.

[163]  N Suga,et al.  Experience-dependent plasticity in the auditory cortex and the inferior colliculus of bats: role of the corticofugal system. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[164]  W A Ainsworth,et al.  Perception of pitch movement on tonic syllables in British English. , 1986, The Journal of the Acoustical Society of America.

[165]  J. Ohala The Origin of Sound Patterns in Vocal Tract Constraints , 1983 .

[166]  D Kewley-Port,et al.  Time-varying features as correlates of place of articulation in stop consonants. , 1983, The Journal of the Acoustical Society of America.

[167]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[168]  Guy J. Brown,et al.  Computational auditory scene analysis , 1994, Comput. Speech Lang..

[169]  D. M. Green,et al.  Intensity discrimination as a function of frequency and sensation level. , 1977, The Journal of the Acoustical Society of America.

[170]  M. Breeuwer,et al.  Speechreading supplemented with frequency‐selective sound‐pressure information , 1984 .

[171]  Richard B Ivry,et al.  A neural instantiation of the motor theory of speech perception Comment from Richard B. Ivry and Timothy C. Justus to Nicolson et al. , 2001, Trends in Neurosciences.

[172]  D. Poeppel,et al.  Task-induced asymmetry of the auditory evoked M100 neuromagnetic field elicited by speech sounds. , 1996, Brain research. Cognitive brain research.

[173]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[174]  B. Moore An Introduction to the Psychology of Hearing , 1977 .

[175]  S. G. Nooteboom,et al.  Intonation and the perceptual separation of simultaneous voices , 1982 .

[176]  M. Merzenich,et al.  Plasticity in the frequency representation of primary auditory cortex following discrimination training in adult owl monkeys , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[177]  A M Liberman,et al.  Perception of the speech code. , 1967, Psychological review.

[178]  G. Clements Papers in Laboratory Phonology: The role of the sonority cycle in core syllabification , 1990 .

[179]  D. Bolinger Intonation and its parts : melody in spoken English , 1987 .

[180]  Christoph E. Schreiner,et al.  Representation of CV-sounds in cat primary auditory cortex: intensity dependence , 2003, Speech Commun..

[181]  R. Smith Short-term adaptation in single auditory nerve fibers: some poststimulatory effects. , 1977 .

[182]  Nathaniel I. Durlach,et al.  Chapter 11 – MODELS OF BINAURAL INTERACTION , 1978 .

[183]  D. Crystal,et al.  Intonation and Grammar in British English , 1967 .

[184]  Shinji Maeda,et al.  Multi-pulse LPC modeling of articulatory movements , 1998, Speech Commun..

[185]  Steven Greenberg,et al.  Speech intelligibility derived from asynchronous processing of auditory-visual information , 2001, AVSP.

[186]  Anthony W. Gummer,et al.  Mechanical Transduction in Outer Hair Cells , 2002, Audiology and Neurotology.

[187]  D. D. Greenwood Critical Bandwidth and the Frequency Coordinates of the Basilar Membrane , 1961 .

[188]  C. Richter Cochlear Implants: Fundamentals and Applications , 2004 .

[189]  Michael Studdert-Kennedy Mirror neurons, vocal imitation, and the evolution of particulate speech , 2002 .

[190]  N. Suga,et al.  Specificity of combination-sensitive neurons for processing of complex biosonar signals in auditory cortex of the mustached bat. , 1983, Journal of neurophysiology.

[191]  H Müsch,et al.  Using statistical decision theory to predict speech intelligibility. I. Model structure. , 2001, The Journal of the Acoustical Society of America.

[192]  D. Pisoni,et al.  Speech perception without traditional speech cues. , 1981, Science.

[193]  Steven Greenberg,et al.  Speech Intelligibility is Highly Tolerant of Cross-Channel Spectral Asynchrony , 1998 .

[194]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[195]  D. D. Greenwood,et al.  The intensitive DL of tones: Dependence of signal/masker ratio on tone level and on spectrum of added noise , 1993, Hearing Research.

[196]  B. Moore,et al.  Effects of spectral smearing on the intelligibility of sentences in noise , 1993 .

[197]  O. Fujimura,et al.  Sweep-tone measurements of vocal-tract characteristics. , 1971, The Journal of the Acoustical Society of America.

[198]  C A Fowler,et al.  Listeners do hear sounds, not tongues. , 1996, The Journal of the Acoustical Society of America.

[199]  Louis C. W. Pols,et al.  Acoustics and perception of dynamic vowel segments , 1993, Speech Commun..

[200]  Richard R. Fay,et al.  Integrative Functions in the Mammalian Auditory Pathway , 2002, Springer Handbook of Auditory Research.

[201]  C. Watson,et al.  Formant-frequency discrimination for isolated English vowels. , 1994, The Journal of the Acoustical Society of America.

[202]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[203]  S. Shamma Speech processing in the auditory system. II: Lateral inhibition and the central processing of speech evoked activity in the auditory nerve. , 1985, The Journal of the Acoustical Society of America.

[204]  Peter F. MacNeilage,et al.  The Production of Speech , 2011, Springer New York.

[205]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[206]  M. Sanders Handbook of Sensory Physiology , 1975 .

[207]  G F Smoorenburg,et al.  Speech reception in quiet and in noisy conditions by individuals with noise-induced hearing loss in relation to their tone audiogram. , 1989, The Journal of the Acoustical Society of America.

[208]  M. Sachs,et al.  Representation of stop consonants in the discharge patterns of auditory-nerve fibers. , 1983, The Journal of the Acoustical Society of America.

[209]  Q. Summerfield,et al.  Lipreading and audio-visual speech perception. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[210]  K. Stevens,et al.  Emotions and speech: some acoustical correlates. , 1972, The Journal of the Acoustical Society of America.

[211]  Jont B. Allen,et al.  How do humans process and recognize speech? , 1993, IEEE Trans. Speech Audio Process..

[212]  W. H. Sumby,et al.  Visual contribution to speech intelligibility in noise , 1954 .

[213]  P. D. Eimas,et al.  Speech Perception in Infants , 1971, Science.

[214]  J. Ohala,et al.  Speech perception is hearing sounds, not tongues. , 1994, The Journal of the Acoustical Society of America.

[215]  H Müsch,et al.  Using statistical decision theory to predict speech intelligibility. II. Measurement and prediction of consonant-discrimination performance. , 2001, The Journal of the Acoustical Society of America.

[216]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[217]  Harvey b. Fletcher,et al.  Speech and hearing in communication , 1953 .

[218]  Pickett The Sounds of Speech Communication , 1980 .

[219]  P. Kuhl,et al.  Cross-language analysis of phonetic units in language addressed to infants. , 1997, Science.

[220]  E Villchur Multichannel compression processing for profound deafness. , 1987, Journal of rehabilitation research and development.

[221]  R. R. Riesz Differential Intensity Sensitivity of the Ear for Pure Tones , 1928 .

[222]  K. Grant,et al.  Auditory-visual speech recognition by hearing-impaired subjects: consonant recognition, sentence recognition, and auditory-visual integration. , 1998, The Journal of the Acoustical Society of America.

[223]  A M Liberman,et al.  A specialization for speech perception. , 1989, Science.

[224]  W M Rabinowitz,et al.  Relations among different measures of speech reception in subjects using a cochlear implant. , 1992, The Journal of the Acoustical Society of America.

[225]  Michael Studdert-Kennedy,et al.  Launching Language: The Gestural Origin of Discrete Infinity , 2002 .

[226]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.

[227]  Richard R. Fay,et al.  Clinical Aspects of Hearing , 1996, Springer Handbook of Auditory Research.

[228]  R. Plomp The Role of Modulation in Hearing , 1983 .

[229]  M. Liberman,et al.  Efferent-mediated protection from acoustic overexposure: relation to slow effects of olivocochlear stimulation. , 1995, Journal of neurophysiology.

[230]  D. M. Green,et al.  Frequency discrimination as a function of frequency and sensation level. , 1977, The Journal of the Acoustical Society of America.

[231]  P. Ladefoged,et al.  The sounds of the world's languages , 1996 .

[232]  K. Grant,et al.  Measures of auditory-visual integration in nonsense syllables and sentences. , 1998, The Journal of the Acoustical Society of America.

[233]  J. E. Rose,et al.  Phase-locked response to low-frequency tones in single auditory nerve fibers of the squirrel monkey. , 1967, Journal of neurophysiology.

[234]  M. Sachs,et al.  Effects of nonlinearities on speech encoding in the auditory nerve. , 1979, The Journal of the Acoustical Society of America.

[235]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[236]  D H Klatt,et al.  Review of text-to-speech conversion for English. , 1987, The Journal of the Acoustical Society of America.

[237]  P. Ladefoged Preliminaries to linguistic phonetics , 1967 .

[238]  Shihab A. Shamma,et al.  The acoustic features of speech sounds in a model of auditory processing: vowels and voiceless fricatives , 1988 .

[239]  W. S. Rhode,et al.  A composite model of the auditory periphery for the processing of speech based on the filter response functions of single auditory-nerve fibers. , 1991, The Journal of the Acoustical Society of America.

[240]  Philip Lieberman,et al.  Eve Spoke: Human Language and Human Evolution , 1998 .