Environmental robustness in speech recognition using physiologically-motivated signal processing


[1]  Biing-Hwang Juang, et al.  A family of distortion measures based upon projection operation for robust speech recognition, 1989, IEEE Trans. Acoust. Speech Signal Process.

[2]  Oded Ghitza, et al.  Speech analysis/Synthesis based on matching the synthesized and the original representations in the auditory nerve level, 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Richard M. Stern, et al.  Efficient Cepstral Normalization for Robust Speech Recognition, 1993, HLT.

[4]  I. Whitfield.  Discharge Patterns of Single Fibers in the Cat's Auditory Nerve, 1966.

[5]  J. Moorer, et al.  The optimum comb method of pitch period analysis of continuous digitized speech, 1974.

[6]  Shihab A. Shamma, et al.  The acoustic features of speech sounds in a model of auditory processing: vowels and voiceless fricatives, 1988.

[7]  Jeffrey N. Marcus, et al.  Significance tests for comparing speech recognizer performance using small test sets, 1989, EUROSPEECH.

[8]  Charles Robert Jankowski, et al.  A comparison of auditory models for automatic speech recognition, 1992.

[9]  Richard F. Lyon.  A computational model of binaural localization and separation, 1983, ICASSP.

[10]  Roy D. Patterson, et al.  A Multi-representation Model for Auditory Processing of Sounds, 1992.

[11]  S. Seneff.  A joint synchrony/mean-rate model of auditory speech processing, 1990.

[12]  Victor Zue, et al.  A comparative study of acoustic representations of speech for vowel classification using multi-layer perceptrons, 1990, ICSLP.

[13]  Oded Ghitza.  Robustness against noise: The role of timing-synchrony measurement, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  Oded Ghitza, et al.  Auditory nerve representation as a front-end for speech recognition in a noisy environment, 1986.

[15]  M.G. Bellanger, et al.  Digital processing of speech signals, 1980, Proceedings of the IEEE.

[16]  A. Oppenheim, et al.  Computation of spectra with unequal resolution using the fast Fourier transform, 1971.

[17]  M. Hunt, et al.  Speech recognition using an auditory model with pitch-synchronous analysis, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Raj Reddy, et al.  Large-vocabulary speaker-independent continuous speech recognition: the sphinx system, 1988.

[19]  Oded Ghitza, et al.  Temporal non-place information in the auditory-nerve firing patterns as a front-end for speech recognition in a noisy environment, 1988.

[20]  T.H. Crystal, et al.  Linear prediction of speech, 1977, Proceedings of the IEEE.

[21]  B. Mazor, et al.  Telephone channel normalization for automatic speech recognition, 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  M. Sachs, et al.  Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers, 1979, The Journal of the Acoustical Society of America.

[23]  Richard Scott Goldhor, et al.  Representation of consonants in the peripheral auditory system: a modeling study of the correspondence between response properties and phonetic features, 1985.

[24]  M. Ross, et al.  Average magnitude difference function pitch extractor, 1974.

[25]  Richard O. Duda, et al.  Pattern classification and scene analysis, 1974, A Wiley-Interscience publication.

[26]  H. Hermansky, et al.  Perceptual linear predictive (PLP) analysis of speech, 1990, The Journal of the Acoustical Society of America.

[27]  Richard F. Lyon, et al.  A perceptual pitch detector, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[28]  E. Zwicker, et al.  Subdivision of the audible frequency range into critical bands, 1961.

[29]  I. G. BONNER CLAPPISON Editor , 1960, The Electric Power Engineering Handbook - Five Volume Set.

[30]  Alejandro Acero, et al.  Acoustical and environmental robustness in automatic speech recognition, 1991.

[31]  Richard M. Stern, et al.  Multiple Approaches to Robust Speech Recognition, 1992, HLT.

[32]  B. Widrow, et al.  Adaptive noise cancelling: Principles and applications, 1975.

[33]  Jonathan G. Fiscus, et al.  Tools for the analysis of benchmark speech recognition tests, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[34]  F. Jelinek, et al.  Continuous speech recognition by statistical methods, 1976, Proceedings of the IEEE.

[35]  H. Hermansky, et al.  Optimization of perceptually-based ASR front-end (automatic speech recognition), 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[36]  Stephanie Seneff, et al.  Pitch and spectral analysis of speech based on an auditory synchrony model, 1985.

[37]  Robert M. Gray, et al.  An Algorithm for Vector Quantizer Design, 1980, IEEE Trans. Commun.

[38]  Benjamin Peter Milner, et al.  Speech recognition in adverse environments, 1994.

[39]  Joseph Picone, et al.  Phone-mediated word alignment for speech recognition evaluation, 1990, IEEE Trans. Acoust. Speech Signal Process.

[40]  Richard M. Stern, et al.  Multi-microphone correlation-based processing for robust speech recognition, 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[41]  M. Hunt, et al.  Speaker dependent and independent speech recognition experiments with an auditory model, 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[42]  Richard F. Lyon, et al.  A computational model of filtering, detection, and compression in the cochlea, 1982, ICASSP.

[43]  Stephen Cox, et al.  Some statistical issues in the comparison of speech recognition algorithms, 1989, International Conference on Acoustics, Speech, and Signal Processing.

[44]  B. Atal.  Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification, 1974, The Journal of the Acoustical Society of America.

[45]  Yariv Ephraim, et al.  A linear predictive front-end processor for speech recognition in noisy environments, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.