Paper information - Environmental robustness in speech recognition using physiologically-motivated signal processing

Environmental robustness in speech recognition using physiologically-motivated signal processing

Yoshiaki Ohshima

[1] Biing-Hwang Juang, et al. A family of distortion measures based upon projection operation for robust speech recognition, 1989, IEEE Trans. Acoust. Speech Signal Process.

[2] Oded Ghitza, et al. Speech analysis/Synthesis based on matching the synthesized and the original representations in the auditory nerve level, 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3] Richard M. Stern, et al. Efficient Cepstral Normalization for Robust Speech Recognition, 1993, HLT.

[4] I. Whitfield. Discharge Patterns of Single Fibers in the Cat's Auditory Nerve, 1966.

[5] J. Moorer, et al. The optimum comb method of pitch period analysis of continuous digitized speech, 1974.

[6] Shihab A. Shamma, et al. The acoustic features of speech sounds in a model of auditory processing: vowels and voiceless fricatives, 1988.

[7] Jeffrey N. Marcus, et al. Significance tests for comparing speech recognizer performance using small test sets, 1989, EUROSPEECH.

[8] Charles Robert Jankowski, et al. A comparison of auditory models for automatic speech recognition, 1992.

[9] Richard F. Lyon. A computational model of binaural localization and separation, 1983, ICASSP.

[10] Roy D. Patterson, et al. A Multi-representation Model for Auditory Processing of Sounds, 1992.

[11] S. Seneff. A joint synchrony/mean-rate model of auditory speech processing, 1990.

[12] Victor Zue, et al. A comparative study of acoustic representations of speech for vowel classification using multi-layer perceptrons, 1990, ICSLP.

[13] Oded Ghitza. Robustness against noise: The role of timing-synchrony measurement, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14] Oded Ghitza, et al. Auditory nerve representation as a front-end for speech recognition in a noisy environment, 1986.

[15] M.G. Bellanger, et al. Digital processing of speech signals, 1980, Proceedings of the IEEE.

[16] A. Oppenheim, et al. Computation of spectra with unequal resolution using the fast Fourier transform, 1971.

[17] M. Hunt, et al. Speech recognition using an auditory model with pitch-synchronous analysis, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18] Raj Reddy, et al. Large-vocabulary speaker-independent continuous speech recognition: the SPHINX system, 1988.

[19] Oded Ghitza, et al. Temporal non-place information in the auditory-nerve firing patterns as a front-end for speech recognition in a noisy environment, 1988.

[20] T.H. Crystal, et al. Linear prediction of speech, 1977, Proceedings of the IEEE.

[21] B. Mazor, et al. Telephone channel normalization for automatic speech recognition, 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22] M. Sachs, et al. Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers, 1979, The Journal of the Acoustical Society of America.

[23] Richard Scott Goldhor, et al. Representation of consonants in the peripheral auditory system: a modeling study of the correspondence between response properties and phonetic features, 1985.

[24] M. Ross, et al. Average magnitude difference function pitch extractor, 1974.

[25] Richard O. Duda, et al. Pattern classification and scene analysis, 1974, A Wiley-Interscience publication.

[26] H. Hermansky, et al. Perceptual linear predictive (PLP) analysis of speech, 1990, The Journal of the Acoustical Society of America.

[27] Richard F. Lyon, et al. A perceptual pitch detector, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[28] E. Zwicker, et al. Subdivision of the audible frequency range into critical bands, 1961.

[29] I. G. BONNER CLAPPISON. Editor, 1960, The Electric Power Engineering Handbook - Five Volume Set.

[30] Alejandro Acero, et al. Acoustical and environmental robustness in automatic speech recognition, 1991.

[31] Richard M. Stern, et al. Multiple Approaches to Robust Speech Recognition, 1992, HLT.

[32] B. Widrow, et al. Adaptive noise cancelling: Principles and applications, 1975.

[33] Jonathan G. Fiscus, et al. Tools for the analysis of benchmark speech recognition tests, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[34] F. Jelinek, et al. Continuous speech recognition by statistical methods, 1976, Proceedings of the IEEE.

[35] H. Hermansky, et al. Optimization of perceptually-based ASR front-end (automatic speech recognition), 1988, ICASSP-88, International Conference on Acoustics, Speech, and Signal Processing.

[36] Stephanie Seneff, et al. Pitch and spectral analysis of speech based on an auditory synchrony model, 1985.

[37] Robert M. Gray, et al. An Algorithm for Vector Quantizer Design, 1980, IEEE Trans. Commun.

[38] Benjamin Peter Milner, et al. Speech recognition in adverse environments, 1994.

[39] Joseph Picone, et al. Phone-mediated word alignment for speech recognition evaluation, 1990, IEEE Trans. Acoust. Speech Signal Process.

[40] Richard M. Stern, et al. Multi-microphone correlation-based processing for robust speech recognition, 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[41] M. Hunt, et al. Speaker dependent and independent speech recognition experiments with an auditory model, 1988, ICASSP-88, International Conference on Acoustics, Speech, and Signal Processing.

[42] Richard F. Lyon, et al. A computational model of filtering, detection, and compression in the cochlea, 1982, ICASSP.

[43] Stephen Cox, et al. Some statistical issues in the comparison of speech recognition algorithms, 1989, International Conference on Acoustics, Speech, and Signal Processing.

[44] B. Atal. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification, 1974, The Journal of the Acoustical Society of America.

[45] Yariv Ephraim, et al. A linear predictive front-end processor for speech recognition in noisy environments, 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.