Robust speech recognition using the modulation spectrogram
暂无分享,去创建一个
[1] A. W. F. Huggins. Temporally segmented speech , 1975 .
[2] Steven Greenberg,et al. Incorporating information from syllable-length time scales into automatic speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[3] Q Summerfield,et al. Perceiving vowels from uniform spectra: Phonetic exploration of an auditory aftereffect , 1984, Perception & psychophysics.
[4] T. Houtgast,et al. The Modulation Transfer Function in Room Acoustics as a Predictor of Speech Intelligibility , 1973 .
[5] T. Houtgast,et al. Predicting speech intelligibility in rooms from the modulation transfer function, I. General room acoustics , 1980 .
[6] D. D. Greenwood. Critical Bandwidth and the Frequency Coordinates of the Basilar Membrane , 1961 .
[7] Q Summerfield,et al. Minimal spectral contrast of formant peaks for vowel recognition as a function of spectral slope , 1994, Perception & psychophysics.
[8] Li Deng,et al. Maximum-likelihood estimation for articulatory speech recognition using a stochastic target model , 1995, EUROSPEECH.
[9] S. Furui,et al. Speaker-independent isolated word recognition based on emphasized spectral dynamics , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[10] Steven Greenberg,et al. ON THE ORIGINS OF SPEECH INTELLIGIBILITY IN THE REAL WORLD , 1997 .
[11] Hervé Bourlard,et al. Connectionist Speech Recognition: A Hybrid Approach , 1993 .
[12] T. Houtgast,et al. A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria , 1985 .
[13] Brian Kingsbury,et al. Recognizing reverberant speech with RASTA-PLP , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[14] T Houtgast,et al. A physical method for measuring speech-transmission quality. , 1980, The Journal of the Acoustical Society of America.
[15] R. Plomp,et al. Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.
[16] Steven Greenberg,et al. The modulation spectrogram: in pursuit of an invariant representation of speech , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[17] T. Langhans,et al. Speech enhancement by nonlinear multiband envelope filtering , 1982, ICASSP.
[18] Saeed Vaseghi,et al. An analysis of cepstral-time matrices for noise and channel robust speech recognition , 1995, EUROSPEECH.
[19] Nelson Morgan,et al. The Hybrid HMM/MLP Approach , 1994 .
[20] C. Schreiner,et al. Representation of amplitude modulation in the auditory cortex of the cat. I. The anterior auditory field (AAF) , 1986, Hearing Research.
[21] Steven Greenberg,et al. INSIGHTS INTO SPOKEN LANGUAGE GLEANED FROM PHONETIC TRANSCRIPTION OF THE SWITCHBOARD CORPUS , 1996 .
[22] M. Klitzner,et al. Hedonic integration: Test of a linear model , 1975 .
[23] R. G. Leonard,et al. A database for speaker-independent digit recognition , 1984, ICASSP.
[24] Steven Greenberg,et al. Speech Intelligibility is Highly Tolerant of Cross-Channel Spectral Asynchrony , 1998 .
[25] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.
[26] C. Schreiner,et al. Representation of amplitude modulation in the auditory cortex of the cat. II. Comparison between cortical fields , 1988, Hearing Research.
[27] Steven Greenberg,et al. Speech intelligibility in the presence of cross-channel spectral asynchrony , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[28] B. Kollmeier,et al. Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction. , 1994, The Journal of the Acoustical Society of America.
[29] Hynek Hermansky,et al. Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain , 1985, Speech Commun..
[30] Steven Greenberg,et al. Improving ASR Performance For Reverberant Speech , 1997 .
[31] Hynek Hermansky,et al. RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..