DNN-Based Automatic Speech Recognition as a Model for Human Phoneme Perception
暂无分享,去创建一个
[1] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .
[2] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.
[3] Odette Scharenborg,et al. The interspeech 2008 consonant challenge , 2008, INTERSPEECH.
[4] Tim Jürgens,et al. Modelling the human-machine gap in speech reception: microscopic speech intelligibility prediction for normal-hearing subjects with an auditory model , 2007, INTERSPEECH.
[5] Richard M. Stern,et al. Hearing Is Believing: Biologically Inspired Methods for Robust Automatic Speech Recognition , 2012, IEEE Signal Processing Magazine.
[6] T. Brand,et al. Microscopic prediction of speech recognition for listeners with normal hearing in noise using an auditory model. , 2009, The Journal of the Acoustical Society of America.
[7] Marc René Schädler,et al. Comparing Different Flavors of Spectro-Temporal Features for ASR , 2011, INTERSPEECH.
[8] Jon Barker,et al. A framework for the evaluation of microscopic intelligibility models , 2015, INTERSPEECH.
[9] W. Dreschler,et al. ICRA noises: artificial noise signals with speech-like spectral and temporal properties for hearing instrument assessment. International Collegium for Rehabilitative Audiology. , 2001, Audiology : official organ of the International Society of Audiology.
[10] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[11] Bernd T. Meyer,et al. Spectro-temporal Gabor features for speaker recognition , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Nelson Morgan,et al. Longer Features: They do a speech detector good , 2012, INTERSPEECH.
[13] Niko Moritz,et al. Should deep neural nets have ears? the role of auditory features in deep learning approaches , 2014, INTERSPEECH.
[14] Birger Kollmeier,et al. Learning from human errors: prediction of phoneme confusions based on modified ASR training , 2010, INTERSPEECH.
[15] Hynek Hermansky,et al. Should recognizers have ears? , 1998, Speech Commun..
[16] B. Kollmeier,et al. Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition. , 2012, The Journal of the Acoustical Society of America.