Speech fragment decoding techniques for simultaneous speaker identification and speech recognition
Ning Ma | Jon Barker | Martin Cooke | André Coy
[1] R. M. Warren,et al. Spectral redundancy: Intelligibility of sentences heard through narrow spectral slits , 1995, Perception & psychophysics.
[2] Jon Barker,et al. The foreign language cocktail party problem: Energetic and informational masking effects in non-native speech perception. , 2008, The Journal of the Acoustical Society of America.
[3] Q. Summerfield,et al. Modeling the perception of concurrent vowels: vowels with different fundamental frequencies. , 1990, The Journal of the Acoustical Society of America.
[4] C. Darwin. Auditory grouping and attention to speech , 2001 .
[5] Martin Cooke,et al. A glimpsing model of speech perception in noise. , 2006, The Journal of the Acoustical Society of America.
[6] Ning Ma,et al. Exploiting correlogram structure for robust speech recognition with multiple speech sources , 2007, Speech Commun..
[7] Jon Barker,et al. An audio-visual corpus for speech perception and automatic speech recognition. , 2006, The Journal of the Acoustical Society of America.
[8] Daniel P. W. Ellis,et al. Decoding speech in the presence of other sources , 2005, Speech Commun..
[9] Ning Ma,et al. Recent advances in speech fragment decoding techniques , 2006, INTERSPEECH.
[10] Martin Cooke,et al. Robust automatic speech recognition with missing and unreliable acoustic data , 2001 .
[11] Jon Barker,et al. Soft decisions in missing data techniques for robust automatic speech recognition , 2000, INTERSPEECH.
[12] B. Shinn-Cunningham,et al. Note on informational masking (L) , 2003 .
[13] Tuomas Virtanen,et al. Speech recognition using factorial hidden Markov models for separation in the feature space , 2006, INTERSPEECH.
[14] M. Ericson,et al. Informational and energetic masking effects in the perception of multiple simultaneous talkers. , 2001, The Journal of the Acoustical Society of America.
[15] R Meddis,et al. Modeling the identification of concurrent vowels with different fundamental frequencies. , 1992, The Journal of the Acoustical Society of America.
[16] John R. Hershey,et al. Super-human multi-talker speech recognition: the IBM 2006 speech separation challenge system , 2006, INTERSPEECH.
[17] Roger K. Moore. Computer Speech and Language , 1986 .
[18] Jos B. T. M. Roerdink,et al. The Watershed Transform: Definitions, Algorithms and Parallelization Strategies , 2000, Fundam. Informaticae.
[19] B. Shinn-Cunningham,et al. Note on informational masking. , 2003, The Journal of the Acoustical Society of America.
[20] Alain de Cheveigné,et al. Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancell , 1993 .
[21] Jon Barker,et al. An automatic speech recognition system based on the scene analysis account of auditory perception , 2007, Speech Commun..
[22] Hermann Ney,et al. Data driven search organization for continuous speech recognition , 1992, IEEE Trans. Signal Process..
[23] John R. Hershey,et al. Monaural speech separation and recognition challenge , 2010, Comput. Speech Lang..
[24] David Pearce,et al. The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.