Unsupervised Speech/Non-Speech Detection for Automatic Speech Recognition in Meeting Rooms
Petr Motlícek | Daniel Gatica-Perez | Hari Krishna Maganti
[1] Daniel Gatica-Perez, et al. Speech Acquisition in Meetings with an Audio-Visual Sensor Array, 2005, 2005 IEEE International Conference on Multimedia and Expo.
[2] I. McCowan, et al. The multi-channel Wall Street Journal audio visual corpus (MC-WSJ-AV): specification and initial experiments, 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005.
[3] Hermann Ney, et al. Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach, 2002, Speech Commun.
[4] Mark J. F. Gales, et al. The Cambridge University March 2005 speaker diarisation system, 2005, INTERSPEECH.
[5] Chin-Hui Lee, et al. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, 1994, IEEE Trans. Speech Audio Process.
[6] Lukás Burget, et al. The AMI System for the Transcription of Speech in Meetings, 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[7] Jean-Claude Junqua, et al. 16. Robustness and cooperative multimodal human-machine communication applications, 2000.
[8] Philip C. Woodland, et al. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, 1995, Comput. Speech Lang.
[9] Misha Pavel, et al. On the relative importance of various components of the modulation spectrum for automatic speech recognition, 1999, Speech Commun.
[10] Steven Greenberg, et al. On the origins of speech intelligibility in the real world, 1997.
[11] Jithendra Vepa, et al. The segmentation of multi-channel meeting recordings for automatic speech recognition, 2006, INTERSPEECH.
[12] R. Plomp, et al. Effect of reducing slow temporal modulations on speech reception, 1994, The Journal of the Acoustical Society of America.
[13] Lawrence R. Rabiner, et al. An algorithm for determining the endpoints of isolated utterances, 1975, Bell Syst. Tech. J.
[14] Malcolm Slaney, et al. Construction and evaluation of a robust multifeature speech/music discriminator, 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[15] L. Rabiner, et al. An algorithm for determining the endpoints of isolated utterances, 1974, The Bell System Technical Journal.
[16] Gerasimos Potamianos, et al. Automatic Speech Recognition and Speech Activity Detection in the CHIL Smart Room, 2005, MLMI.
[17] Steve Young, et al. The HTK book version 3.4, 2006.
[18] Hervé Bourlard, et al. Speech/music segmentation using entropy and dynamism features in a HMM classification framework, 2003, Speech Commun.
[19] Hynek Hermansky, et al. Auditory Modeling in Automatic Recognition of Speech, 1996.
[20] Daben Liu, et al. Speech and language technologies for audio indexing and retrieval, 2000, Proceedings of the IEEE.