Dominant speaker detection based on voicing for adaptive audio-visual ASR robust to speech noise
暂无分享,去创建一个
[1] T. Baer,et al. Harmonics-to-noise ratio as an index of the degree of hoarseness. , 1982, The Journal of the Acoustical Society of America.
[2] Paul Duchnowski,et al. Adaptive bimodal sensor fusion for automatic speechreading , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[3] Alexandrina Rogozan,et al. Adaptive determination of audio and visual weights for automatic speech recognition , 1997, AVSP.
[4] Hervé Glotin,et al. A measure of speech and pitch reliability from voicing , 1999, IJCAI 1999.
[5] Hervé Glotin,et al. A new SNR-feature mapping for robust multistream speech recognition , 1999 .
[6] Hervé Glotin,et al. Test of several external posterior weighting functions for multiband full combination ASR , 2000, INTERSPEECH.
[7] Juergen Luettin,et al. Audio-Visual Speech Modeling for Continuous Speech Recognition , 2000, IEEE Trans. Multim..
[8] Hervé Glotin,et al. Weighting schemes for audio-visual fusion in speech recognition , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[9] Hervé Glotin. Elaboration et comparaison de systèmes adaptatifs multi-flux de reconnaissance robuste de la parole : incorporation des indices de voisement et de localisation , 2001 .
[10] Martin Heckmann,et al. Optimal weighting of posteriors for audio-visual speech recognition , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[11] Juergen Luettin,et al. Asynchronous stream modeling for large vocabulary audio-visual speech recognition , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).