Multimodal Fusion and Learning with Uncertain Features Applied to Audiovisual Speech Recognition
Petros Maragos | Vassilis Pitsikalis | Athanasios Katsamanis | George Papandreou
[1] Jocelyn Sietsma, et al. Creating artificial neural networks that generalize, 1991, Neural Networks.
[2] J.N. Gowdy, et al. CUAVE: A new audio-visual database for multimodal human-computer interface research, 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[3] Petros Maragos, et al. Multimodal fusion by adaptive compensation for feature uncertainty with application to audiovisual speech recognition, 2006, 2006 14th European Signal Processing Conference.
[4] D. Rubin, et al. Maximum likelihood from incomplete data via the EM algorithm plus discussions on the paper, 1977.
[5] Douglas A. Reynolds, et al. Integrated models of signal and background with application to speaker identification in noise, 1994, IEEE Trans. Speech Audio Process.
[6] Timothy F. Cootes, et al. Active Appearance Models, 1998, ECCV.
[7] Lawrence R. Rabiner, et al. A tutorial on hidden Markov models and selected applications in speech recognition, 1989, Proc. IEEE.
[8] Hervé Glotin, et al. Multi-stream adaptive evidence combination for noise robust ASR, 2001, Speech Commun.
[9] Kevin P. Murphy, et al. Dynamic Bayesian Networks for Audio-Visual Speech Recognition, 2002, EURASIP J. Adv. Signal Process.
[10] Chalapathy Neti, et al. Recent advances in the automatic recognition of audiovisual speech, 2003, Proc. IEEE.
[11] William H. Press, et al. Numerical recipes, 1990.
[12] Li Deng, et al. Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion, 2005, IEEE Transactions on Speech and Audio Processing.
[13] Petros Maragos, et al. Adaptive multimodal fusion by uncertainty compensation, 2006, INTERSPEECH.
[14] Mari Ostendorf, et al. ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition, 1993, IEEE Trans. Speech Audio Process.
[15] Timothy F. Cootes, et al. Extraction of Visual Features for Lipreading, 2002, IEEE Trans. Pattern Anal. Mach. Intell.
[16] Robert D. Nowak, et al. Wavelet-based statistical signal processing using hidden Markov models, 1998, IEEE Trans. Signal Process.
[17] Brendan J. Frey, et al. Learning Dynamic Noise Models from Noisy Speech for Robust Speech Recognition, 2001.
[18] Hervé Glotin, et al. Weighting schemes for audio-visual fusion in speech recognition, 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[19] Paul A. Viola, et al. Rapid object detection using a boosted cascade of simple features, 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.
[20] Néstor Becerra Yoma, et al. Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm, 2002, IEEE Trans. Speech Audio Process.
[21] Juergen Luettin, et al. Audio-Visual Speech Modeling for Continuous Speech Recognition, 2000, IEEE Trans. Multim.
[22] Alexandros Potamianos, et al. Stream Weight Computation for Multi-Stream Classifiers, 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.