A multi-channel/multi-speaker interactive 3D audio-visual speech corpus in Mandarin
暂无分享,去创建一个
Jun Yu | Lan Wang | Rongfeng Su | Wenpeng Zhou | Lan Wang | Rongfeng Su | Jun Yu | Wenpeng Zhou
[1] Lan Wang,et al. Multi-level adaptive network for accented Mandarin speech recognition , 2014, 2014 4th IEEE International Conference on Information Science and Technology.
[2] John Mason,et al. Robust voice activity detection using cepstral features , 1993, Proceedings of TENCON '93. IEEE Region 10 International Conference on Computers, Communications and Automation.
[3] Timothy F. Cootes,et al. Extraction of Visual Features for Lipreading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..
[4] Mohammed Bennamoun,et al. A 3D Audio-visual Corpus for Speech Recognition , 2012 .
[5] Javier R. Movellan,et al. Visual Speech Recognition with Stochastic Networks , 1994, NIPS.
[6] Eric David Petajan,et al. Automatic Lipreading to Enhance Speech Recognition (Speech Reading) , 1984 .
[7] Conrad Sanderson,et al. Biometric Person Recognition: Face, Speech and Fusion , 2008 .
[8] J. Kalita,et al. Outlier Identification using Symmetric Neighborhoods , 2012 .
[9] Hui Chen,et al. A multi-channel/multi-speaker articulatory database in Mandarin for speech visualization , 2014, The 9th International Symposium on Chinese Spoken Language Processing.
[10] Jon Barker,et al. An audio-visual corpus for speech perception and automatic speech recognition. , 2006, The Journal of the Acoustical Society of America.
[11] Juergen Luettin,et al. A comparison of model and transform-based visual features for audio-visual LVCSR , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..
[12] James R. Glass,et al. A segment-based audio-visual speech recognizer: data collection, development, and initial experiments , 2004, ICMI '04.
[13] Naomi Harte,et al. TCD-TIMIT: An Audio-Visual Corpus of Continuous Speech , 2015, IEEE Transactions on Multimedia.
[14] Andrea F. Abate,et al. 2D and 3D face recognition: A survey , 2007, Pattern Recognit. Lett..
[15] Tieniu Tan,et al. Depth vs. intensity: which is more important for face recognition? , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..