Multimodal group action clustering in meetings
暂无分享,去创建一个
Samy Bengio | Daniel Gatica-Perez | Iain McCowan | Dong Zhang | Guillaume Lathoud | Samy Bengio | I. McCowan | D. Gática-Pérez | G. Lathoud | Dong Zhang | Guillaume Lathoud
[1] J. Markel,et al. The SIFT algorithm for fundamental frequency estimation , 1972 .
[2] A. Nakamura,et al. Nature (London , 1975 .
[3] Peter D. Bricker,et al. The role of audible and visible back-channel responses in interpersonal communication. , 1977 .
[4] Roger Bakeman,et al. Observing Interaction: An Introduction to Sequential Analysis , 1986 .
[5] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[6] Eric Fosler-Lussier,et al. Combining multiple estimators of speaking rate , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[7] Alex Pentland,et al. A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..
[8] Juergen Luettin,et al. Audio-Visual Speech Modeling for Continuous Speech Recognition , 2000, IEEE Trans. Multim..
[9] Hagen Soltau,et al. Advances in automatic meeting record creation and access , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[10] Michael S. Brandstein,et al. Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.
[11] M. Irani,et al. Event-Based Video Analysis, , 2001 .
[12] Andreas Stolcke,et al. The Meeting Project at ICSI , 2001, HLT.
[13] Alex Pentland,et al. Towards Measuring Human Interactions in Conversational Settings , 2001 .
[14] E.,et al. GROUPS : INTERACTION AND PERFORMANCE , 2001 .
[15] Ramakant Nevatia,et al. Multi-agent event recognition , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.
[16] Hervé Bourlard,et al. Unknown-multiple speaker clustering using HMM , 2002, INTERSPEECH.
[17] Stuart J. Russell,et al. Dynamic bayesian networks: representation, inference and learning , 2002 .
[18] Anoop Gupta,et al. Distributed meetings: a meeting capture and broadcasting system , 2002, MULTIMEDIA '02.
[19] Mari Ostendorf,et al. Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data , 2003, NAACL.
[20] Elizabeth Shriberg,et al. Relationship between dialogue acts and hot spots in meetings , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[21] Shih-Fu Chang,et al. Unsupervised discovery of multilevel statistical video structures using hierarchical hidden Markov models , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).
[22] Jitendra Ajmera,et al. A robust speaker clustering algorithm , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[23] Jianbo Shi,et al. Detecting unusual activity in video , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..
[24] Steve Renals,et al. Dynamic Bayesian networks for meeting structuring , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[25] Eric Horvitz,et al. Layered representations for learning and inferring office activity from multiple sensory channels , 2004, Comput. Vis. Image Underst..
[26] Samy Bengio,et al. Modeling Individual and Group Actions in Meetings: A Two-Layer HMM Framework , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.
[27] Samy Bengio,et al. Automatic analysis of multimodal group actions in meetings , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.