Automatic analysis of multimodal group actions in meetings
暂无分享,去创建一个
Samy Bengio | Daniel Gatica-Perez | Iain McCowan | Dong Zhang | Guillaume Lathoud | Mark Barnard | Samy Bengio | I. McCowan | D. Gática-Pérez | G. Lathoud | M. Barnard | Dong Zhang | Guillaume Lathoud
[1] Samy Bengio,et al. On automatic annotation of meeting databases , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).
[2] Steve Renals,et al. Dynamic Bayesian networks for meeting structuring , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[3] J. Carletta,et al. A simulation of small group discussion , 2002 .
[4] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[5] Mari Ostendorf,et al. Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data , 2003, NAACL.
[6] Stefan Eickeler,et al. Content-based video indexing of TV broadcast news using hidden Markov models , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[7] Hervé Glotin,et al. Multi-stream adaptive evidence combination for noise robust ASR , 2001, Speech Commun..
[8] A. Nakamura,et al. Nature (London , 1975 .
[9] Ramakant Nevatia,et al. Multi-agent event recognition , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.
[10] Tanja Schultz,et al. SMaRT: the Smart Meeting Room Task at ISL , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[11] Rosalind W. Picard,et al. Automated Posture Analysis for Detecting Learner's Interest Level , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.
[12] David C. Hogg,et al. The acquisition and use of interaction behaviour models , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).
[13] Jean-Marc Odobez,et al. A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking , 2003, ICCV 2003.
[14] Daniel P. W. Ellis,et al. Audio information access from meeting rooms , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[15] J. Markel,et al. The SIFT algorithm for fundamental frequency estimation , 1972 .
[16] Samy Bengio,et al. Torch: a modular machine learning software library , 2002 .
[17] Oh-Wook Kwon,et al. EMOTION RECOGNITION BY SPEECH SIGNAL , 2003 .
[18] Hagen Soltau,et al. Advances in automatic meeting record creation and access , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[19] Eric Horvitz,et al. Layered representations for human activity recognition , 2002, Proceedings. Fourth IEEE International Conference on Multimodal Interfaces.
[20] David G. Novick,et al. Applying task classification to natural meetings , 1995 .
[21] John Makhoul,et al. Rough'n'Ready: a meeting recorder and browser , 1999, CSUR.
[22] Jiucang Hao,et al. Emotion recognition by speech signals , 2003, INTERSPEECH.
[23] Daniel P. W. Ellis,et al. Pitch-based emphasis detection for characterization of meeting recordings , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[24] K. Parker,et al. Speaking turns in small group interaction: A context-sensitive event sequence model. , 1988 .
[25] Iain McCowan,et al. Segmenting multiple concurrent speakers using microphone arrays , 2003, INTERSPEECH.
[26] Alex Pentland,et al. A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..
[27] Iain McCowan,et al. Microphone array speech recognition: experiments on overlapping speech in meetings , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[28] R. Bales,et al. Symlog, A System for the Multiple Level Observation of Groups , 1979 .
[29] S. Garrod,et al. Group Discussion as Interactive Dialogue or as Serial Monologue: The Influence of Group Size , 2000, Psychological science.
[30] Elizabeth Shriberg,et al. Relationship between dialogue acts and hot spots in meetings , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[31] Hervé Bourlard,et al. New entropy based combination rules in HMM/ANN multi-stream ASR , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[32] Joseph H. DiBiase. A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays , 2000 .
[33] Shih-Fu Chang,et al. Structure analysis of soccer video with hidden Markov models , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[34] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[35] Andrew J. Viterbi,et al. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.
[36] Peter D. Bricker,et al. The role of audible and visible back-channel responses in interpersonal communication. , 1977 .
[37] E.,et al. GROUPS : INTERACTION AND PERFORMANCE , 2001 .
[38] Alex Pentland,et al. Coupled hidden Markov models for complex action recognition , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[39] D. E. Green. Group Research. , 1954, Science.
[40] B. Depaulo,et al. Decoding discrepant nonverbal cues. , 1978 .
[41] David G. Novick,et al. Coordinating turn-taking with gaze , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[42] Eric Fosler-Lussier,et al. Combining multiple estimators of speaking rate , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[43] W. Eric L. Grimson,et al. Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).
[44] Eric Horvitz,et al. Layered representations for learning and inferring office activity from multiple sensory channels , 2004, Comput. Vis. Image Underst..
[45] James W. Davis,et al. The KidsRoom: A Perceptually-Based Interactive and Immersive Story Environment , 1999, Presence.
[46] Anoop Gupta,et al. Distributed meetings: a meeting capture and broadcasting system , 2002, MULTIMEDIA '02.
[47] Zdravko Kacic,et al. Improved emotion recognition with large set of statistical features , 2003, INTERSPEECH.
[48] James M. Rehg,et al. Statistical Color Models with Application to Skin Detection , 2004, International Journal of Computer Vision.
[49] Michael S. Brandstein,et al. Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.
[50] Andreas Stolcke,et al. The Meeting Project at ICSI , 2001, HLT.
[51] Samy Bengio,et al. Modeling human interaction in meetings , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[52] Matthew Brand,et al. Coupled hidden Markov models for modeling interacting processes , 1997 .
[53] Elizabeth Shriberg,et al. Spotting "hot spots" in meetings: human judgments and prosodic cues , 2003, INTERSPEECH.
[54] Gerhard Rigoll,et al. Action Recognition in Meeting Scenarios using Global Motion Features , 2003 .
[55] Juergen Luettin,et al. Audio-Visual Speech Modeling for Continuous Speech Recognition , 2000, IEEE Trans. Multim..
[56] Juergen Luettin,et al. Audio-Visual Automatic Speech Recognition: An Overview , 2004 .
[57] Darren Moore,et al. The IDIAP Smart Meeting Room , 2002 .
[58] Alex Pentland,et al. Action Reaction Learning: Automatic Visual Analysis and Synthesis of Interactive Behaviour , 1999, ICVS.
[59] Samy Bengio,et al. Modeling Individual and Group Actions in Meetings: A Two-Layer HMM Framework , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.
[60] John S. Boreczky,et al. A hidden Markov model framework for video segmentation using audio and image features , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).