Portable meeting recorder

The design and implementation of a portable meeting recorder is presented. Composed of an omni-directional video camera with four-channel audio capture, the system saves a view of all the activity in a meeting and the directions from which people spoke. Subsequent analysis computes metadata that includes video activity analysis of the compressed data stream and audio processing that helps locate events that occurred during the meeting. Automatic calculation of the room in which the meeting occurred allows for efficient navigation of a collection of recorded meetings. A user interface is populated from the metadata description to allow for simple browsing and location of significant events.

[1]  Don Kimber,et al.  FlyCam: practical panoramic video , 2000, ACM Multimedia.

[2]  Hagen Soltau,et al.  Advances in automatic meeting record creation and access , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Ming-Ting Sun,et al.  CHAPTER 9 – MPEG-1 and MPEG-2 Video Standards , 1999 .

[4]  Andreas Stolcke,et al.  Multispeaker speech activity detection for the ICSI meeting recorder , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[5]  YangMing-Hsuan,et al.  Detecting Faces in Images , 2002 .

[6]  Ahmed M. Elgammal,et al.  Face detection in complex environments from color images , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[7]  Don Kimber,et al.  FlyCam: practical panoramic video and automatic camera control , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[8]  Chitra Dorai,et al.  Perceived visual motion descriptors from MPEG-2 for content-based HDTV annotation and retrieval , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).

[9]  Anil K. Jain,et al.  Face Detection in Color Images , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Ramesh A. Gopinath,et al.  Improved speaker segmentation and segments clustering using the bayesian information criterion , 1999, EUROSPEECH.

[12]  Gopal Sarma Pingali,et al.  Multimedia retrieval through spatio-temporal activity maps , 2001, MULTIMEDIA '01.

[13]  B. S. Manjunath,et al.  A Motion Activity Descriptor and Its Extraction in Compressed Domain , 2001, IEEE Pacific Rim Conference on Multimedia.

[14]  John K. Tsotsos,et al.  Eyes 'n ears face detection , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[15]  B. S. Manjunath,et al.  Panoramic video capturing and compressed domain virtual camera control , 2001, MULTIMEDIA '01.

[16]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[17]  Andrew Merlino,et al.  Segmentation, Content Extraction and Visualization of Broadcast News Video using Multistream Analysis , 1997 .

[18]  Ajay Divakaran,et al.  Video browsing system based on compressed domain feature extraction , 2000, IEEE Trans. Consumer Electron..

[19]  Laura A. Dabbish,et al.  A multi-view intelligent editor for digital video libraries , 2001, JCDL '01.

[20]  Alexander H. Waibel,et al.  Multimodal people ID for a multimedia meeting browser , 1999, MULTIMEDIA '99.

[21]  Anoop Gupta,et al.  Viewing meeting captured by an omni-directional camera , 2001, CHI.

[22]  Faouzi Kossentini,et al.  Local motion descriptors , 2001, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564).

[23]  Andreas Girgensohn,et al.  An intelligent media browser using automatic multimodal analysis , 1998, MULTIMEDIA '98.

[24]  Ralph Gross,et al.  Towards a multimodal meeting record , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[25]  Alexander H. Waibel,et al.  Face recognition in a meeting room , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[26]  Barry Arons,et al.  SpeechSkimmer: a system for interactively skimming recorded speech , 1997, TCHI.

[27]  Sue E. Johnson,et al.  Who spoke when? - automatic segmentation and clustering for determining speaker turns , 1999, EUROSPEECH.

[28]  Alexander G. Hauptmann,et al.  Text, Speech, and Vision for Video Segmentation: The InformediaTM Project , 1995 .

[29]  Berna Erol,et al.  Segmenting People in Meeting Videos Using Mixture Background and Object Models , 2002, IEEE Pacific Rim Conference on Multimedia.

[30]  Don Kimber,et al.  Acoustic Segmentation for Audio Browsers , 1997 .

[31]  Jerry D. Gibson,et al.  Handbook of Image and Video Processing , 2000 .