An analytical evaluation of search by content and interaction patterns on multimodal meeting records

It has been suggested that combining content-based indexing with automatically generated temporal metadata might help improve search and browsing of recordings of computer-mediated collaborative activities such as on-line meetings, which are characterised by extensive multimodal communication. This paper presents an analytical evaluation of the effectiveness of these techniques as implemented through automatic speech recognition and temporal mapping. In particular, it assesses the extent to which this strategy can help uncover contextual relationships between audio and text segments in recorded remote meetings. Results show that even simple temporal mapping can effectively support retrieval of recorded audio segments, improve retrieval performance in situations where speech recognition alone would have exhibited prohibitively high word error rates, and provide a basic form of semantic adaptation.

[1]  Berna Erol,et al.  MinuteAid: multimedia note-taking in an intelligent meeting room , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[2]  Saturnino Luz,et al.  Meeting browsing , 2007, Multimedia Systems.

[3]  Saturnino Luz,et al.  Meeting browsing State-ofthe-art review , 2006 .

[4]  Saturnino Luz,et al.  Temporal Mining of Recorded Collaborative Production of Artefacts , 2006, Industrial Conference on Data Mining.

[5]  Hagen Soltau,et al.  Advances in automatic meeting record creation and access , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[6]  S. Renals,et al.  Content-based access to spoken audio , 2005, IEEE Signal Processing Magazine.

[7]  Masood Masoodian,et al.  Gathering a corpus of multimodal computer-mediated meetings , 2006, LREC.

[8]  Saturnino Luz,et al.  Meeting browser: a system for visualising and accessing audio in multicast meetings , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).

[9]  Samy Bengio,et al.  Automatic analysis of multimodal group actions in meetings , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Mike Flynn,et al.  Browsing Recorded Meetings with Ferret , 2004, MLMI.

[11]  Klaus Zechner,et al.  Automatic generation of concise summaries of spoken dialogues in unrestricted domains , 2001, SIGIR '01.

[12]  Steve Whittaker,et al.  Accessing Multimodal Meeting Data: Systems, Problems and Possibilities , 2004, MLMI.

[13]  Sadaoki Furu AUTOMATIC SPEECH RECOGNITION AND ITS APPLICATION TO INFORMATION EXTRACTION , 1999, ACL 1999.

[14]  D. Tannen Talking Voices: Repetition, Dialogue, and Imagery in Conversational Discourse , 1989 .

[15]  Steve Whittaker,et al.  A meeting browser evaluation test , 2005, CHI Extended Abstracts.

[16]  Alan F. Smeaton Indexing, Browsing, and Searching of Digital Video and Digital Audio Information , 2000, ESSIR.

[17]  Abigail Sellen,et al.  Speech patterns in video-mediated conversations , 1992, CHI.

[18]  Saturnino Luz,et al.  Navigating Multimodal Meeting Recordings with the Meeting Miner , 2006, FQAS.

[19]  Ying Li,et al.  An overview of technologies for e-meeting and e-lecture , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[20]  Ramesh C. Jain Are we doing multimedia? , 2003, IEEE MultiMedia.

[21]  Julia Hirschberg,et al.  Now you hear it, now you don't: empirical studies of audio browsing behavior behavior , 1998, ICSLP.

[22]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[23]  Masood Masoodian,et al.  RECOLED: A group-aware collaborative text editor for capturing document history , 2005 .

[24]  Marios C. Angelides,et al.  Enriching MPEG-7 User Models with Content Metadata , 2006, 2006 First International Workshop on Semantic Media Adaptation and Personalization (SMAP'06).

[25]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[26]  Gregory D. Abowd,et al.  Making multimedia meeting records more meaningful , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).