Access to recorded interviews: A research agenda

Recorded interviews form a rich basis for scholarly inquiry. Examples include oral histories, community memory projects, and interviews conducted for broadcast media. Emerging technologies offer the potential to radically transform how recorded interviews are made accessible, but realizing this vision will demand substantial investments from a broad range of research communities. This article reviews the present state of practice for making recorded interviews available and the state of the art in key component technologies. It identifies a large number of important research issues and, from that set, proposes a coherent research agenda.
