Radio Oranje: Enhanced Access to a Historical Spoken Word Collection

Access to historical audio collections is typically very restricted: content is often only available on physical (analog) media and the metadata is usually limited to keywords, giving access at the level of relatively large fragments, e.g., an entire tape. Many spoken word heritage collections are now being digitized, which allows the introduction of more advanced search technology. This paper presents an approach that supports online access and search for recordings of historical speeches. A demonstrator has been built, based on the so-called Radio Oranje collection, which contains radio speeches by the Dutch Queen Wilhelmina that were broadcast during World War II. The audio has been aligned with its original 1940s manual transcriptions to create a time-stamped index that enables the speeches to be searched at the word level. Results are presented together with related photos from an external database.

[1]  Franciska de Jong,et al.  Infolink: Analysis of Dutch Broadcast News and Cross-Media Browsing , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[2]  Tobias Lauer,et al.  An elastic audio slider for interactive speech skimming , 2004, NordiCHI '04.

[3]  Julia Hirschberg,et al.  Play it again: a study of the factors underlying speech browsing behavior , 1998, CHI Conference Summary.

[4]  Richard Sproat,et al.  High-accuracy automatic segmentation , 1999, EUROSPEECH.

[5]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[6]  Yeon-Jun Kim,et al.  Automatic segmentation combining an HMM-based approach and spectral boundary correction , 2002, INTERSPEECH.

[7]  S. Rapp Automatic Phonemic Transcription and Linguistic Annotation from Known Text with Hidden Markov Models , 1995 .

[8]  Douglas W. Oard,et al.  A graphical interface for speech-based retrieval , 1998, DL '98.

[9]  Julia Hirschberg,et al.  SCAN: designing and evaluating user interfaces to support retrieval from speech archives , 1999, SIGIR '99.

[10]  C. Fellbaum An Electronic Lexical Database , 1998 .

[11]  Howard D. Wactlar,et al.  Facilitating access to large digital oral history archives through informedia technologies , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).

[12]  Bhuvana Ramabhadran,et al.  Supporting access to large digital oral history archives , 2002, JCDL '02.

[13]  Roeland Ordelman,et al.  Exploration of audiovisual heritage using audio indexing technology , 2006 .

[14]  Maurizio Omologo,et al.  Automatic segmentation and labeling of speech based on Hidden Markov Models , 1993, Speech Commun..

[15]  Franciska de Jong,et al.  A Spoken Document Retrieval Application in the Oral History Domain , 2005 .

[16]  Franciska de Jong,et al.  Automated Speech and Audio Analysis for Semantic Access to Multimedia , 2006, SAMT.

[17]  Helmer Strik,et al.  Modeling pronunciation variation for ASR: A survey of the literature , 1999, Speech Commun..

[18]  Marti A. Hearst TileBars: visualization of term distribution information in full text information access , 1995, CHI '95.

[19]  Subramanian Sridharan,et al.  Automatic Speech Segmentation with HMM , 2002 .

[20]  Bhuvana Ramabhadran,et al.  Automatic recognition of spontaneous speech for access to multilingual oral history archives , 2004, IEEE Transactions on Speech and Audio Processing.

[21]  Dragutin Petkovic,et al.  Spoken Document Retrieval , 2000 .