Experiments in Spoken Document Retrieval

This paper describes experiments in the retrieval of spoken documents in multimedia systems. Spoken documents pose a particular problem for retrieval because their words, as well as their contents, are unknown. The work reported addresses this problem, for a video mail application, by combining state-of-the-art speech recognition with established document retrieval technologies to provide an effective and efficient retrieval tool. Tests on a small collection of spoken messages show that retrieval precision for the spoken file can reach 90% of the benchmark precision obtained when the same file is used in text transcription form.
