Radio and television information filtering through speech recognition

Information overload can be addressed by applying information filtering to the large volume of incoming data. Information on radio and television can be filtered using speech recognition of the audio track. A prototype system using closed captions has been developed on top of the INQUERY information access system. Integrating speech recognition and information retrieval into a working system remains a major challenge. The open problems are the selection of a document representation model, the recognition and selection of indexing features for speech retrieval, and the handling of erroneous output from recognition processes.
