论文信息 - Castsearch - Context Based Spoken Document Retrieval

Castsearch - Context Based Spoken Document Retrieval

The paper describes our work on the development of a system for retrieval of relevant stories from broadcast news. The system utilizes a combination of audio processing and text mining. The audio processing consists of a segmentation step that partitions the audio into speech and music. The speech is further segmented into speaker segments and then transcribed using an automatic speech recognition system, to yield text input for clustering using non-negative matrix factorization (NMF). We find semantic topics that are used to evaluate the performance for topic detection. Based on these topics we show that a novel query expansion can be performed to return more intelligent search results. We also show that the query expansion helps overcome errors of the automatic transcription.

Lars Kai Hansen | Lasse Lohilahti Mølgaard | Kasper Winther Jørgensen

[1] Lars Kai Hansen,et al. Unsupervised speaker change detection for broadcast news segmentation , 2006, 2006 14th European Signal Processing Conference.

[2] Lasse Lohilahti Mølgaard,et al. Tools for Automatic Audio Indexing , 2006 .

[3] James Allan,et al. Topic detection and tracking: event-based information organization , 2002 .

[4] Paul Lamere,et al. Sphinx-4: a flexible open source framework for speech recognition , 2004 .

[5] Chih-Jen Lin,et al. Projected Gradient Methods for Nonnegative Matrix Factorization , 2007, Neural Computation.

[6] Michael W. Berry,et al. Document clustering using nonnegative matrix factorization , 2006, Inf. Process. Manag..

[7] Xin Liu,et al. Document clustering based on non-negative matrix factorization , 2003, SIGIR.