Indexing Audio Documents by using Latent Semantic Analysis and SOM

This paper describes an important application for state-of-the-art automatic speech recognition, natural language processing, and information retrieval systems. Methods for enhancing the indexing of spoken documents by using latent semantic analysis and self-organizing maps are presented, motivated, and tested. The idea is to extract additional information from the structure of the document collection and use it for more accurate indexing by generating new index terms and stochastic index weights. The indexing methods are evaluated on two broadcast news databases (one French and one English) using the average document perplexity defined in this paper and test queries analyzed by human experts.
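
As a rough illustration of how latent semantic analysis can draw on the structure of a document collection, the sketch below applies a truncated SVD to a toy term-by-document matrix. It is generic textbook LSA and not the indexing pipeline evaluated in the paper; the matrix, the vocabulary, and the names `lsa_document_vectors` and `cosine` are assumptions made up for this example.

```python
import numpy as np


def lsa_document_vectors(term_doc, rank):
    """Project documents into a `rank`-dimensional latent space (truncated SVD)."""
    u, s, vt = np.linalg.svd(term_doc, full_matrices=False)
    # Keep only the `rank` largest singular values; each row of the result
    # is one document expressed in latent coordinates.
    return (np.diag(s[:rank]) @ vt[:rank]).T


def cosine(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


# Hypothetical toy counts (rows: terms, columns: documents).
# Documents 0 and 1 are about politics, documents 2 and 3 about sports.
term_doc = np.array([
    [2, 0, 0, 0],  # election
    [1, 1, 0, 0],  # vote
    [0, 2, 0, 0],  # minister
    [0, 0, 2, 1],  # goal
    [0, 0, 1, 2],  # match
], dtype=float)

# Similarity of the two politics documents on raw term counts.
raw_sim = cosine(term_doc[:, 0], term_doc[:, 1])

# Similarities after projecting onto the two strongest latent dimensions.
docs = lsa_document_vectors(term_doc, rank=2)
latent_sim = cosine(docs[0], docs[1])
cross_sim = cosine(docs[0], docs[2])

print(f"raw: {raw_sim:.2f}, latent: {latent_sim:.2f}, cross-topic: {cross_sim:.2f}")
```

In this toy collection the two politics documents share only one term, so their raw similarity is low, while in the latent space they fall on the same axis and the cross-topic similarity stays near zero. This is the sense in which collection structure can supply index information that a single document's own terms do not.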
