Spoken query processing for interactive information retrieval

It has long been recognized that interactivity improves the effectiveness of Information Retrieval systems. Speech is the most natural and interactive medium of communication and recent progress in speech recognition is making it possible to build systems that interact with the user via speech. However, given the typical length of queries submitted to Information Retrieval systems, it is easy to imagine that the effects of word recognition errors in spoken queries must be severely destructive on the system's effectiveness. The experimental work reported in this paper shows that the use of classical Information Retrieval techniques for spoken query processing is robust to considerably high levels of word recognition errors, in particular for long queries. Moreover, in the case of short queries, both standard relevance feedback and pseudo relevance feedback can be effectively employed to improve the effectiveness of spoken query processing.

[1]  L. R. Rasmussen,et al.  In information retrieval: data structures and algorithms , 1992 .

[2]  Ellen M. Voorhees,et al.  The TREC Spoken Document Retrieval Track: A Success Story , 2000, TREC.

[3]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[4]  Eric Keller,et al.  Fundamentals of speech synthesis and speech recognition: basic concepts, state-of-the-art and future challenges , 1995 .

[5]  Donna K. Harman,et al.  Ranking Algorithms , 1992, Information Retrieval: Data Structures & Algorithms.

[6]  Fabio Crestani,et al.  User's perception of relevance of spoken documents , 2000 .

[7]  Fabio Crestani,et al.  Exploiting the Similarity of Non-Matching Terms at Retrieval Time , 2000, Information Retrieval.

[8]  R. Silipo,et al.  Prosodic stress and topic detection in spoken sentences , 2000, Proceedings Seventh International Symposium on String Processing and Information Retrieval. SPIRE 2000.

[9]  Mark Sanderson,et al.  Advantages of query biased summaries in information retrieval , 1998, SIGIR '98.

[10]  Ellen M. Voorhees,et al.  Overview of the Seventh Text REtrieval Conference , 1998 .

[11]  Donna K. Harman,et al.  Relevance Feedback and Other Query Modification Techniques , 1992, Information retrieval (Boston).

[12]  Fabio Crestani,et al.  Vocal access to a newspaper archive: design issues and preliminary investigations , 1998, DL '99.

[13]  Fabio Crestani,et al.  Promoting Access to White Rose Research Papers Short Queries, Natural Language and Spoken Document Retrieval: Experiments at Glasgow University , 1997 .

[14]  Amit Singhal,et al.  Document expansion for speech retrieval , 1999, SIGIR '99.

[15]  K. Sparck Jones,et al.  Simple, proven approaches to text retrieval , 1994 .

[16]  Mark Sanderson,et al.  Word sense disambiguation and information retrieval , 1994, SIGIR '94.

[17]  Jinxi Xu,et al.  Solving the word mismatch problem through automatic text analysis , 1997 .

[18]  Ellen M. Voorhees,et al.  Overview of the seventh text retrieval conference (trec-7) [on-line] , 1999 .

[19]  A. Föhrenbach,et al.  SIMPLE++ , 2000, OR Spectr..

[20]  D. Talkin Fundamentals of Speech Synthesis and Speech Recognition , 1996 .

[21]  Fabio Crestani,et al.  Effects of word recognition errors in spoken query processing , 2000, Proceedings IEEE Advances in Digital Libraries 2000.

[22]  Mona Singh,et al.  Experiments in spoken queries for document retrieval , 1997, EUROSPEECH.

[23]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[24]  Cyril W. Cleverdon,et al.  Aslib Cranfield research project - Factors determining the performance of indexing systems; Volume 1, Design; Part 2, Appendices , 1966 .

[25]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[26]  Amit Singhal,et al.  AT&T at TREC-7 , 1998, TREC.