Query by babbling: a research agenda

The Spoken Web, an interconnected collection of spoken content accessed through audio-only mobile phones, holds the promise of transforming information access for users in developing regions. The scale of the Spoken Web is, however, limited because current speech retrieval technology is only affordably deployable for a handful of languages. This paper proposes rethinking the conventional keyword query paradigm to instead develop systems that support a longer, richer, and more fluid interaction style that is better suited to both the affordances of spoken interaction and to the limitations of current speech technology.

[1]  Douglas W. Oard Unlocking the Potential of the Spoken Word , 2008, Science.

[2]  Gareth J. F. Jones,et al.  Overview of the CLEF-2005 Cross-Language Speech Retrieval Track , 2005, CLEF.

[3]  Nicholas J. Belkin,et al.  Query length in interactive information retrieval , 2003, SIGIR.

[4]  Florian Metze,et al.  Spoken Web Search , 2011, MediaEval.

[5]  Ronald Rosenfeld,et al.  Speech vs. touch-tone: Telephony interfaces for information access by low literate users , 2009, 2009 International Conference on Information and Communication Technologies and Development (ICTD).

[6]  Peter L. Bartlett,et al.  Learning with Missing Features , 2011, UAI.

[7]  Niranjan Balasubramanian,et al.  Exploring reductions for long web queries , 2010, SIGIR.

[8]  George Basalla,et al.  The Evolution of Technology: Selection (2): Social and Cultural Factors , 1989 .

[9]  Kenney Ng,et al.  Subword-based approaches for spoken document retrieval , 2000, Speech Commun..

[10]  Sougata Mukherjea,et al.  Two-stream indexing for spoken web search , 2011, WWW.

[11]  Sougata Mukherjea,et al.  Faceted search and browsing of audio content on spoken web , 2010, CIKM.

[12]  Charles L. Wayne Multilingual Topic Detection and Tracking: Successful Research Enabled by Corpora and Evaluation , 2000, LREC.

[13]  Matthew Lease,et al.  Beyond keywords: finding information more accurately and easily using natural language , 2010 .

[14]  Frank Seide,et al.  Unsupervised speaker adaptation for telephone call transcription , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Kentaro Toyama,et al.  Designing mobile interfaces for novice and low-literacy users , 2011, TCHI.

[16]  Paul Green,et al.  Safety and Usability of Speech Interfaces for In-Vehicle Tasks while Driving: A Brief Literature Review , 2006 .

[17]  Jaime Teevan The re:search engine: simultaneous support for finding and re-finding , 2007, UIST '07.

[18]  Aren Jansen,et al.  NLP on Spoken Documents Without ASR , 2010, EMNLP.

[19]  Rosie Jones,et al.  Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs , 2008, CIKM '08.

[20]  C. Buckley,et al.  Reliable Information Access Final Workshop Report , 2004 .

[21]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[22]  Douglas W. Oard,et al.  Combining LVCSR and vocabulary-independent ranked utterance retrieval for robust speech search , 2009, SIGIR.

[23]  Nitendra Rajput,et al.  Social ranking for spoken web search , 2011, CIKM '11.

[24]  Ketki A. Dhanesha,et al.  User driven audio content navigation for spoken web , 2010, ACM Multimedia.

[25]  David D. Lewis,et al.  Representation and Learning in Information Retrieval , 1991 .

[26]  Arun Kumar,et al.  The spoken web: a web for the underprivileged , 2010, LINK.

[27]  Vitor R. Carvalho,et al.  Reducing long queries using query quality predictors , 2009, SIGIR.

[28]  William Thies,et al.  IVR Junction: Building Scalable and Distributed Voice Forums in the Developing World , 2012, NSDR.

[29]  David G. Novick,et al.  What is Mixed-Initiative Interaction? , 1997 .

[30]  Bhuvana Ramabhadran,et al.  Vocabulary independent spoken term detection , 2007, SIGIR.

[31]  Maarten van Someren,et al.  The Think Aloud Method: A Practical Guide to Modelling Cognitive Processes , 1994 .