Searching by Talking: Analysis of Voice Queries on Mobile Web Search

The growing popularity of mobile search and the advancement in voice recognition technologies have opened the door for web search users to speak their queries, rather than type them. While this kind of voice search is still in its infancy, it is gradually becoming more widespread. In this paper, we examine the logs of a commercial search engine's mobile interface, and compare the spoken queries to the typed-in queries. We place special emphasis on the semantic and syntactic characteristics of the two types of queries. %Our analysis suggests that voice queries focus more on audio-visual content and question answering, and less on social networking and adult domains. We also conduct an empirical evaluation showing that the language of voice queries is closer to natural language than typed queries. Our analysis reveals further differences between voice and text search, which have implications for the design of future voice-enabled search tools.

[1]  Fabio Crestani,et al.  Written versus spoken queries: A qualitative and quantitative comparative analysis , 2006, J. Assoc. Inf. Sci. Technol..

[2]  Chih-Hung Hsieh,et al.  Towards better measurement of attention and satisfaction in mobile search , 2014, SIGIR.

[3]  Meredith Ringel Morris,et al.  #TwitterSearch: a comparison of microblog search and web search , 2011, WSDM '11.

[4]  Imed Zitouni,et al.  Automatic Online Evaluation of Intelligent Assistants , 2015, WWW.

[5]  Milad Shokouhi,et al.  Mobile query reformulations , 2014, SIGIR.

[6]  Jiulong Shan,et al.  Search by voice in Mandarin Chinese , 2010, INTERSPEECH.

[7]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[8]  Suzan Verberne Paragraph retrieval for why-question answering , 2007, SIGIR.

[9]  Umut Ozertem,et al.  Characterizing and Predicting Voice Query Reformulation , 2015, CIKM.

[10]  Ya Xu,et al.  Computers and iphones and mobile phones, oh my!: a logs-based comparison of search users on different devices , 2009, WWW '09.

[11]  Farzin Maghoul,et al.  Mobile search pattern evolution: the trend and the impact of voice queries , 2011, WWW.

[12]  Ryen W. White,et al.  Questions vs. Queries in Informational Search Tasks , 2015, WWW.

[13]  Ciprian Chelba,et al.  Empirical Exploration of Language Modeling for the google.com Query Stream as Applied to Mobile Voice Search , 2013 .

[14]  Robert E. Kraut,et al.  Expressive richness: a comparison of speech and text as media for revision , 1991, CHI.

[15]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[16]  Jane Li,et al.  Good abandonment in mobile and PC internet search , 2009, SIGIR.

[17]  Lada A. Adamic,et al.  Knowledge sharing and yahoo answers: everyone knows something , 2008, WWW.

[18]  Chin-Yew Lin,et al.  Automatic Question Generation from Queries , 2008 .

[19]  Geoffrey Zweig,et al.  Live search for mobile:Web services by voice on the cellphone , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  Daqing He,et al.  How do users respond to voice input errors?: lexical and phonetic query reformulation in voice search , 2013, SIGIR.

[21]  Farzin Maghoul,et al.  Deciphering mobile search patterns: a study of Yahoo! mobile search queries , 2008, WWW.

[22]  Milad Shokouhi,et al.  From Queries to Cards: Re-ranking Proactive Card Recommendations Based on Reactive Search History , 2015, SIGIR.

[23]  Kim-Phuong L. Vu,et al.  Privacy Concerns for Use of Voice Activated Personal Assistant in the Public Space , 2015, Int. J. Hum. Comput. Interact..

[24]  Lydia B. Chilton,et al.  Addressing people's information needs directly in a web search result page , 2011, WWW.

[25]  Yang Song,et al.  Exploring and exploiting user search behavior on mobile and tablet devices to improve search relevance , 2013, WWW '13.

[26]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[27]  Idan Szpektor,et al.  From query to question in one click: suggesting synthetic questions to searchers , 2013, WWW.

[28]  Idan Szpektor,et al.  Syntactic Parsing of Web Queries with Question Intent , 2016, HLT-NAACL.

[29]  R. Rosenfeld,et al.  Two decades of statistical language modeling: where do we go from here? , 2000, Proceedings of the IEEE.

[30]  Ricardo Baeza-Yates,et al.  A Study of Mobile Search Queries in Japan , 2007 .

[31]  Manish Gupta,et al.  Information Retrieval with Verbose Queries , 2015, Found. Trends Inf. Retr..

[32]  Shumeet Baluja,et al.  A large scale study of wireless search behavior: Google mobile search , 2006, CHI.

[33]  Francoise Beaufays,et al.  “Your Word is my Command”: Google Search by Voice: A Case Study , 2010 .

[34]  John D. Lafferty,et al.  Information retrieval as statistical translation , 1999, SIGIR '99.

[35]  Biing-Hwang Juang,et al.  Spoken Query Processing for Information Retrieval , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[36]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[37]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[38]  Geoffrey Zweig,et al.  Personalizing Model M for Voice-Search , 2011, INTERSPEECH.

[39]  Dong Yu,et al.  An introduction to voice search , 2008, IEEE Signal Processing Magazine.

[40]  Ido Guy,et al.  The Factoid Queries Collection , 2016, SIGIR.

[41]  Rosie Jones,et al.  The Linguistic Structure of English Web-Search Queries , 2008, EMNLP.

[42]  Tara N. Sainath,et al.  FUNDAMENTAL TECHNOLOGIES IN MODERN SPEECH RECOGNITION Digital Object Identifier 10.1109/MSP.2012.2205597 , 2012 .