Web Search as a Linguistic Tool

Many people rely on web search engines to check the spelling or grammatical correctness of input phrases. For example, one might search [recurring or reoccurring] to decide between these similar words. While language-related queries are common, they have low click-through rates, lack a strong intent signal, and are generally challenging to study. Perhaps for these reasons, they have yet to be characterized in the literature. In this paper we report the results of two surveys that investigate how, when, and why people use web search to support low-level, language-related tasks. The first survey was distributed by email, and asked participants to reflect on a recent search task. The second survey was embedded directly in search result pages, and captured information about searchers' intents in-situ. Our analysis confirms that language-related search tasks are indeed common, accounting for at least 2.7% of all queries posed by our respondents. Survey responses also reveal: (1) the range of language-related tasks people perform with search, (2) the contexts in which these tasks arise, and (3), the reasons why people elect to use web search rather than relying on traditional proofing tools (e.g., spelling and grammar checkers).

[1]  References , 1971 .

[2]  James Gaston Bloodsworth Legibility of Print. , 1993 .

[3]  Daniel Fallman,et al.  The penguin: using the web as a database for descriptive and dynamic grammar and spell checking , 2002, CHI Extended Abstracts.

[4]  Kai A. Olsen,et al.  Spelling and grammar checking using the Web as a text repository , 2004, J. Assoc. Inf. Sci. Technol..

[5]  Eric Brill,et al.  Spelling Correction as an Iterative Process that Exploits the Collective Knowledge of Web Users , 2004, EMNLP.

[6]  Jonas Sjöbergh The Internet as a Normative Corpus : Grammar Checking with a Search Engine , 2006 .

[7]  Joaquim Moré,et al.  A grammar checker based on web searching , 2006 .

[8]  Yang Zhang,et al.  Exploring Distributional Similarity Based Models for Query Spelling Correction , 2006, ACL.

[9]  Dan Morris,et al.  Investigating the querying and browsing behavior of advanced search engine users , 2007, SIGIR.

[10]  Marc Sebban,et al.  Correct your text with Google , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[11]  Pascal Poupart,et al.  Is the sky pure today? AwkChecker: an assistive tool for detecting and correcting collocation errors , 2008, UIST '08.

[12]  Stan Szpakowicz,et al.  Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors , 2008, LREC.

[13]  Amanda Spink,et al.  Determining the informational, navigational, and transactional intent of Web queries , 2008, Inf. Process. Manag..

[14]  Jianfeng Gao,et al.  A Web-based English Proofing System for English as a Second Language Users , 2008, IJCNLP.

[15]  Jane Li,et al.  Good abandonment in mobile and PC internet search , 2009, SIGIR.

[16]  Ryen W. White,et al.  Characterizing the influence of domain expertise on web search behavior , 2009, WSDM '09.

[17]  Xu Sun,et al.  A Large Scale Ranker-Based System for Search Query Spelling Correction , 2010, COLING.

[18]  Michael Gamon,et al.  Search right and thou shalt find ... Using Web Queries for Learner Error Detection , 2010 .

[19]  Sofia Stamou,et al.  Interpreting User Inactivity on Search Results , 2010, ECIR.

[20]  Ryen W. White,et al.  Personalizing web search results by reading level , 2011, CIKM '11.

[21]  Lydia B. Chilton,et al.  Addressing people's information needs directly in a web search result page , 2011, WWW.

[22]  Ryen W. White,et al.  Leaving so soon?: understanding and predicting web search abandonment rationales , 2012, CIKM.

[23]  Meredith Ringel Morris,et al.  Enhancing Technical Q&A Forums with CiteHistory , 2013, ICWSM.

[24]  Adam Fourney,et al.  InterTwine: creating interapplication information scent to support coordinated use of software , 2014, UIST.

[25]  Ido Guy,et al.  Searching by Talking: Analysis of Voice Queries on Mobile Web Search , 2016, SIGIR.