Reducing Misinformation in Query Autocompletions

Query autocompletions help users of search engines to speed up their searches by recommending completions of partially typed queries in a drop down box. These recommended query autocompletions are usually based on large logs of queries that were previously entered by the search engine's users. Therefore, misinformation entered -- either accidentally or purposely to manipulate the search engine -- might end up in the search engine's recommendations, potentially harming organizations, individuals, and groups of people. This paper proposes an alternative approach for generating query autocompletions by extracting anchor texts from a large web crawl, without the need to use query logs. Our evaluation shows that even though query log autocompletions perform better for shorter queries, anchor text autocompletions outperform query log autocompletions for queries of 2 words or more.

[1]  Philip N. Howard,et al.  Automation, Algorithms, and Politics| Automation, Big Data and Politics: A Research Review , 2016 .

[2]  M. Jakobsson,et al.  Autocompletion in full text transaction entry: a method for humanized input , 1986, CHI '86.

[3]  Paul Baker,et al.  ‘Why do white people have thin lips?’ Google and the perpetuation of stereotypes via auto-complete search forms , 2013 .

[4]  Milad Shokouhi,et al.  Time-sensitive query auto-completion , 2012, SIGIR '12.

[5]  Bhaskar Mitra,et al.  Query Auto-Completion for Rare Prefixes , 2015, CIKM.

[6]  Joemon M. Jose,et al.  Recent and robust query auto-completion , 2014, WWW.

[7]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[8]  W. Bruce Croft,et al.  Query reformulation using anchor text , 2010, WSDM '10.

[9]  Fei Cai,et al.  Prefix-Adaptive and Time-Sensitive Personalized Query Auto Completion , 2016, IEEE Transactions on Knowledge and Data Engineering.

[10]  Ingmar Weber,et al.  Type less, find more: fast autocompletion search with a succinct index , 2006, SIGIR.

[11]  Peng Wang,et al.  Game of Missuggestions: Semantic Analysis of Search-Autocomplete Manipulations , 2018, NDSS.

[12]  Ziv Bar-Yossef,et al.  Context-sensitive query auto-completion , 2011, WWW.

[13]  Reiner Kraft,et al.  Mining anchor text for query refinement , 2004, WWW '04.

[14]  Prasenjit Mitra,et al.  Query suggestions in the absence of query logs , 2011, SIGIR.

[15]  L. Ayalon,et al.  Age and Gender Stereotypes Reflected in Google's "Autocomplete" Function: The Portrayal and Possible Spread of Societal Stereotypes. , 2019, The Gerontologist.

[16]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[17]  Jiawei Han,et al.  adaQAC: Adaptive Query Auto-Completion via Implicit Negative Feedback , 2015, SIGIR.

[18]  Djoerd Hiemstra,et al.  MapReduce for Information Retrieval Evaluation: "Let's Quickly Test This on 12 TB of Data" , 2010, CLEF.

[19]  Tony Doyle,et al.  Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy , 2017, Inf. Soc..