Analyzing URL queries

This study investigated a relatively unexamined query type, queries composed of URLs. The extent, variation, and user click-through behavior was examined to determine the intent behind URL queries. The study made use of a search log from which URL queries were identified and selected for both qualitative and quantitative analyses. It was found that URL queries accounted for ∼17% of the sample. There were statistically significant differences between URL queries and non-URL queries in the following attributes: mean query length; mean number of tokens per query; and mean number of clicks per query. Users issuing such queries clicked on fewer result list items higher up the ranking compared to nonURL queries. Classification indicated that nearly 86% of queries were navigational in intent with informational and transactional queries representing about 7% of URL queries each. This is in contrast to past research that suggested that URL queries were 100% navigational.The conclusions of this study are that URL queries are relatively common and that simply returning the page that matches a user’s URL is not an optimal strategy.

[1]  Doug Downey,et al.  Understanding the relationship between searchers' queries and information goals , 2008, CIKM '08.

[2]  Min-Yen Kan Web page classification without the web page , 2004, WWW Alt. '04.

[3]  Amanda Spink,et al.  Sexual and pornographic Web searching: Trends analysis , 2006, First Monday.

[4]  Deepak Khemani,et al.  Unsupervised Learning from URL Corpora , 2006, COMAD.

[5]  Carolyn Watters,et al.  A field study characterizing Web-based information-seeking tasks , 2007 .

[6]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[7]  Ricardo Baeza-Yates,et al.  Analysis of Web Search Engine Query Sessions , 2006 .

[8]  James P. Callan,et al.  Combining Structural Information and the Use of Priors in Mixed Named-Page and Homepage Finding , 2003, TREC.

[9]  Andrea H. Tapia,et al.  Searching for salvation: An analysis of US religious searching on the World Wide Web , 2010 .

[10]  Sriram Raghavan,et al.  Navigating the intranet with high precision , 2007, WWW '07.

[11]  Yiqun Liu,et al.  Automatic Query Type Identification Based on Click Through Information , 2006, AIRS.

[12]  Wessel Kraaij,et al.  Annotation of URLs: more than the sum of parts , 2009, SIGIR.

[13]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[14]  Wei-Ying Ma,et al.  Exploring URL Hit Priors for Web Search , 2006, ECIR.

[15]  In-Ho Kang,et al.  Query type classification for web document retrieval , 2003, SIGIR.

[16]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[17]  Jaime Teevan,et al.  Information re-retrieval: repeat queries in Yahoo's logs , 2007, SIGIR.

[18]  Amanda Spink,et al.  Investigating customer click through behaviour with integrated sponsored and nonsponsored results , 2009 .

[19]  Kuan-Yu He,et al.  Improving Identification of Latent User Goals through Search-Result Snippet Classification , 2007 .

[20]  Zhenyu Liu,et al.  Automatic identification of user goals in Web search , 2005, WWW '05.

[21]  Amanda Spink,et al.  Use of query reformulation and relevance feedback by Excite users , 2000, Internet Res..

[22]  Kevyn Collins-Thompson,et al.  Information Filtering, Novelty Detection, and Named-Page Finding , 2002, TREC.

[23]  Alistair Moffat,et al.  Some Observations on User Search Behaviour , 2006, Aust. J. Intell. Inf. Process. Syst..

[24]  In-Ho Kang,et al.  Transactional Query Identification in Web Search , 2005, AIRS.

[25]  Ryen W. White,et al.  WWW 2007 / Track: Browsers and User Interfaces Session: Personalization Investigating Behavioral Variability in Web Search , 2022 .

[26]  Min-Yen Kan,et al.  Fast webpage classification using URL features , 2005, CIKM '05.

[27]  Bernard J. Jansen,et al.  A review of web searching studies and a framework for future research , 2001 .

[28]  Yiqun Liu,et al.  Automatic search engine performance evaluation with click-through data analysis , 2007, WWW '07.

[29]  Amanda Spink,et al.  How are we searching the World Wide Web? A comparison of nine search engine transaction logs , 2006, Inf. Process. Manag..

[30]  Djoerd Hiemstra,et al.  Retrieving Web Pages Using Content, Links, URLs and Anchors , 2001, TREC.

[31]  Bernard J. Jansen,et al.  The Methodology of Search Log Analysis , 2009 .

[32]  Amanda Spink,et al.  Web searcher interaction with the Dogpile.com metasearch engine , 2007 .

[33]  Ophir Frieder,et al.  Hourly analysis of a very large topically categorized web query log , 2004, SIGIR '04.

[34]  Daniel M. Russell,et al.  Query logs alone are not enough , 2007 .

[35]  Tim Berners-Lee,et al.  Uniform Resource Locators (URL) , 1994, RFC.

[36]  Andrew Lim,et al.  Word segmentation and recognition for web document framework , 1999, CIKM '99.

[37]  Mark Sanderson,et al.  Analyzing URL queries , 2010 .

[38]  Wei-Ying Ma,et al.  Optimizing web search using web click-through data , 2004, CIKM '04.

[39]  Amanda Spink,et al.  Determining the informational, navigational, and transactional intent of Web queries , 2008, Inf. Process. Manag..