Towards Click-Based Models of Geographic Interests in Web Search

With the recent surge in the volume of search queries that explicitly or implicitly express users' geographical interests, to accurately infer users' locality preference becomes an increasingly important yet challenging issue. We study two click-based models of the distribution of such geographical interests by mining the user click stream data in the search engine logs, addressing three important issues in spatial Web search. First, search queries and documents can be classified by the models according to their spatial specificity. Second, the geographic center(s) of interests for queries and documents can be inferred. Finally, the model can be applied to generate relevance features for search ranking. We evaluated our proposals on a large dataset with about 10,000 unique queries sampled from the Yahoo! Search query logs, and about 450 million user clicks on 1.4 million unique Web pages over a six-months period. We report about 90% accuracy and about 3% false positive rate in identifying search queries with or without specific geographical interests, as well as statistically significant improvement in relevance ranking over a strong baseline.

[1]  Xing Xie,et al.  Detecting Geographical Serving Area of Web Resources , 2006, GIR.

[2]  Xing Xie,et al.  Web resource geographic location classification and detection , 2005, WWW '05.

[3]  Hyun Chul Lee,et al.  Geographically-Sensitive Link Analysis , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[4]  Ron Sivan,et al.  Web-a-where: geotagging web content , 2004, SIGIR '04.

[5]  Luis Gravano,et al.  Categorizing web queries according to geographical locality , 2003, CIKM '03.

[6]  Dmitry Medvedev,et al.  Environmental scenario search and visualization , 2007, GIS.

[7]  Mark Sanderson,et al.  Search words and geography , 2007, GIR '07.

[8]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[9]  Mário J. Silva,et al.  Query expansion through geographical feature types , 2007, GIR '07.

[10]  Mong-Li Lee,et al.  Discovering geographical-specific interests from web click data , 2008, LocWeb.

[11]  Jon M. Kleinberg,et al.  Spatial variation in search engine queries , 2008, WWW.

[12]  Kentaro Toyama,et al.  Robust location search from text queries , 2007, GIS.

[13]  Daqing He,et al.  Geographic Named Entity Disambiguation with Automatic Profile Generation , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[14]  Ying Li,et al.  Detecting dominant locations from search queries , 2005, SIGIR '05.

[15]  Wei Vivian Zhang,et al.  Geomodification in Query Rewriting , 2006, GIR.

[16]  Mor Naaman,et al.  Generating diverse and representative image search results for landmarks , 2008, WWW.

[17]  Mor Naaman,et al.  How flickr helps us make sense of the world: context and content in community-contributed media collections , 2007, ACM Multimedia.

[18]  Mor Naaman,et al.  World explorer: visualizing aggregate data from unstructured text in geo-referenced collections , 2007, JCDL '07.

[19]  Alia I. Abdelmoty,et al.  Ontology-Based Spatial Query Expansion in Information Retrieval , 2005, OTM Conferences.

[20]  M. Sanderson,et al.  Analyzing geographic queries , 2004 .

[21]  Claudia Bauzer Medeiros,et al.  Discovering geographic locations in web pages using urban addresses , 2007, GIR '07.