Access concentration detection in click logs to improve mobile Web-IR

Effective ranking algorithms for mobile Web searches are being actively pursued. Due to the peculiar and troublesome properties of mobile contents such as scant text, few outward links, and few input keywords, conventional Web search techniques using bag-of-words ranking functions or link-based algorithms are not good enough for mobile Web searches. Our solution is to use click logs to clarify access-concentrated search results for each query and to utilize the titles and snippets to expand the queries. Many previous works regard the absolute click numbers as the degree of access concentration, but they are strongly biased such that higher-ranked search results are more easily clicked than lower-ranked ones. Therefore, it is considered that only higher-ranked search results are access-concentrated ones and that only terms extracted from them can be used to expand a query. In this paper, we introduce a new measure that is capable of estimating the degree of access concentration. This measure is used to precisely extract access concentration sites from many search results and to expand queries with terms extracted from them. We conducted an experiment using the click logs and data from an actual mobile Web search site. Results obtained show that our proposed method is a more effective way to boost the search precision than using other query expansion methods such as the top K search results or the most-often-clicked search results.

[1]  Anthony Jameson,et al.  Depth- and breadth-first processing of search result lists , 2004, CHI EA '04.

[2]  Wei-Ying Ma,et al.  Probabilistic query expansion using query logs , 2002, WWW '02.

[3]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[4]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[5]  Peter Willett,et al.  Effectiveness of query expansion in ranked-output document retrieval systems , 1992, J. Inf. Sci..

[6]  Amanda Spink,et al.  An Analysis of Web Documents Retrieved and Viewed , 2003, International Conference on Internet Computing.

[7]  Jin Zhou,et al.  Log-based indexing to improve web site search , 2007, SAC '07.

[8]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[9]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[10]  Dayne Freitag,et al.  A Machine Learning Architecture for Optimizing Web Search Engines , 1999 .

[11]  Ying Wang,et al.  A study of the effect of term proximity on query expansion , 2006, J. Inf. Sci..

[12]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[13]  Claudio Carpineto,et al.  An information-theoretic approach to automatic query expansion , 2001, TOIS.

[14]  Peter Willett,et al.  Improving subject retrieval in online catalogues: S. Walker, R.M. Jones. (British Library Research Paper 24). British Library, London (1987) xi + 193 pp. £10. ISBN 0-7123-3129-8. (Distributed by Longwood Publishing Group, Wolfeboro, NH, USA.) , 1988 .

[15]  Gerhard Weikum,et al.  Query-Log Based Authority Analysis for Web Information Search , 2004, WISE.

[16]  Stephen E. Robertson,et al.  On Term Selection for Query Expansion , 1991, J. Documentation.

[17]  Stephen E. Robertson,et al.  Flexible pseudo-relevance feedback using optimization tables , 2001, SIGIR '01.

[18]  Katsumi Tanaka,et al.  Context Matcher: Improved Web Search Using Query Term Context in Source Document and in Search Results , 2006, APWeb.

[19]  Wei-Ying Ma,et al.  Optimizing web search using web click-through data , 2004, CIKM '04.

[20]  Tao Tao,et al.  Regularized estimation of mixture models for robust pseudo-relevance feedback , 2006, SIGIR.

[21]  Silviu Cucerzan,et al.  Re-ranking search results using query logs , 2006, CIKM '06.

[22]  Xuehua Shen,et al.  Context-sensitive information retrieval using implicit feedback , 2005, SIGIR '05.

[23]  Chris Buckley,et al.  Improving automatic query expansion , 1998, SIGIR '98.

[24]  Ricardo Baeza-Yates,et al.  Web Usage Mining in Search Engines , 2005 .

[25]  Ricardo A. Baeza-Yates,et al.  Applications of Web Query Mining , 2005, ECIR.

[26]  Thorsten Joachims,et al.  Accurately Interpreting Clickthrough Data as Implicit Feedback , 2017 .

[27]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[28]  Yoram Singer,et al.  Learning to Order Things , 1997, NIPS.

[29]  Joseph H. Goldberg,et al.  Eye tracking in web search tasks: design implications , 2002, ETRA.

[30]  Stephen E. Robertson,et al.  GatfordCentre for Interactive Systems ResearchDepartment of Information , 1996 .

[31]  Aviezri S. Fraenkel,et al.  Local Feedback in Full-Text Retrieval Systems , 1977, JACM.

[32]  Marcus Fontoura,et al.  Using annotations in enterprise search , 2006, WWW '06.

[33]  Shyam Kapur,et al.  Unity: relevance feedback using user query logs , 2006, SIGIR '06.