Reexamination on Potential for Personalization in Web Search

Various strategies have been proposed to enhance web search through utilizing individual user information. However, considering the well acknowledged recurring queries and repetitive clicks among users, it is still an open issue whether using individual user information is a proper direction of efforts in improving the web search. In this paper, we first quantitatively demonstrate that individual user information is more beneficial than common user information. Then we statistically compare the benefit of individual and common user information through Kappa statistic. Finally, we calculate potential for personalization to present an overview of what queries can benefit more from individual user information. All these analyses are conducted on both English AOL log and Chinese Sogou log, and a bilingual perspective statistics consistently confirms our findings.

[1]  W. Bruce Croft,et al.  Quantifying query ambiguity , 2002 .

[2]  Susan T. Dumais,et al.  Personalizing Search via Automated Analysis of Interests and Activities , 2005, SIGIR.

[3]  Susan T. Dumais,et al.  Characterizing the value of personalizing search , 2007, SIGIR.

[4]  Huan Liu,et al.  CubeSVD: a novel approach to personalized Web search , 2005, WWW '05.

[5]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[6]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[7]  Barry Smyth,et al.  A Community-Based Approach to Personalizing Web Search , 2007, Computer.

[8]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[9]  Susan T. Dumais,et al.  To personalize or not to personalize: modeling queries with variation in user intent , 2008, SIGIR '08.

[10]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[11]  Mary Beth Rosson,et al.  Paradox of the active user , 1987 .

[12]  Hans-Peter Kriegel,et al.  Ieee Transactions on Knowledge and Data Engineering Probabilistic Memory-based Collaborative Filtering , 2022 .

[13]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[14]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[15]  Ji-Rong Wen,et al.  WWW 2007 / Track: Search Session: Personalization A Largescale Evaluation and Analysis of Personalized Search Strategies ABSTRACT , 2022 .

[16]  Wolfgang Nejdl,et al.  Using ODP metadata to personalize search , 2005, SIGIR '05.

[17]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[18]  ChengXiang Zhai,et al.  Implicit user modeling for personalized search , 2005, CIKM '05.

[19]  Najafi Azadeh,et al.  REAL LIFE, REAL USERS AND REAL NEEDS: A STUDY AND ANALYSIS OF USER QUERIES ON THE WEB , 2008 .

[20]  Hinrich Schütze,et al.  Personalized search , 2002, CACM.

[21]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[22]  John F. Canny,et al.  Collaborative filtering with privacy via factor analysis , 2002, SIGIR '02.

[23]  Masatoshi Yoshikawa,et al.  Adaptive web search based on user profile constructed without any effort from users , 2004, WWW '04.

[24]  Arnd Kohrs,et al.  Clustering for collaborative filtering applications , 1999 .

[25]  Susan Gauch,et al.  Personalizing Search Based on User Search Histories , 2004 .