Intent-Based User Segmentation with Query Enhancement

With the rapid advancement of the internet, accurate prediction of user's online intent underlying their search queries has received increasing attention from online advertising community. This paper aims to address the major challenges with user queries in the context of behavioral targeting advertising by proposing a query enhancement mechanism that augments user's queries by leveraging a user query log. The empirical evaluation demonstrates that the authors' methodology for query enhancement achieves greater improvement than the baseline models in both intent-based user classification and user segmentation. Different from traditional user segmentation methods, which take little semantics of user behaviors into consideration, the authors propose a novel user segmentation strategy by incorporating the query enhancement mechanism with a topic model to mine the relationships between users and their behaviors in order to segment users in a semantic manner. Comparing with a classical clustering algorithm, K-means, the experimental results indicate that the proposed user segmentation strategy helps improve behavioral targeting effectiveness significantly. This paper also proposes an alternative to define user's search intent for the evaluation purpose, in the case that the dataset is sanitized. This approach automatically labels users in a click graph, which are then used in training an intent-based user classifier.

[1]  Amanda Spink,et al.  Real life information retrieval: a study of user queries on the Web , 1998, SIGF.

[2]  Nick Craswell,et al.  Random walks on the click graph , 2007, SIGIR.

[3]  Bing Liu,et al.  Measuring the meaning in time series clustering of text search queries , 2006, CIKM '06.

[4]  Vanja Josifovski,et al.  Web-scale user modeling for targeting , 2012, WWW.

[5]  John F. Canny,et al.  Large-scale behavioral targeting , 2009, KDD.

[6]  Eugene Agichtein,et al.  Identifying "best bet" web search results by mining past user behavior , 2006, KDD '06.

[7]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[8]  Yen-Jen Oyang,et al.  Relevant term suggestion in interactive web search based on contextual information in query session logs , 2003, J. Assoc. Inf. Sci. Technol..

[9]  Philip K. Chan,et al.  Learning implicit user interest hierarchy for context in personalization , 2003, IUI.

[10]  Susan T. Dumais,et al.  Personalizing Search via Automated Analysis of Interests and Activities , 2005, SIGIR.

[11]  Zheng Chen,et al.  Transfer learning for behavioral targeting , 2010, WWW '10.

[12]  Silviu Cucerzan,et al.  Re-ranking search results using query logs , 2006, CIKM '06.

[13]  Alexander J. Smola,et al.  Scalable distributed inference of dynamic user interests for behavioral targeting , 2011, KDD.

[14]  Adwait Ratnaparkhi Finding predictive search queries for behavioral targeting , 2010 .

[15]  F. Sibel Salman,et al.  A mixed-integer programming approach to the clustering problem with an application in customer segmentation , 2006, Eur. J. Oper. Res..

[16]  Yiqun Liu,et al.  Identifying web spam with user behavior analysis , 2008, AIRWeb '08.

[17]  Thorsten Joachims,et al.  Accurately interpreting clickthrough data as implicit feedback , 2005, SIGIR '05.

[18]  Vahab S. Mirrokni,et al.  Mining advertiser-specific user behavior using adfactors , 2010, WWW '10.

[19]  Ming Zhou,et al.  Improving Query Spelling Correction Using Web Search Results , 2007, EMNLP-CoNLL.

[20]  Abdur Chowdhury,et al.  A picture of search , 2006, InfoScale '06.

[21]  Amanda Spink,et al.  A temporal comparison of AltaVista Web searching , 2005, J. Assoc. Inf. Sci. Technol..

[22]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Mark Claypool,et al.  Inferring User Interest , 2001, IEEE Internet Comput..

[24]  Foster Provost,et al.  Audience selection for on-line brand advertising: privacy-friendly social network targeting , 2009, KDD.

[25]  Berthier A. Ribeiro-Neto,et al.  Impedance coupling in content-targeted advertising , 2005, SIGIR '05.

[26]  Umar Qasim,et al.  A partial-order based active cache for recommender systems , 2009, RecSys '09.

[27]  Susan T. Dumais,et al.  Learning user interaction models for predicting web search result preferences , 2006, SIGIR.

[28]  Kenneth Ward Church,et al.  Query suggestion using hitting time , 2008, CIKM '08.

[29]  Christos Bouras,et al.  Clustering User Preferences Using W-kmeans , 2011, 2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems.

[30]  R. J. Kuo,et al.  Integration of self-organizing feature maps neural network and genetic K-means algorithm for market segmentation , 2006, Expert Syst. Appl..

[31]  Andrei Z. Broder,et al.  A semantic approach to contextual advertising , 2007, SIGIR.

[32]  Eric Brill,et al.  Improving web search ranking by incorporating user behavior information , 2006, SIGIR.

[33]  Kai Zheng,et al.  User Clustering-Based Web Service Discovery , 2012, 2012 Sixth International Conference on Internet Computing for Science and Engineering.

[34]  Evangelos P. Markatos,et al.  On caching search engine query results , 2001, Comput. Commun..

[35]  HongLiu,et al.  Web user clustering analysis based on KMeans algorithm , 2010, ICOIN 2010.

[36]  Wen Zhang,et al.  How much can behavioral targeting help online advertising? , 2009, WWW '09.

[37]  Catarina Sismeiro,et al.  A Model of Web Site Browsing Behavior Estimated on Clickstream Data , 2003 .

[38]  Olatunji Mumini Omisore,et al.  Archetypal Personalized Recommender System for Mobile Phone Users , 2013, Int. J. Inf. Retr. Res..

[39]  Foster J. Provost,et al.  Bid optimizing and inventory scoring in targeted online advertising , 2012, KDD.

[40]  K. Mardia Measures of multivariate skewness and kurtosis with applications , 1970 .

[41]  Ji-Rong Wen,et al.  Query clustering using user logs , 2002, TOIS.

[42]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[43]  Ying Li,et al.  Learning to rank audience for behavioral targeting , 2010, SIGIR '10.

[44]  Ricardo A. Baeza-Yates,et al.  Query Recommendation Using Query Logs in Search Engines , 2004, EDBT Workshops.

[45]  Aristides Gionis,et al.  Dr. Searcher and Mr. Browser: a unified hyperlink-click graph , 2008, CIKM '08.

[46]  Susan T. Dumais,et al.  To personalize or not to personalize: modeling queries with variation in user intent , 2008, SIGIR '08.

[47]  Ji-Rong Wen,et al.  WWW 2007 / Track: Search Session: Personalization A Largescale Evaluation and Analysis of Personalized Search Strategies ABSTRACT , 2022 .

[48]  Ricardo Baeza-Yates,et al.  Query-sets: using implicit feedback and query patterns to organize web documents , 2008, WWW.

[49]  Jude W. Shavlik,et al.  Learning users' interests by unobtrusively observing their normal behavior , 2000, IUI '00.

[50]  Amanda Spink,et al.  U.S. versus European web searching trends , 2002, SIGF.

[51]  Zhenglu Yang,et al.  Dynamic Adaptation Strategies for Long-Term and Short-Term User Profile to Personalize Search , 2007, APWeb/WAIM.

[52]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[53]  Evgeniy Gabrilovich,et al.  Retrieval models for audience selection in display advertising , 2011, CIKM '11.

[54]  JinHua Xu,et al.  Web user clustering analysis based on KMeans algorithm , 2010, 2010 International Conference on Information, Networking and Automation (ICINA).

[55]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[56]  Weiguo Fan,et al.  Learning to advertise , 2006, SIGIR.

[57]  Susan Gauch,et al.  Improving Ontology-Based User Profiles , 2004, RIAO.

[58]  Ping Zhang,et al.  UNDERSTANDING CONSUMERS ATTITUDE TOWARD ADVERTISING , 2002 .

[59]  Ricardo A. Baeza-Yates,et al.  Extracting semantic relations from query logs , 2007, KDD '07.

[60]  Wei-Ying Ma,et al.  Query Expansion by Mining User Logs , 2003, IEEE Trans. Knowl. Data Eng..

[61]  Ravi Kumar,et al.  A characterization of online browsing behavior , 2010, WWW '10.

[62]  Xiaoou Tang,et al.  Real time google and live image search re-ranking , 2008, ACM Multimedia.

[63]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[64]  Eric Brill,et al.  Spelling Correction as an Iterative Process that Exploits the Collective Knowledge of Web Users , 2004, EMNLP.

[65]  Ahmed Hassan Awadallah,et al.  Beyond DCG: user behavior as a predictor of a successful search , 2010, WSDM '10.

[66]  Ji-Rong Wen,et al.  Clustering user queries of a search engine , 2001, WWW '01.

[67]  Doug Beeferman,et al.  Agglomerative clustering of a search engine query log , 2000, KDD '00.

[68]  Shui-Lung Chuang,et al.  Enriching Web taxonomies through subject categorization of query terms from search engine logs , 2003, Decis. Support Syst..

[69]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[70]  Wei-Ying Ma,et al.  Argo: intelligent advertising by mining a user's interest from his photo collections , 2009, KDD Workshop on Data Mining and Audience Intelligence for Advertising.

[71]  Wei Gao,et al.  Cross-lingual query suggestion using query logs of different languages , 2007, SIGIR.

[72]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..