Modeling and predicting behavioral dynamics on the web

User behavior on the Web changes over time. For example, the queries that people issue to search engines, and the underlying informational goals behind the queries vary over time. In this paper, we examine how to model and predict this temporal user behavior. We develop a temporal modeling framework adapted from physics and signal processing that can be used to predict time-varying user behavior using smoothing and trends. We also explore other dynamics of Web behaviors, such as the detection of periodicities and surprises. We develop a learning procedure that can be used to construct models of users' activities based on features of current and historical behaviors. The results of experiments indicate that by using our framework to predict user behavior, we can achieve significant improvements in prediction compared to baseline models that weight historical evidence the same for all queries. We also develop a novel learning algorithm that explicitly learns when to apply a given prediction model among a set of such models. Our improved temporal modeling of user behavior can be used to enhance query suggestions, crawling policies, and result ranking.

[1]  D.P. Skinner,et al.  The cepstrum: A guide to processing , 1977, Proceedings of the IEEE.

[2]  Peiling Wang,et al.  Mining longitudinal web queries: Trends and patterns , 2003, J. Assoc. Inf. Sci. Technol..

[3]  Ophir Frieder,et al.  Hourly analysis of a very large topically categorized web query log , 2004, SIGIR '04.

[4]  Dimitrios Gunopulos,et al.  Identifying similarities, periodicities and bursts for online search queries , 2004, SIGMOD '04.

[5]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[6]  Melvin J. Hinich,et al.  Time Series Analysis by State Space Methods , 2001 .

[7]  Jan A Snyman,et al.  Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-Based Algorithms , 2005 .

[8]  Steve Chien,et al.  Semantic similarity between search engine queries using temporal correlation , 2005, WWW '05.

[9]  Fernando Diaz,et al.  Temporal profiles of queries , 2007, TOIS.

[10]  Brian N. Bershad,et al.  Why we search: visualizing and predicting user behavior , 2007, WWW '07.

[11]  Rob J. Hyndman,et al.  Forecasting with Exponential Smoothing , 2008 .

[12]  Shaul Markovitch,et al.  Predicting the News of Tomorrow Using Patterns in Web Search Queries , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[13]  Rob J Hyndman,et al.  Forecasting with Exponential Smoothing: The State Space Approach , 2008 .

[14]  Fernando Diaz,et al.  Integration of news content into web results , 2009, WSDM '09.

[15]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[16]  Yehuda Koren,et al.  Collaborative filtering with temporal dynamics , 2009, KDD.

[17]  Susan T. Dumais,et al.  Classification-enhanced ranking , 2010, WWW '10.

[18]  Gilad Mishne,et al.  Towards recency ranking in web search , 2010, WSDM '10.

[19]  Milad Shokouhi,et al.  Detecting seasonal queries by time-series analysis , 2011, SIGIR.

[20]  Susan T. Dumais,et al.  Understanding temporal query dynamics , 2011, WSDM '11.

[21]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[22]  Evgeniy Gabrilovich,et al.  A word at a time: computing word relatedness using temporal semantic analysis , 2011, WWW.