Utility Change Point Detection in Online Social Media: A Revealed Preference Framework

This paper deals with change detection of utility maximization behavior given a dataset. In online social media, such changes occur due to the effect of marketing, advertising, or changes in ground truth. First, we use the revealed preference framework to detect the unknown time point (change point) at which the utility function changed. We derive necessary and sufficient conditions for detecting the change point. Second, in the presence of noisy measurements, we propose a method to detect the change point and construct a decision test. Also, an optimization criteria is provided to recover the linear perturbation coefficients. Finally, to reduce the computational cost, a dimensionality reduction algorithm using Johnson–Lindenstrauss transform is presented. The results developed are illustrated on two real datasets: Yahoo! Tech Buzz dataset and Youstatanalyzer dataset. By using the results developed in the paper, several useful insights can be gleaned from these datasets. First, the changes in ground truth affecting the utility of the agent can be detected by utility maximization behavior in online search. Second, the recovered utility functions satisfy the single crossing property indicating strategic substitute behavior in online search. Finally, due to the large number of videos in YouTube, the utility maximization behavior was verified through the dimensionality reduction algorithm.

[1]  Drew Fudenberg,et al.  Stochastic Choice and Revealed Perturbed Utility , 2015 .

[2]  W. B. Johnson,et al.  Extensions of Lipschitz mappings into Hilbert space , 1984 .

[3]  Michèle Basseville,et al.  Detection of Abrupt Changes: Theory and Applications. , 1995 .

[4]  Andrei Volodin,et al.  On negatively associated random variables , 2012 .

[5]  Fernando Pérez-González,et al.  Coping with the enemy: Advances in adversary-aware signal processing , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Vikram Krishnamurthy,et al.  Afriat's Test for Detecting Malicious Agents , 2012, IEEE Signal Processing Letters.

[7]  David M. Pennock,et al.  An Empirical Study of Dynamic Pari-mutuel Markets : Evidence from the Tech Buzz Game , 2008 .

[8]  P. Samuelson A Note on the Pure Theory of Consumer's Behaviour , 1938 .

[9]  Thomas J. Johnson,et al.  Online and in the Know: Uses and Gratifications of the Web for Political Information , 2002 .

[10]  Rakesh V. Vohra,et al.  Learning from revealed preference , 2006, EC '06.

[11]  H. Varian The Nonparametric Approach to Demand Analysis , 1982 .

[12]  Vikram Krishnamurthy,et al.  Nonparametric Demand Forecasting and Detection of Energy Aware Consumers , 2015, IEEE Transactions on Smart Grid.

[13]  S. Afriat THE CONSTRUCTION OF UTILITY FUNCTIONS FROM EXPENDITURE DATA , 1967 .

[14]  Santosh S. Vempala,et al.  The Random Projection Method , 2005, DIMACS Series in Discrete Mathematics and Theoretical Computer Science.

[15]  Dimitris Achlioptas,et al.  Database-friendly random projections: Johnson-Lindenstrauss with binary coins , 2003, J. Comput. Syst. Sci..

[16]  J. Aucott,et al.  The utility of "Google Trends" for epidemiological research: Lyme disease as an example. , 2010, Geospatial health.

[17]  Jeff S. Shamma,et al.  Control of preferences in social networks , 2010, 49th IEEE Conference on Decision and Control (CDC).

[18]  Martin Browning,et al.  Prices versus preferences: Taste change and revealed preference , 2015 .

[19]  H. Varian Revealed Preference , 2006 .

[20]  Donald J. Brown COWLES FOUNDATION FOR RESEARCH IN ECONOMICS , 1999 .

[21]  Maria-Florina Balcan,et al.  Learning Economic Parameters from Revealed Preferences , 2014, WINE.

[22]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[23]  K. Joag-dev,et al.  Negative Association of Random Variables with Applications , 1983 .

[24]  Francesco De Pellegrini,et al.  YOUStatAnalyzer: a tool for analysing the dynamics of YouTube content popularity , 2013, VALUETOOLS.

[25]  Olivier Toubia,et al.  Intrinsic vs. Image-Related Utility in Social Media: Why Do People Contribute Content to Twitter? , 2013, Mark. Sci..

[26]  E. Brynjolfsson,et al.  The Future of Prediction: How Google Searches Foreshadow Housing Prices and Sales , 2013, ICIS 2013.

[27]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[28]  Hang Zhang,et al.  Adaptive Caching in the YouTube Content Distribution Network: A Revealed Preference Game-Theoretic Learning Approach , 2015, IEEE Transactions on Cognitive Communications and Networking.

[29]  Eleftherios Mylonakis,et al.  Google trends: a web-based tool for real-time surveillance of disease outbreaks. , 2009, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[30]  Cong Zhang,et al.  Insight Data of YouTube from a Partner's View , 2014, NOSSDAV 2014.

[31]  D. Parkes,et al.  Analysis of Bidding Networks in eBay: Aggregate Preference Identification through Community Detection , 2007 .

[32]  J. Swait,et al.  Consumer Search in High Technology Markets: Exploring the Use of Traditional Information Channels , 2004 .

[33]  Paul R. Milgrom,et al.  Monotone Comparative Statics , 1994 .

[34]  Paolo Pin,et al.  Identifying the roles of race-based choice and chance in high school friendship network formation , 2010, Proceedings of the National Academy of Sciences.

[35]  Vikram Krishnamurthy,et al.  A data centric approach to utility change detection in online social media , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[36]  Yuan Ding,et al.  Broadcast yourself: understanding YouTube uploaders , 2011, IMC '11.

[37]  Morteza Zadimoghaddam,et al.  Efficiently Learning from Revealed Preference , 2012, WINE.

[38]  David M. Pennock,et al.  The Tech Buzz Game , 2005, Computer.

[39]  Jieping Ye,et al.  Tuberculosis Surveillance by Analyzing Google Trends , 2011, IEEE Transactions on Biomedical Engineering.

[40]  Clifford Nass,et al.  The media equation - how people treat computers, television, and new media like real people and places , 1996 .

[41]  Jurgen A. Doornik,et al.  Improving the Timeliness of Data on Influenza-like Illnesses using Google Search Data , 2010 .

[42]  Mogens Fosgerau,et al.  A Theory of the Perturbed Consumer with General Budgets , 2012 .

[43]  Vikram Krishnamurthy,et al.  PAC Algorithms for Detecting Nash Equilibrium Play in Social Networks: From Twitter to Energy Markets , 2016, IEEE Access.

[44]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[45]  Walter Diewert,et al.  Tests for the consistency of consumer data , 1985 .

[46]  Mirjam Wattenhofer,et al.  YouTube around the world: geographic popularity of videos , 2012, WWW.

[47]  Walter Diewert,et al.  Afriat and Revealed Preference Theory , 1973 .