Crumbs of the Cookie: User Profiling in Customer-Base Analysis and Behavioral Targeting

User profile is a summary of a consumer's interests and preferences revealed through the consumer's online activity. It is a fundamental component of numerous applications in digital marketing. McKinsey & Company view online user profiling as one of the promising opportunities companies should take advantage of to unlock "big data's" potential. This paper proposes a modeling approach that uncovers individual user profiles from online surfing data and allows online businesses to make profile predictions when limited information is available. The approach is easily parallelized and scales well for processing massive records of user online activity. We demonstrate application of our approach to customer-base analysis and display advertising. Our empirical analysis uncovers easy-to-interpret behavior profiles and describes the distribution of such profiles. Furthermore, it reveals that even for information-rich online firms profile inference that is based solely on their internal data may produce biased results. We find that although search engines cover smaller portions of consumer Web visits than major advertising networks, their data is of higher quality. Thus, even with the smaller information set, search engines can effectively recover consumer behavioral profiles. We also show that temporal limitations imposed on individual-level tracking abilities are likely to have a differential impact across major online businesses, and that our approach is particularly effective for temporally limited data. Using economic simulation we demonstrate potential gains the proposed model may offer a firm if used in individual-level targeting of display ads. Data, as supplemental material, are available at http://dx.doi.org/10.1287/mksc.2015.0956 .

[1]  Young-Hoon Park,et al.  Modeling Browsing Behavior at Multiple Websites , 2004 .

[2]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[3]  Beibei Li,et al.  Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowd-Sourced Content , 2011, Mark. Sci..

[4]  Wendy W. Moe,et al.  The Influence of Goal‐Directed and Experiential Activities on Online Flow Experiences , 2003 .

[5]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.

[6]  Peter S. Fader,et al.  Dynamic Conversion Behavior at E-Commerce Sites , 2004, Manag. Sci..

[7]  Alexander J. Smola,et al.  Scalable inference in latent variable models , 2012, WSDM '12.

[8]  Anindya Ghose,et al.  An Empirical Analysis of Search Engine Advertising: Sponsored Search in Electronic Markets , 2009, Manag. Sci..

[9]  Max Welling,et al.  Distributed Algorithms for Topic Models , 2009, J. Mach. Learn. Res..

[10]  Peter S. Fader,et al.  On the Depth and Dynamics of Online Search Behavior , 2004, Manag. Sci..

[11]  Shibo Li,et al.  Modeling Category Viewership of Web Users with Multivariate Count Models , 2002 .

[12]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Alexander J. Smola,et al.  An architecture for parallel topic models , 2010, Proc. VLDB Endow..

[14]  Santosh S. Vempala,et al.  Latent semantic indexing: a probabilistic analysis , 1998, PODS '98.

[15]  Peter J. Danaher,et al.  Modeling Multivariate Distributions Using Copulas: Applications in Marketing , 2011, Mark. Sci..

[16]  Peter J. Danaher,et al.  Modeling Page Views Across Multiple Websites with an Application to Internet Reach and Frequency Prediction , 2007 .

[17]  Oliver J. Rutz,et al.  From Generic to Branded: A Model of Spillover in Paid Search Advertising , 2011 .

[18]  Anand V. Bodapati Recommendation Systems with Purchase Data , 2008 .

[19]  Michael Trusov,et al.  Modeling Indirect Effects of Paid Search Advertising: Which Keywords Lead to More Future Visits? , 2011, Mark. Sci..

[20]  Anindya Ghose,et al.  Analyzing the Relationship Between Organic and Sponsored Search Advertising: Positive, Negative, or Zero Interdependence? , 2010, Mark. Sci..

[21]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[22]  Leveraging “ big data ” to optimize digital marketing , .

[23]  John R. Hauser,et al.  Website Morphing , 2009, Mark. Sci..

[24]  Yang,et al.  Consumer Click Behavior at a Search Engine : The Role of Keyword Popularity , .

[25]  Avi Goldfarb,et al.  Privacy Regulation and Online Advertising , 2010, Manag. Sci..

[26]  Robert F. Bordley,et al.  Morphing Banner Advertising , 2014, Mark. Sci..

[27]  Catherine E. Tucker,et al.  When Does Retargeting Work? Information Specificity in Online Advertising , 2013 .