Early Detection of User Exits from Clickstream Data: A Markov Modulated Marked Point Process Model

Most users leave e-commerce websites with no purchase. Hence, it is important for website owners to detect users at risk of exiting and intervene early (e. g., adapting website content or offering price promotions). Prior approaches make widespread use of clickstream data; however, state-of-the-art algorithms only model the sequence of web pages visited and not the time spent on them. In this paper, we develop a novel Markov modulated marked point process (M3PP) model for detecting users at risk of exiting with no purchase from clickstream data. It accommodates clickstream data in a holistic manner: our proposed M3PP models both the sequence of pages visited and the temporal dynamics between them, i. e., the time spent on pages. This is achieved by a continuous-time marked point process. Different from previous Markovian clickstream models, our M3PP is the first model in which the continuous nature of time is considered. The marked point process is modulated by a continuous-time Markov process in order to account for different latent shopping phases. As a secondary contribution, we suggest a risk assessment framework. Rather than predicting future page visits, we compute a user’s risk of exiting with no purchase. For this purpose, we build upon sequential hypothesis testing in order to suggest a risk score for user exits. Our computational experiments draw upon real-world clickstream data provided by a large online retailer. Based on this, we find that state-of-the-art algorithms are consistently outperformed by our M3PP model in terms of both AUROC (+ 6.24 percentage points) and so-called time of early warning (+ 12.93 %). Accordingly, our M3PP model allows for timely detections of user exits and thus provides sufficient time for e-commerce website owners to trigger dynamic online interventions.

[1]  M. Tamer Özsu,et al.  A Web page prediction model based on click-stream tree representation of user behavior , 2003, KDD '03.

[2]  Vasudeva Varma,et al.  Timespent based models for predicting user retention , 2013, WWW.

[3]  Michalis Vazirgiannis,et al.  Web path recommendations based on page ranking and Markov models , 2005, WIDM '05.

[4]  Chris Janiszewski,et al.  The Influence of Display Characteristics on Visual Exploratory Search Behavior , 1998 .

[5]  Mark Levene,et al.  Data Mining of User Navigation Patterns , 1999, WEBKDD.

[6]  David A. Clifton,et al.  Gaussian Processes for Personalized Interpretable Volatility Metrics in the Step-Down Ward , 2019, IEEE Journal of Biomedical and Health Informatics.

[7]  Omer F. Rana,et al.  Understanding ecommerce clickstreams: a tale of two states , 2018 .

[8]  D. Hoffman,et al.  The Influence of Goal-Directed and Experiential Activities on Online Flow Experiences , 2003 .

[9]  Meichun Hsu,et al.  Modeling Complex Clickstream Data by Stochastic Models: Theory and Methods , 2016, WWW.

[10]  Wendy W. Moe,et al.  The Influence of Goal‐Directed and Experiential Activities on Online Flow Experiences , 2003 .

[11]  Virgílio A. F. Almeida,et al.  Characterizing user behavior in online social networks , 2009, IMC '09.

[12]  Ramesh R. Sarukkai,et al.  Link prediction and path analysis using Markov chains , 2000, Comput. Networks.

[13]  Christian R. Shelton,et al.  Marked Point Process for Severity of Illness Assessment , 2017, MLHC.

[14]  Amy Wenxuan Ding,et al.  Learning User Real-Time Intent for Optimal Dynamic Web Page Transformation , 2015, Inf. Syst. Res..

[15]  Peter I. Hofgesang,et al.  Relevance of Time Spent on Web Pages , 2006 .

[16]  William C. McDowell,et al.  An examination of retail website design and conversion rate , 2016 .

[17]  Pedro M. Domingos,et al.  Relational Markov models and their application to adaptive web navigation , 2002, KDD.

[18]  Andrew B. Whinston,et al.  Path to Purchase: A Mutually Exciting Point Process Model for Online Advertising and Conversion , 2012, Manag. Sci..

[19]  Zachary A. Pardos,et al.  Deep Neural Networks and How They Apply to Sequential Education Data , 2016, L@S.

[20]  Dennis L. Hoffman,et al.  Marketing in Hypermedia Computer-Mediated Environments : Conceptual Foundations 1 ) , 1998 .

[21]  Vinayak A. Rao,et al.  Markov-modulated Marked Poisson Processes for Check-in Data , 2016, ICML.

[22]  Mark Levene,et al.  Evaluating Variable-Length Markov Chain Models for Analysis of User Web Navigation Sessions , 2007, IEEE Transactions on Knowledge and Data Engineering.

[23]  Kannan Srinivasan,et al.  Modeling Online Browsing and Path Analysis Using Clickstream Data , 2004 .

[24]  Mihaela van der Schaar,et al.  A Hidden Absorbing Semi-Markov Model for Informatively Censored Temporal Data: Learning and Inference , 2016, J. Mach. Learn. Res..

[25]  Utkarsh Upadhyay,et al.  Recurrent Marked Temporal Point Processes: Embedding Event History to Vector , 2016, KDD.

[26]  Bamshad Mobasher,et al.  Data Mining for Web Personalization , 2007, The Adaptive Web.

[27]  Yomi Kastro,et al.  Real-time prediction of online shoppers’ purchasing intention using multilayer perceptron and LSTM recurrent neural networks , 2019, Neural Computing and Applications.

[28]  A. Hawkes Spectra of some self-exciting and mutually exciting point processes , 1971 .

[29]  Howard R. Moskowitz,et al.  Integrating science into web design: consumer‐driven web site optimization , 2009 .

[30]  Peter S. Fader,et al.  Dynamic Conversion Behavior at E-Commerce Sites , 2004, Manag. Sci..

[31]  Mark Levene,et al.  Computing the Entropy of User Navigation in the Web , 2003, Int. J. Inf. Technol. Decis. Mak..

[32]  Sucheta Nadkarni,et al.  A Task-Based Model of Perceived Website Complexity , 2007, MIS Q..

[33]  Mihaela van der Schaar,et al.  Learning from Clinical Judgments: Semi-Markov-Modulated Marked Hawkes Processes for Risk Prognosis , 2017, ICML.

[34]  Jun Hong,et al.  Using Markov Chains for Link Prediction in Adaptive Web Sites , 2002, Soft-Ware.

[35]  Jie Li,et al.  Characterizing typical and atypical user sessions in clickstreams , 2008, WWW.

[36]  George Karypis,et al.  Selective Markov models for predicting Web page accesses , 2004, TOIT.

[37]  Porter Jenkins ClickGraph: Web Page Embedding using Clickstream Data for Multitask Learning , 2019, WWW.