Prediction of Sparse User-Item Consumption Rates with Zero-Inflated Poisson Regression

In this paper we address the problem of building user models that can predict the rate at which individuals consume items from a finite set, including items they have consumed in the past and items that are new. This combination of repeat and new item consumption is common in applications such as listening to music, visiting web sites, and purchasing products. We use zero-inflated Poisson (ZIP) regression models as the basis for our modeling approach, leading to a general framework for modeling user-item consumption rates over time. We show that these models are more flexible in capturing user behavior than alternatives such as well-known latent factor models based on matrix factorization. We compare the performance of ZIP regression and latent factor models on three different data sets involving music, restaurant reviews, and social media. The ZIP regression models are systematically more accurate across all three data sets and across different prediction metrics.

[1]  David M. Blei,et al.  Dynamic Poisson Factorization , 2015, RecSys.

[2]  Jonathan D. Cohen,et al.  Humans use directed and random exploration to solve the explore-exploit dilemma. , 2014, Journal of experimental psychology. General.

[3]  Piyush Rai,et al.  Scalable Generative Models for Multi-label Learning with Missing Labels , 2017, ICML.

[4]  Nicholas Jing Yuan,et al.  Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data , 2015, KDD.

[5]  Matthias W. Seeger,et al.  Bayesian Intermittent Demand Forecasting for Large Inventories , 2016, NIPS.

[6]  Vikas Kumar,et al.  "I like to explore sometimes": Adapting to Dynamic User Novelty Preferences , 2015, RecSys.

[7]  Jaideep Srivastava,et al.  Just in Time Recommendations: Modeling the Dynamics of Boredom in Activity Streams , 2015, WSDM.

[8]  Hongyuan Zha,et al.  Recurrent Poisson Factorization for Temporal Recommendation , 2020, IEEE Transactions on Knowledge and Data Engineering.

[9]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[10]  Alexander J. Smola,et al.  Neural Survival Recommender , 2017, WSDM.

[11]  Geoffrey J. Gordon,et al.  A Unified View of Matrix Factorization Models , 2008, ECML/PKDD.

[12]  David M. Blei,et al.  Zero-Inflated Exponential Family Embeddings , 2017, ICML.

[13]  Ravi Kumar,et al.  Modeling User Consumption Sequences , 2016, WWW.

[14]  Daniel P. W. Ellis,et al.  Codebook-based Scalable Music Tagging with Poisson Matrix Factorization , 2014, ISMIR.

[15]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16]  David M. Blei,et al.  Scalable Recommendation with Hierarchical Poisson Factorization , 2015, UAI.

[17]  J. T. Wulu,et al.  Regression analysis of count data , 2002 .

[18]  Diane Lambert,et al.  Zero-inflacted Poisson regression, with an application to defects in manufacturing , 1992 .

[19]  P. Diggle Analysis of Longitudinal Data , 1995 .

[20]  John Hinde,et al.  Models for count data with many zeros , 1998 .

[21]  D. Hall Zero‐Inflated Poisson and Binomial Regression with Random Effects: A Case Study , 2000, Biometrics.

[22]  David M. Blei,et al.  Modeling User Exposure in Recommendation , 2015, WWW.

[23]  Deborah Estrin,et al.  Multimedia proxy caching mechanism for quality adaptive streaming applications in the Internet , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[24]  P. Dayan,et al.  Cortical substrates for exploratory decisions in humans , 2006, Nature.