Inferring the Causal Impact of New Track Releases on Music Recommendation Platforms through Counterfactual Predictions

With over 20,000 tracks being released each day, recommendation systems that power music streaming services should not only be responsive to such large volumes of content, but also be adept at understanding the impact of such new releases on, both, users’ listening behavior and popularity of artists. Inferring the causal impact of new track releases is critical to fully characterizing the interplay between artists and listeners, as well as among the artists. In this study, we infer and quantify causality using a diffusion-regression state-space model that constructs counterfactual outcomes using a set of synthetic controls, which predict potential outcomes in absence of the intervention. Based on large scale experiments spanning over 21 million users and 1 billion streams on a real world streaming platform, our findings suggest that releasing a new track has a positive impact on the popularity of other tracks by the same artist. Interestingly, other related and competing artists also benefit from a new track release, which hints at the presence of a positive platform-effect wherein some artists gain significantly from activities of other artists.

[1]  Kavé Salamatian,et al.  Modeling and predicting the popularity of online contents with Cox proportional hazard regression model , 2012, Neurocomputing.

[2]  Changhe Yuan,et al.  Most Relevant Explanation: Properties, Algorithms, and Evaluations , 2009, UAI.

[3]  Mounia Lalmas,et al.  Bandit based Optimization of Multiple Objectives on a Music Streaming Platform , 2020, KDD.

[4]  Mounia Lalmas,et al.  Algorithmic Effects on the Diversity of Consumption on Spotify , 2020, WWW.

[5]  Eugene Agichtein,et al.  Understanding Music Listening Intents During Daily Activities with Implications for Contextual Music Recommendation , 2018, CHIIR.

[6]  Krishna P. Gummadi,et al.  A measurement-driven analysis of information propagation in the flickr social network , 2009, WWW '09.

[7]  WangJia,et al.  Towards automated performance diagnosis in a large IPTV network , 2009 .

[8]  Steven L. Scott,et al.  Inferring causal impact using Bayesian structural time-series models , 2015, 1506.00356.

[9]  Fernando Diaz,et al.  Towards a Fair Marketplace: Counterfactual Evaluation of the trade-off between Relevance, Fairness & Satisfaction in Recommendation Systems , 2018, CIKM.

[10]  Bernardo A. Huberman,et al.  The Pulse of News in Social Media: Forecasting Popularity , 2012, ICWSM.

[11]  Mounia Lalmas,et al.  Jointly Leveraging Intent and Interaction Signals to Predict User Satisfaction with Slate Recommendations , 2019, WWW.

[12]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[13]  Qi Zhao,et al.  Towards automated performance diagnosis in a large IPTV network , 2009, SIGCOMM '09.

[14]  Kavé Salamatian,et al.  An Approach to Model and Predict the Popularity of Online Contents with Explanatory Factors , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[15]  Bruce Ferwerda,et al.  Personality Traits Predict Music Taxonomy Preferences , 2015, CHI Extended Abstracts.

[16]  Bart J. Bronnenberg,et al.  Changing Their Tune: How Consumers' Adoption of Online Streaming Affects Music Consumption and Discovery , 2017, Mark. Sci..

[17]  Maarten de Rijke,et al.  News Comments: Exploring, Modeling, and Online Prediction , 2010, ECIR.

[18]  Yan Liu,et al.  Granger Causality for Time-Series Anomaly Detection , 2012, 2012 IEEE 12th International Conference on Data Mining.

[19]  Berkant Barla Cambazoglu,et al.  On the feasibility of predicting popular news at cold start , 2017, J. Assoc. Inf. Sci. Technol..

[20]  Oren Barkan,et al.  Groove Radio: A Bayesian Hierarchical Model for Personalized Playlist Generation , 2017, WSDM.

[21]  Huzefa Rangwala,et al.  Digging Digg: Comment Mining, Popularity Prediction, and Social Network Analysis , 2009, 2009 International Conference on Web Information Systems and Mining.

[22]  Yan Liu,et al.  Towards Twitter context summarization with user influence models , 2013, WSDM.

[23]  Jin Tian,et al.  Probabilities of causation: Bounds and identification , 2000, Annals of Mathematics and Artificial Intelligence.