Leveraging Behavioral Heterogeneity Across Markets for Cross-Market Training of Recommender Systems

Modern recommender systems are optimised to deliver personalised recommendations to millions of users spread across different geographic regions exhibiting various forms of heterogeneity, including behavioural-, content- and trend specific heterogeneity. System designers often face the challenge of deploying either a single global model across all markets, or developing custom models for different markets. In this work, we focus on the specific case of music recommendation across 21 different markets, and consider the trade-off between developing global model versus market specific models. We begin by investigating behavioural differences across users of different markets, and motivate the need for considering market as an important factor when training models. We propose five different training styles, covering the entire spectrum of models: from a single global model to individual market specific models, and in the process, propose ways to identify and leverage users abroad, and data from similar markets. Based on a large scale experimentation with data for 100M users across 21 different markets, we present insights which highlight that markets play a key role, and describe models that leverage market specific data in serving personalised recommendations.

[1]  Iván Cantador,et al.  Cross-domain recommender systems : A survey of the State of the Art , 2012 .

[2]  Donna K. Harman,et al.  CLEF 2009: Grid@CLEF Pilot Track Overview , 2009, CLEF.

[3]  Stephen E. Robertson,et al.  On per-topic variance in IR evaluation , 2012, SIGIR '12.

[4]  Mark Sanderson,et al.  Sub-corpora Impact on System Effectiveness , 2017, SIGIR.

[5]  Gediminas Adomavicius,et al.  Context-aware recommender systems , 2008, RecSys '08.

[6]  Jun Wang,et al.  Bias-variance analysis in estimating true query model for information retrieval , 2014, Inf. Process. Manag..

[7]  R. Haase,et al.  Multivariate analysis of variance. , 1987 .

[8]  Paul Over,et al.  Blind Men and Elephants: Six Approaches to TREC data , 1999, Information Retrieval.

[9]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[10]  Bruce Ferwerda,et al.  Exploring Music Diversity Needs Across Countries , 2016, UMAP.

[11]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[12]  Nicola Ferro,et al.  Toward an anatomy of IR system component performances , 2018, J. Assoc. Inf. Sci. Technol..

[13]  Andrew Rutherford,et al.  ANOVA and ANCOVA: A GLM Approach , 2011 .

[14]  Nicola Ferro,et al.  A General Linear Mixed Models Approach to Study System Component Effects , 2016, SIGIR.

[15]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[16]  Markus Schedl,et al.  Tailoring Music Recommendations to Users by Considering Diversity, Mainstreaminess, and Novelty , 2015, SIGIR.

[17]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[18]  J. Shane Culpepper,et al.  Statistical comparisons of non-deterministic IR systems using two dimensional variance , 2015, Inf. Process. Manag..

[19]  Roberto Turrin,et al.  Cross-Domain Recommender Systems , 2015, Recommender Systems Handbook.

[20]  Arbee L. P. Chen,et al.  A music recommendation system based on music data grouping and user interests , 2001, CIKM '01.

[21]  James Blustein,et al.  A Statistical Analysis of the TREC-3 Data , 1995, TREC.

[22]  Xiaodong He,et al.  A Multi-View Deep Learning Approach for Cross Domain User Modeling in Recommendation Systems , 2015, WWW.

[23]  J. Shane Culpepper,et al.  On Topic Difficulty in IR Evaluation: The Effect of Systems, Corpora, and System Components , 2019, SIGIR.

[24]  Katayoun Farrahi,et al.  Impact of Listening Behavior on Music Recommendation , 2014, ISMIR.

[25]  Ben Carterette,et al.  Multiple testing in statistical analysis of systems-based information retrieval experiments , 2012, TOIS.

[26]  Iván Cantador,et al.  A generic semantic-based framework for cross-domain recommendation , 2011, HetRec '11.

[27]  Qiang Yang,et al.  Active Transfer Learning for Cross-System Recommendation , 2013, AAAI.

[28]  Il Im,et al.  Does a one-size recommendation system fit all? the effectiveness of collaborative filtering based recommendation systems across different domains and search modes , 2007, TOIS.

[29]  Mark Sanderson,et al.  Using Collection Shards to Study Retrieval Performance Effect Sizes , 2019, ACM Trans. Inf. Syst..

[30]  Yi Zhang,et al.  Efficient bayesian hierarchical user modeling for recommendation system , 2007, SIGIR.

[31]  Zhongqi Lu,et al.  Selective Transfer Learning for Cross Domain Recommendation , 2012, SDM.