Learning to Recommend via Inverse Optimal Matching

We consider recommendation in the context of optimal matching, i.e., we need to pair or match a user with an item in an optimal way. The framework is particularly relevant when the supply of an individual item is limited and it can only satisfy a small number of users even though it may be preferred by many. We leverage the methodology of optimal transport of discrete distributions and formulate an inverse optimal transport problem in order to learn the cost which gives rise to the observed matching. It leads to a non-convex optimization problem which is solved by alternating optimization. A key novel aspect of our formulation is the incorporation of marginal relaxation via regularized Wasserstein distance, significantly improving the robustness of the method in the face of observed empirical matchings. Our model has wide applicability including labor market, online dating, college application recommendation. We back up our claims with experiments on both synthetic data and real world datasets.

[1]  Craig Boutilier,et al.  A Framework for Optimizing Paper Matching , 2011, UAI.

[2]  Yu Qian,et al.  Differential Geometric Retrieval of Deep Features , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[3]  Bernhard Schölkopf,et al.  The Kernel Trick for Distances , 2000, NIPS.

[4]  G. Becker Chapter Title: a Theory of Marriage a Theory of Marriage , 2022 .

[5]  Jerome Sacks,et al.  Ecological Regression and Voting Rights , 1991 .

[6]  Gabriel Peyré,et al.  Stochastic Optimization for Large-scale Optimal Transport , 2016, NIPS.

[7]  Gabriel Peyré,et al.  Learning Generative Models with Sinkhorn Divergences , 2017, AISTATS.

[8]  Tuomas Sandholm,et al.  FutureMatch: Combining Human Value Judgments and Machine Learning to Match in Dynamic Environments , 2015, AAAI.

[9]  Arnaud Doucet,et al.  Fast Computation of Wasserstein Barycenters , 2013, ICML.

[10]  Marco Cuturi,et al.  Sinkhorn Distances: Lightspeed Computation of Optimal Transport , 2013, NIPS.

[11]  L. A. Goodman Ecological Regressions and Behavior of Individuals , 1953 .

[12]  Gabriel Peyré,et al.  Computational Optimal Transport , 2018, Found. Trends Mach. Learn..

[13]  A. Galichon,et al.  Matching with Trade-Offs: Revealed Preferences Over Competing Characteristics , 2009, 2102.12811.

[14]  Leonidas J. Guibas,et al.  Supervised Earth Mover's Distance Learning and Its Computer Vision Applications , 2012, ECCV.

[15]  Mark Fuge,et al.  Diverse Weighted Bipartite b-Matching , 2017, IJCAI.

[16]  Alfred Kobsa User Modeling and User-Adapted Interaction , 2005, User Modeling and User-Adapted Interaction.

[17]  N. Papadakis Optimal Transport for Image Processing , 2015 .

[18]  Ariel D. Procaccia,et al.  Dynamic Matching via Weighted Myopia with Application to Kidney Exchange , 2012, AAAI.

[19]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[20]  Michael Werman,et al.  Fast and robust Earth Mover's Distances , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[21]  L. Shapley,et al.  College Admissions and the Stability of Marriage , 1962 .

[22]  David Avis,et al.  Ground metric learning , 2011, J. Mach. Learn. Res..

[23]  David Autor Wiring the Labor Market , 2000 .

[24]  Gabriel Peyré,et al.  Fast Dictionary Learning with a Smoothed Wasserstein Loss , 2016, AISTATS.

[25]  Morteza Zadimoghaddam,et al.  Proportional Allocation: Simple, Distributed, and Diverse Matching with High Entropy , 2018, ICML.

[26]  Paul R. Milgrom,et al.  Envelope Theorems for Arbitrary Choice Sets , 2002 .

[27]  A. Dupuy,et al.  Estimating Matching Affinity Matrix Under Low-Rank Constraints , 2016, Information and Inference: A Journal of the IMA.

[28]  Deepak Agarwal,et al.  Regression-based latent factor models , 2009, KDD.

[29]  Vaclav Petricek,et al.  Recommender System for Online Dating Service , 2007, ArXiv.

[30]  Richard Sinkhorn,et al.  Concerning nonnegative matrices and doubly stochastic matrices , 1967 .

[31]  Alexander J. Smola,et al.  Who Supported Obama in 2012?: Ecological Inference through Distribution Regression , 2015, KDD.

[32]  Tat-Seng Chua,et al.  Neural Factorization Machines for Sparse Predictive Analytics , 2017, SIGIR.

[33]  Lior Rokach,et al.  Introduction to Recommender Systems Handbook , 2011, Recommender Systems Handbook.

[34]  Gary King,et al.  A Solution to the Ecological Inference Problem: Reconstructing Individual Behavior from Aggregate Data , 1998 .

[35]  Lénaïc Chizat,et al.  Scaling Algorithms for Unbalanced Transport Problems , 2016, 1607.05816.

[36]  Judy Kay,et al.  Recommending people to people The nature of reciprocal recommenders with a case study in online dating , 2012 .

[37]  Michael J. Pazzani,et al.  Content-Based Recommendation Systems , 2007, The Adaptive Web.

[38]  Ievgen Redko,et al.  Co-clustering through Optimal Transport , 2017, ICML.

[39]  Nicolas Courty,et al.  Mapping Estimation for Discrete Optimal Transport , 2016, NIPS.

[40]  Roberto Cominetti,et al.  Asymptotic analysis of the exponential penalty trajectory in linear programming , 1994, Math. Program..

[41]  Inderjit S. Dhillon,et al.  The Metric Nearness Problem , 2008, SIAM J. Matrix Anal. Appl..

[42]  Andrew Y. Ng,et al.  Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[43]  J. Lorenz,et al.  On the scaling of multidimensional matrices , 1989 .

[44]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[45]  Alvin E. Roth,et al.  The College Admissions Problem Revisited , 1989 .

[46]  Li Bai,et al.  Cosine Similarity Metric Learning for Face Verification , 2010, ACCV.

[47]  G. Carlier,et al.  Matching for teams , 2010 .

[48]  Kwok-Wai Cheung,et al.  Learning User Similarity and Rating Style for Collaborative Recommendation , 2003, Information Retrieval.

[49]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[50]  Yiqun Liu,et al.  Economic Recommendation with Surplus Maximization , 2016, WWW.

[51]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[52]  A. Wilson in the Theory of Trip Distribution, Mode Split and Route Split , 2016 .

[53]  P. Chiappori,et al.  Hedonic price equilibria, stable matching, and optimal transport: equivalence, topology, and uniqueness , 2007 .

[54]  C. Villani Optimal Transport: Old and New , 2008 .

[55]  Cordelia Schmid,et al.  Is that you? Metric learning approaches for face identification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[56]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[57]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[58]  Robin D. Burke,et al.  Hybrid Recommender Systems: Survey and Experiments , 2002, User Modeling and User-Adapted Interaction.

[59]  A Wilson,et al.  Use of entropy maximizing models in theory of trip distribution, mode split and route split , 1969 .

[60]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[61]  Gabriel Peyré,et al.  Iterative Bregman Projections for Regularized Transportation Problems , 2014, SIAM J. Sci. Comput..

[62]  Aranyak Mehta,et al.  Online Matching and Ad Allocation , 2013, Found. Trends Theor. Comput. Sci..

[63]  Amos Azaria,et al.  Movie recommender system for profit maximization , 2013, AAAI.

[64]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[65]  Itai Ashlagi,et al.  On Matching and Thickness in Heterogeneous Dynamic Markets , 2017 .

[66]  Kamal Kant Bharadwaj,et al.  A collaborative filtering framework for friends recommendation in social networks based on interaction intensity and adaptive user similarity , 2012, Social Network Analysis and Mining.

[67]  Frank Nielsen,et al.  Tsallis Regularized Optimal Transport and Ecological Inference , 2016, AAAI.

[68]  Shangrong Huang,et al.  Friend recommendation in social multimedia networks , 2017 .

[69]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.