Discrete Wasserstein barycenters: optimal transport for discrete data

Wasserstein barycenters correspond to optimal solutions of transportation problems for several marginals, and as such have a wide range of applications ranging from economics to statistics and computer science. When the marginal probability measures are absolutely continuous (or vanish on small sets) the theory of Wasserstein barycenters is well-developed [see the seminal paper (Agueh and Carlier in SIAM J Math Anal 43(2):904–924, 2011)]. However, exact continuous computation of Wasserstein barycenters in this setting is tractable in only a small number of specialized cases. Moreover, in many applications data is given as a set of probability measures with finite support. In this paper, we develop theoretical results for Wasserstein barycenters in this discrete setting. Our results rely heavily on polyhedral theory which is possible due to the discrete structure of the marginals. The results closely mirror those in the continuous case with a few exceptions. In this discrete setting we establish that Wasserstein barycenters must also be discrete measures and there is always a barycenter which is provably sparse. Moreover, for each Wasserstein barycenter there exists a non-mass-splitting optimal transport to each of the discrete marginals. Such non-mass-splitting transports do not generally exist between two discrete measures unless special mass balance conditions hold. This makes Wasserstein barycenters in this discrete setting special in this regard. We illustrate the results of our discrete barycenter theory with a proof-of-concept computation for a hypothetical transportation problem with multiple marginals: distributing a fixed set of goods when the demand can take on different distributional shapes characterized by the discrete marginal distributions. A Wasserstein barycenter, in this case, represents an optimal distribution of inventory facilities which minimize the squared distance/transportation cost totaled over all demands.

[1]  H. Kellerer Duality theorems for marginal problems , 1984 .

[2]  S. Rachev The Monge–Kantorovich Mass Transference Problem and Its Stochastic Applications , 1985 .

[3]  Shuzhong Zhang On the Strictly Complementary Slackness Relation in Linear Programming , 1994 .

[4]  W. Gangbo,et al.  Optimal maps for the multidimensional Monge-Kantorovich problem , 1998 .

[5]  Anil K. Jain,et al.  Deformable template models: A review , 1998, Signal Process..

[6]  C. Villani Topics in Optimal Transportation , 2003 .

[7]  Alain Trouvé,et al.  Local Geometry of Deformable Templates , 2005, SIAM J. Math. Anal..

[8]  P. Chiappori,et al.  Hedonic price equilibria, stable matching, and optimal transport: equivalence, topology, and uniqueness , 2007 .

[9]  C. Villani Optimal Transport: Old and New , 2008 .

[10]  G. Carlier,et al.  Matching for teams , 2010 .

[11]  Brendan Pass,et al.  On the local structure of optimal measures in the multi-marginal optimal transportation problem , 2010, 1005.2162.

[12]  Brendan Pass,et al.  Uniqueness and Monge Solutions in the Multimarginal Optimal Transportation Problem , 2010, SIAM J. Math. Anal..

[13]  S. Mukherjee,et al.  Probability measures on the space of persistence diagrams , 2011 .

[14]  Guillaume Carlier,et al.  Barycenters in the Wasserstein Space , 2011, SIAM J. Math. Anal..

[15]  Thibaut Le Gouic,et al.  Distribution's template estimate with Wasserstein metrics , 2011, 1111.5927.

[16]  Julien Rabin,et al.  Wasserstein Barycenter and Its Application to Texture Mixing , 2011, SSVM.

[17]  Codina Cotar,et al.  Density Functional Theory and Optimal Transportation with Coulomb Cost , 2011, 1104.0603.

[18]  Brendan Pass Multi-marginal optimal transport and multi-agent matching problems: uniqueness and structure of solutions , 2012, 1210.7372.

[19]  Brendan Pass Optimal transportation with infinitely many marginals , 2012, 1206.5515.

[20]  Enac,et al.  Characterization of barycenters in the Wasserstein space by averaging optimal transport maps , 2012, 1212.2562.

[21]  G. Buttazzo,et al.  Optimal-transport formulation of electronic density-functional theory , 2012, 1205.4514.

[22]  Jérémie Bigot,et al.  Consistent estimation of a population barycenter in the Wasserstein space , 2013 .

[23]  Nizar Touzi,et al.  A Stochastic Control Approach to No-Arbitrage Bounds Given Marginals, with an Application to Lookback Options , 2013, 1401.3921.

[24]  Mathias Beiglböck,et al.  Model-independent bounds for option prices—a mass transport approach , 2011, Finance Stochastics.

[25]  A. Galichon,et al.  A stochastic control approach to no-arbitrage bounds given marginals, with an application to lookback options , 2014, 1401.3921.

[26]  Sayan Mukherjee,et al.  Fréchet Means for Distributions of Persistence Diagrams , 2012, Discrete & Computational Geometry.

[27]  Adam M. Oberman,et al.  NUMERICAL METHODS FOR MATCHING FOR TEAMS AND WASSERSTEIN BARYCENTERS , 2014, 1411.3602.

[28]  Arnaud Doucet,et al.  Fast Computation of Wasserstein Barycenters , 2013, ICML.

[29]  Probabilistic Fréchet means for time varying persistence diagrams , 2013, 1307.6530.

[30]  Iain Dunning,et al.  Computing in Operations Research Using Julia , 2013, INFORMS J. Comput..

[31]  Alan Edelman,et al.  Julia: A Fresh Approach to Numerical Computing , 2014, SIAM Rev..