Cooperative Data-Driven Distributionally Robust Optimization

We study a class of multiagent stochastic optimization problems where the objective is to minimize the expected value of a function which depends on a random variable. The probability distribution of the random variable is unknown to the agents. The agents aim to cooperatively find, using their collected data, a solution with guaranteed out-of-sample performance. The approach is to formulate a data-driven distributionally robust optimization problem using Wasserstein ambiguity sets, which turns out to be equivalent to a convex program. We reformulate the latter as a distributed optimization problem and identify a convex–concave augmented Lagrangian, whose saddle points are in correspondence with the optimizers, provided a min–max interchangeability criteria is met. Our distributed algorithm design, then consists of the saddle-point dynamics associated to the augmented Lagrangian. We formally establish that the trajectories converge asymptotically to a saddle point and, hence, an optimizer of the problem. Finally, we identify classes of functions that meet the min–max interchangeability criteria.

[1]  Chaoyue Zhao,et al.  Data-driven risk-averse stochastic program and renewable energy integration , 2014 .

[2]  Mikael Johansson,et al.  A Randomized Incremental Subgradient Method for Distributed Optimization in Networked Systems , 2009, SIAM J. Optim..

[3]  Andrea Bacciotti,et al.  Nonpathological Lyapunov functions and discontinuous Carathéodory systems , 2006, Autom..

[4]  Angelia Nedić,et al.  Distributed Optimization , 2015, Encyclopedia of Systems and Control.

[5]  A. Guillin,et al.  On the rate of convergence in Wasserstein distance of the empirical measure , 2013, 1312.2128.

[6]  Michael D. Lemmon,et al.  Event-triggered distributed optimization in sensor networks , 2009, 2009 International Conference on Information Processing in Sensor Networks.

[7]  Xi Chen,et al.  Wasserstein Distributional Robustness and Regularization in Statistical Learning , 2017, ArXiv.

[8]  Jorge Cortés,et al.  Noise-to-state exponentially stable distributed convex optimization on weight-balanced digraphs , 2013, 52nd IEEE Conference on Decision and Control.

[9]  Sonia Martínez,et al.  On Distributed Convex Optimization Under Inequality and Equality Constraints , 2010, IEEE Transactions on Automatic Control.

[10]  Xi Chen,et al.  Wasserstein Distributional Robustness and Regularization in Statistical Learning , 2017, 1712.06050.

[11]  S. Lang Real and Functional Analysis , 1983 .

[12]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[13]  Sonia Martínez,et al.  Distributed convex optimization via continuous-time coordination algorithms with discrete-time communication , 2014, Autom..

[14]  Garud Iyengar,et al.  Ambiguous chance constrained problems and robust optimization , 2006, Math. Program..

[15]  Ruiwei Jiang,et al.  Data-driven chance constrained stochastic program , 2015, Mathematical Programming.

[16]  A. Nagurney,et al.  Projected Dynamical Systems and Variational Inequalities with Applications , 1995 .

[17]  Ashish Cherukuri,et al.  Data-driven distributed optimization using Wasserstein ambiguity sets , 2017, 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[18]  Peter Richtárik,et al.  Federated Optimization: Distributed Machine Learning for On-Device Intelligence , 2016, ArXiv.

[19]  Yinyu Ye,et al.  Distributionally Robust Optimization Under Moment Uncertainty with Application to Data-Driven Problems , 2010, Oper. Res..

[20]  Alexander Shapiro,et al.  Stochastic Approximation approach to Stochastic Programming , 2013 .

[21]  Tito Homem-de-Mello,et al.  Monte Carlo sampling-based methods for stochastic optimization , 2014 .

[22]  Enrique Mallada,et al.  The Role of Convexity in Saddle-Point Dynamics: Lyapunov Function and Robustness , 2016, IEEE Transactions on Automatic Control.

[23]  A. Ozdaglar,et al.  Existence of Global Minima for Constrained Optimization , 2006 .

[24]  Enrique Mallada,et al.  Asymptotic convergence of constrained primal-dual dynamics , 2015, Syst. Control. Lett..

[25]  Daniel Kuhn,et al.  Distributionally Robust Convex Optimization , 2014, Oper. Res..

[26]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[27]  Jing Wang,et al.  A control perspective for centralized and distributed convex optimization , 2011, IEEE Conference on Decision and Control and European Control Conference.

[28]  Daniel Kuhn,et al.  Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations , 2015, Mathematical Programming.

[29]  ASHISH CHERUKURI,et al.  Saddle-Point Dynamics: Conditions for Asymptotic Stability of Saddle Points , 2015, SIAM J. Control. Optim..

[30]  Asuman E. Ozdaglar,et al.  Distributed Subgradient Methods for Multi-Agent Optimization , 2009, IEEE Transactions on Automatic Control.

[31]  F. Clarke Optimization And Nonsmooth Analysis , 1983 .

[32]  Karthyek R. A. Murthy,et al.  Quantifying Distributional Model Risk Via Optimal Transport , 2016, Math. Oper. Res..

[33]  Rafal Goebel,et al.  Stability and robustness for saddle-point dynamics through monotone mappings , 2017, Syst. Control. Lett..

[34]  Dennis S. Bernstein,et al.  Nontangency-Based Lyapunov Tests for Convergence and Stability in Systems Having a Continuum of Equilibria , 2003, SIAM J. Control. Optim..

[35]  Jorge Cortés,et al.  Distributed Saddle-Point Subgradient Algorithms With Laplacian Averaging , 2015, IEEE Transactions on Automatic Control.

[36]  M. KarthyekRajhaaA.,et al.  Robust Wasserstein profile inference and applications to machine learning , 2019, J. Appl. Probab..

[37]  Sanjay Mehrotra,et al.  Decomposition Algorithm for Distributionally Robust Optimization using Wasserstein Metric , 2017, 1704.03920.

[38]  Vishal Gupta,et al.  Robust sample average approximation , 2014, Math. Program..

[39]  J. Cortés Discontinuous dynamical systems , 2008, IEEE Control Systems.

[40]  Bahman Gharesifard,et al.  Distributed Continuous-Time Convex Optimization on Weight-Balanced Digraphs , 2012, IEEE Transactions on Automatic Control.

[41]  Zhaolin Hu,et al.  Kullback-Leibler divergence constrained distributionally robust optimization , 2012 .

[42]  Francesco Bullo,et al.  Distributed Control of Robotic Networks , 2009 .

[43]  K. I. M. McKinnon,et al.  On Saddle Points of Augmented Lagrangians for Constrained Nonconvex Optimization , 2005, SIAM J. Optim..

[44]  Robert D. Nowak,et al.  Quantized incremental algorithms for distributed optimization , 2005, IEEE Journal on Selected Areas in Communications.

[45]  Angelia Nedic,et al.  Subgradient Methods for Saddle-Point Problems , 2009, J. Optimization Theory and Applications.

[46]  A. Kleywegt,et al.  Distributionally Robust Stochastic Optimization with Wasserstein Distance , 2016, Math. Oper. Res..