Coordinate Dual Averaging for Decentralized Online Optimization With Nonseparable Global Objectives

We consider a decentralized online convex optimization problem in a network of agents, where each agent controls only a coordinate (or a part) of the global decision vector. For such a problem, we propose two decentralized variants (ODA-C and ODA-PS) of Nesterov's primal-dual algorithm with dual averaging. In ODA-C, to mitigate the disagreements on the primal-vector updates, the agents implement a generalization of the local information-exchange dynamics recently proposed by Li and Marden [1] over a static undirected graph. In ODA-PS, the agents implement the broadcast-based push-sum dynamics [2] over a time-varying sequence of uniformly connected digraphs. We show that the regret bounds in both cases have sublinear growth of <inline-formula><tex-math notation="LaTeX">$O(\sqrt{T})$</tex-math></inline-formula>, with the time horizon <inline-formula><tex-math notation="LaTeX">$T$</tex-math></inline-formula>, when the stepsize is of the form <inline-formula><tex-math notation="LaTeX">$1/\sqrt{t}$</tex-math></inline-formula> and the objective functions are Lipschitz-continuous convex functions with Lipschitz gradients. We also implement the proposed algorithms on a sensor network to complement our theoretical analysis.

[1]  Sameera S. Ponda,et al.  Trajectory Optimization for Target Localization Using Small Unmanned Aerial Vehicles , 2009 .

[2]  Magnus Egerstedt,et al.  Graph Theoretic Methods in Multiagent Networks , 2010, Princeton Series in Applied Mathematics.

[3]  Asuman E. Ozdaglar,et al.  Distributed Subgradient Methods for Multi-Agent Optimization , 2009, IEEE Transactions on Automatic Control.

[4]  Michael Athans,et al.  Convergence and asymptotic agreement in distributed decision problems , 1982, 1982 21st IEEE Conference on Decision and Control.

[5]  Michael G. Rabbat,et al.  Consensus-based distributed optimization: Practical issues and applications in large-scale machine learning , 2012, 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[6]  Michael G. Rabbat,et al.  Push-Sum Distributed Dual Averaging for convex optimization , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[7]  Tim Lüth,et al.  Distributed Autonomous Robotic Systems , 1999, Robotics Auton. Syst..

[8]  Anna Scaglione,et al.  Distributed Constrained Optimization by Consensus-Based Primal-Dual Perturbation Method , 2013, IEEE Transactions on Automatic Control.

[9]  Johannes Gehrke,et al.  Gossip-based computation of aggregate information , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[10]  John N. Tsitsiklis,et al.  Problems in decentralized decision making and computation , 1984 .

[11]  José M. F. Moura,et al.  Cooperative Convex Optimization in Networked Systems: Augmented Lagrangian Algorithms With Directed Gossip Communication , 2010, IEEE Transactions on Signal Processing.

[12]  Angelia Nedic,et al.  On the rate of convergence of distributed subgradient methods for multi-agent optimization , 2007, 2007 46th IEEE Conference on Decision and Control.

[13]  Jorge Cortés,et al.  Distributed Online Convex Optimization Over Jointly Connected Digraphs , 2014, IEEE Transactions on Network Science and Engineering.

[14]  Angelia Nedic,et al.  Distributed Asynchronous Constrained Stochastic Optimization , 2011, IEEE Journal of Selected Topics in Signal Processing.

[15]  Angelia Nedic,et al.  Distributed optimization over time-varying directed graphs , 2013, 52nd IEEE Conference on Decision and Control.

[16]  Asuman E. Ozdaglar,et al.  On the O(1=k) convergence of asynchronous distributed alternating Direction Method of Multipliers , 2013, 2013 IEEE Global Conference on Signal and Information Processing.

[17]  Lin Xiao,et al.  Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization , 2009, J. Mach. Learn. Res..

[18]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[19]  M. Raginsky,et al.  Decentralized Online Convex Programming with local information , 2011, Proceedings of the 2011 American Control Conference.

[20]  Martin J. Wainwright,et al.  Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling , 2010, IEEE Transactions on Automatic Control.

[21]  Asuman E. Ozdaglar,et al.  Distributed Alternating Direction Method of Multipliers , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[22]  David Tse,et al.  Optimal Distributed Voltage Regulation in Power Distribution Networks , 2012, ArXiv.

[23]  Gesualdo Scutari,et al.  NEXT: In-Network Nonconvex Optimization , 2016, IEEE Transactions on Signal and Information Processing over Networks.

[24]  Angelia Nedic,et al.  Distributed Stochastic Subgradient Projection Algorithms for Convex Optimization , 2008, J. Optim. Theory Appl..

[25]  Mikael Johansson,et al.  A simple peer-to-peer algorithm for distributed optimization in sensor networks , 2007, 2007 46th IEEE Conference on Decision and Control.

[26]  Peter Stoica,et al.  Source localization from range-difference measurements , 2006 .

[27]  Elizabeth L. Wilmer,et al.  Markov Chains and Mixing Times , 2008 .

[28]  Mehran Mesbahi,et al.  Online Distributed Optimization on Dynamic Networks , 2014, ArXiv.

[29]  José M. F. Moura,et al.  Fast Distributed Gradient Methods , 2011, IEEE Transactions on Automatic Control.

[30]  Qing Ling,et al.  Decentralized Dynamic Optimization Through the Alternating Direction Method of Multipliers , 2013, IEEE Transactions on Signal Processing.

[31]  R.M. Murray,et al.  On a decentralized active sensing strategy using mobile sensor platforms in a network , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[32]  Yurii Nesterov,et al.  Primal-dual subgradient methods for convex problems , 2005, Math. Program..

[33]  John N. Tsitsiklis,et al.  Weighted Gossip: Distributed Averaging using non-doubly stochastic matrices , 2010, 2010 IEEE International Symposium on Information Theory.

[34]  Soummya Kar,et al.  Distributed Consensus Algorithms in Sensor Networks: Quantized Data and Random Link Failures , 2007, IEEE Transactions on Signal Processing.

[35]  Lindsay Kleeman,et al.  Mobile Robot Sonar for Target Localization and Classification , 1995, Int. J. Robotics Res..

[36]  Bahman Gharesifard,et al.  Distributed subgradient-push online convex optimization on time-varying directed graphs , 2014, 2014 52nd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[37]  Jason R. Marden,et al.  Designing games for distributed optimization , 2011, IEEE Conference on Decision and Control and European Control Conference.

[38]  John N. Tsitsiklis,et al.  Distributed Asynchronous Deterministic and Stochastic Gradient Optimization Algorithms , 1984, 1984 American Control Conference.

[39]  Bahman Gharesifard,et al.  Distributed Continuous-Time Convex Optimization on Weight-Balanced Digraphs , 2012, IEEE Transactions on Automatic Control.

[40]  Francesco Bullo,et al.  Distributed Control of Robotic Networks , 2009 .

[41]  Angelia Nedic,et al.  Stochastic Gradient-Push for Strongly Convex Functions on Time-Varying Directed Graphs , 2014, IEEE Transactions on Automatic Control.