Gradient-Consensus: Linearly Convergent Distributed Optimization Algorithm over Directed Graphs

In this article, we focus on a multi-agent optimization problem of minimizing a sum, $f = \sum_{i=1}^n f_i$, of convex objective functions $f_i$'s, where, $f_i$ is only available locally to the agent $i$, over a network of $n$ agents. The agents communicate only to their neighbors connected through a directed edge governed by a directed graph. In this article, we propose a "optimize then agree" framework to decouple the gradient-descent step and the consensus step in the distributed optimization algorithms. Using this framework we develop a novel distributed algorithm to solve the above multi-agent convex optimization problem. In this method, each agent maintains an estimate of the optimal solution. During each iteration of the proposed algorithm, the agents utilize locally available gradient information along with a finite-time approximate consensus protocol to move towards the optimal solution (hence the name "gradient-consensus" method). We establish that the proposed algorithm converges with a global linear rate if the aggregate function $f$ is strongly convex and smooth. We also show that the proposed method has a linear rate of convergence (in terms of number of iterations), compared to prior state-of-the-art, until reaching an $O(\eta)$ neighborhood of the optimal objective function value, for a given $\eta > 0$, under the relaxed assumption of $f_i$'s being convex and smooth. To the best of our knowledge, the proposed method achieves a convergence rate better than the best known rate estimates existing in the literature under these assumptions. Further, we numerically evaluate the proposed algorithm by solving two distributed optimization problems. The results show that the proposed \textit{GradConsensus} algorithm has a reduced computational footprint compared to the existing distributed optimization schemes for a similar accuracy solution.

[1]  Qing Ling,et al.  A Proximal Gradient Algorithm for Decentralized Composite Optimization , 2015, IEEE Transactions on Signal Processing.

[2]  H. Vincent Poor,et al.  Distributed learning in wireless sensor networks , 2005, IEEE Signal Processing Magazine.

[3]  Sourav Patel,et al.  Distributed Stopping Criterion for Ratio Consensus , 2018, 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[4]  Laurent Massoulié,et al.  Optimal Algorithms for Smooth and Strongly Convex Distributed Optimization in Networks , 2017, ICML.

[5]  Sourav Patel,et al.  Distributed apportioning in a power network for providing demand response services , 2017, 2017 IEEE International Conference on Smart Grid Communications (SmartGridComm).

[6]  Alexander Olshevsky,et al.  Linear Time Average Consensus and Distributed Optimization on Fixed Graphs , 2017, SIAM J. Control. Optim..

[7]  Ali H. Sayed,et al.  Diffusion Adaptation Strategies for Distributed Optimization and Learning Over Networks , 2011, IEEE Transactions on Signal Processing.

[8]  H. Vincent Poor,et al.  A Collaborative Training Algorithm for Distributed Learning , 2009, IEEE Transactions on Information Theory.

[9]  José M. F. Moura,et al.  Fast Distributed Gradient Methods , 2011, IEEE Transactions on Automatic Control.

[10]  Angelia Nedic,et al.  Stochastic Gradient-Push for Strongly Convex Functions on Time-Varying Directed Graphs , 2014, IEEE Transactions on Automatic Control.

[11]  Asuman E. Ozdaglar,et al.  Distributed Subgradient Methods for Multi-Agent Optimization , 2009, IEEE Transactions on Automatic Control.

[12]  Albert S. Berahas,et al.  Balancing Communication and Computation in Distributed Optimization , 2017, IEEE Transactions on Automatic Control.

[13]  Chenguang Xi,et al.  On the Linear Convergence of Distributed Optimization over Directed Graphs , 2015, 1510.02149.

[14]  Wei Ren,et al.  Information consensus in multivehicle cooperative control , 2007, IEEE Control Systems.

[15]  Sourav Patel,et al.  Gradient-Consensus Method for Distributed Optimization in Directed Multi-Agent Networks , 2019, 2020 American Control Conference (ACC).

[16]  Alexander Olshevsky,et al.  Efficient information aggregation strategies for distributed control and signal processing , 2010, 1009.6036.

[17]  Karl Henrik Johansson,et al.  Subgradient methods and consensus algorithms for solving convex optimization problems , 2008, 2008 47th IEEE Conference on Decision and Control.

[18]  Stephen P. Boyd,et al.  Fast linear iterations for distributed averaging , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[19]  Qing Ling,et al.  EXTRA: An Exact First-Order Algorithm for Decentralized Consensus Optimization , 2014, 1404.6264.

[20]  Murti V. Salapaka,et al.  Distributed Stopping Criterion for Consensus in the Presence of Delays , 2020, IEEE Transactions on Control of Network Systems.

[21]  Asuman E. Ozdaglar,et al.  Distributed Subgradient Methods for Convex Optimization Over Random Networks , 2011, IEEE Transactions on Automatic Control.

[22]  Robert Nowak,et al.  Distributed optimization in sensor networks , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[23]  Wotao Yin,et al.  ExtraPush for Convex Smooth Decentralized Optimization over Directed Networks , 2015, ArXiv.

[24]  Qing Ling,et al.  On the Convergence of Decentralized Gradient Descent , 2013, SIAM J. Optim..

[25]  Angelia Nedic,et al.  Asynchronous Broadcast-Based Convex Optimization Over a Network , 2011, IEEE Transactions on Automatic Control.

[26]  Sourav Patel,et al.  Distributed Power Apportioning with Early Dispatch for Ancillary Services in Renewable Grids , 2020, ArXiv.

[27]  B. Bollobás The evolution of random graphs , 1984 .

[28]  Sourav Patel,et al.  Distributed Finite-Time Termination for Consensus Algorithm in Switching Topologies , 2019, IEEE Transactions on Network Science and Engineering.

[29]  Wei Shi,et al.  A Decentralized Proximal-Gradient Method With Network Independent Step-Sizes and Separated Convergence Rates , 2017, IEEE Transactions on Signal Processing.

[30]  John N. Tsitsiklis,et al.  Convergence Speed in Distributed Consensus and Averaging , 2009, SIAM J. Control. Optim..

[31]  Michael G. Rabbat,et al.  Push-Sum Distributed Dual Averaging for convex optimization , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[32]  Wei Shi,et al.  Push–Pull Gradient Methods for Distributed Optimization in Networks , 2021, IEEE Transactions on Automatic Control.

[33]  Usman A. Khan,et al.  A Linear Algorithm for Optimization Over Directed Graphs With Geometric Convergence , 2018, IEEE Control Systems Letters.

[34]  Christoforos N. Hadjicostis,et al.  Distributed strategies for average consensus in directed graphs , 2011, IEEE Conference on Decision and Control and European Control Conference.

[35]  Angelia Nedic,et al.  Distributed Stochastic Subgradient Projection Algorithms for Convex Optimization , 2008, J. Optim. Theory Appl..

[36]  Usman A. Khan,et al.  DEXTRA: A Fast Algorithm for Optimization Over Directed Graphs , 2017, IEEE Transactions on Automatic Control.

[37]  Johannes Gehrke,et al.  Gossip-based computation of aggregate information , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[38]  John N. Tsitsiklis,et al.  Problems in decentralized decision making and computation , 1984 .

[39]  Wei Shi,et al.  Achieving Geometric Convergence for Distributed Optimization Over Time-Varying Graphs , 2016, SIAM J. Optim..

[40]  Annie I-An Chen,et al.  Fast Distributed First-Order Methods , 2012 .

[41]  Martin J. Wainwright,et al.  Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling , 2010, IEEE Transactions on Automatic Control.

[42]  Na Li,et al.  Accelerated Distributed Nesterov Gradient Descent , 2017, IEEE Transactions on Automatic Control.

[43]  Angelia Nedic,et al.  Distributed Optimization Over Time-Varying Directed Graphs , 2015, IEEE Trans. Autom. Control..