How is Distributed ADMM Affected by Network Topology

When solving consensus optimization problems over a graph, there is often an explicit characterization of the convergence rate of Gradient Descent (GD) using the spectrum of the graph Laplacian. The same type of problems under the Alternating Direction Method of Multipliers (ADMM) are, however, poorly understood. For instance, simple but important non-strongly-convex consensus problems have not yet being analyzed, especially concerning the dependency of the convergence rate on the graph topology. Recently, for a non-strongly-convex consensus problem, a connection between distributed ADMM and lifted Markov chains was proposed, followed by a conjecture that ADMM is faster than GD by a square root factor in its convergence time, in close analogy to the mixing speedup achieved by lifting several Markov chains. Nevertheless, a proof of such a claim is is still lacking. Here we provide a full characterization of the convergence of distributed over-relaxed ADMM for the same type of consensus problem in terms of the topology of the underlying graph. Our results provide explicit formulas for optimal parameter selection in terms of the second largest eigenvalue of the transition matrix of the graph's random walk. Another consequence of our results is a proof of the aforementioned conjecture, which interestingly, we show it is valid for any graph, even the ones whose random walks cannot be accelerated via Markov chain lifting.

[1]  Wotao Yin,et al.  Global Convergence of ADMM in Nonconvex Nonsmooth Optimization , 2015, Journal of Scientific Computing.

[2]  Zheng Xu,et al.  Adaptive Consensus ADMM for Distributed Optimization , 2017, ICML.

[3]  Zheng Xu,et al.  Adaptive Relaxed ADMM: Convergence Theory and Practical Implementation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Guilherme França,et al.  Markov Chain Lifting and Distributed ADMM , 2017, IEEE Signal Processing Letters.

[5]  Zheng Xu,et al.  Adaptive ADMM with Spectral Penalty Parameter Selection , 2016, AISTATS.

[6]  Asuman E. Ozdaglar,et al.  Convergence Rate of Distributed ADMM Over Networks , 2016, IEEE Transactions on Automatic Control.

[7]  Stephen P. Boyd,et al.  Linear Convergence and Metric Selection for Douglas-Rachford Splitting and ADMM , 2014, IEEE Transactions on Automatic Control.

[8]  Wotao Yin,et al.  Faster Convergence Rates of Relaxed Peaceman-Rachford and ADMM Under Regularity Assumptions , 2014, Math. Oper. Res..

[9]  Zhi-Quan Luo,et al.  On the linear convergence of the alternating direction method of multipliers , 2012, Mathematical Programming.

[10]  Zheng Xu,et al.  An Empirical Study of ADMM for Nonconvex Problems , 2016, ArXiv.

[11]  Qing Ling,et al.  Communication-efficient weighted ADMM for decentralized network optimization , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Wotao Yin,et al.  On the Global and Linear Convergence of the Generalized Alternating Direction Method of Multipliers , 2016, J. Sci. Comput..

[13]  Guilherme França,et al.  An explicit rate bound for over-relaxed ADMM , 2015, 2016 IEEE International Symposium on Information Theory (ISIT).

[14]  Damek Davis,et al.  Convergence Rate Analysis of Several Splitting Schemes , 2014, 1406.4834.

[15]  Pascal Bianchi,et al.  Explicit Convergence Rate of a Distributed Alternating Direction Method of Multipliers , 2013, IEEE Transactions on Automatic Control.

[16]  Aryan Mokhtari,et al.  Decentralized quadratically approximated alternating direction method of multipliers , 2015, 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[17]  Qing Ling,et al.  DLM: Decentralized Linearized Alternating Direction Method of Multipliers , 2015, IEEE Transactions on Signal Processing.

[18]  Michael I. Jordan,et al.  A General Analysis of the Convergence of ADMM , 2015, ICML.

[19]  A. Raghunathan,et al.  ADMM for Convex Quadratic Programs: Linear Convergence and Infeasibility Detection , 2014, 1411.7288.

[20]  Euhanna Ghadimi,et al.  Optimal Parameter Selection for the Alternating Direction Method of Multipliers (ADMM): Quadratic Problems , 2013, IEEE Transactions on Automatic Control.

[21]  Pascal Bianchi,et al.  Linear convergence rate for distributed optimization with the alternating direction method of multipliers , 2014, 53rd IEEE Conference on Decision and Control.

[22]  Stephen P. Boyd,et al.  Diagonal scaling in Douglas-Rachford splitting and ADMM , 2014, 53rd IEEE Conference on Decision and Control.

[23]  Euhanna Ghadimi,et al.  The ADMM algorithm for distributed averaging: Convergence rates and optimal parameter selection , 2014, 2014 48th Asilomar Conference on Signals, Systems and Computers.

[24]  Asuman E. Ozdaglar,et al.  Broadcast-based distributed alternating direction method of multipliers , 2014, 2014 52nd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[25]  Qing Ling,et al.  On the Linear Convergence of the ADMM in Decentralized Consensus Optimization , 2013, IEEE Transactions on Signal Processing.

[26]  A. Raghunathan,et al.  Optimal Step-Size Selection in Alternating Direction Method of Multipliers for Convex Quadratic Programs and Model Predictive Control , 2014 .

[27]  Xiaoming Yuan,et al.  Local Linear Convergence of the Alternating Direction Method of Multipliers for Quadratic Programs , 2013, SIAM J. Numer. Anal..

[28]  Javier Alonso-Mora,et al.  A message-passing algorithm for multi-agent trajectory planning , 2013, NIPS.

[29]  Daniel Boley,et al.  Local Linear Convergence of the Alternating Direction Method of Multipliers on Quadratic or Linear Programs , 2013, SIAM J. Optim..

[30]  Nate Derbinsky,et al.  An Improved Three-Weight Message-Passing Algorithm , 2013, ArXiv.

[31]  Henrik Sandberg,et al.  Optimal scaling of the ADMM algorithm for distributed quadratic programming , 2013, 52nd IEEE Conference on Decision and Control.

[32]  René Vidal,et al.  Riemannian Consensus for Manifolds With Bounded Curvature , 2012, IEEE Transactions on Automatic Control.

[33]  V. Climenhaga Markov chains and mixing times , 2013 .

[34]  Asuman E. Ozdaglar,et al.  Distributed Alternating Direction Method of Multipliers , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[35]  Emiliano Dall'Anese,et al.  Fast Consensus by the Alternating Direction Multipliers Method , 2011, IEEE Transactions on Signal Processing.

[36]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[37]  Kyomin Jung,et al.  Distributed Averaging Via Lifted Markov Chains , 2009, IEEE Transactions on Information Theory.

[38]  Wen J. Li,et al.  Location-Aided Fast Distributed Consensus in Wireless Networks , 2010, IEEE Transactions on Information Theory.

[39]  Andreas Terzis,et al.  Distributed pose averaging in camera networks via consensus on SE(3) , 2008, 2008 Second ACM/IEEE International Conference on Distributed Smart Cameras.

[40]  Richard M. Murray,et al.  Consensus problems in networks of agents with switching topology and time-delays , 2004, IEEE Transactions on Automatic Control.

[41]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[42]  B. He,et al.  Alternating Direction Method with Self-Adaptive Penalty Parameters for Monotone Variational Inequalities , 2000 .

[43]  László Lovász,et al.  Faster mixing via average conductance , 1999, STOC '99.

[44]  Fang Chen,et al.  Lifting Markov chains to speed up mixing , 1999, STOC '99.

[45]  J. Cheeger A lower bound for the smallest eigenvalue of the Laplacian , 1969 .