EXACT NONPARAMETRIC DECENTRALIZED ONLINE OPTIMIZATION

We consider online learning over decentralized networks, where nodes observe unique, possibly correlated, observation stream. We focus on the case where agents learn a regression function that belongs to a reproducing kernel Hilbert space (RKHS). In this setting, a decentralized network aims to learn nonlinear statistical models that are optimal in terms of a global stochastic convex functional that aggregates data across the network, with only access to a local data stream. We incentivize coordination while respecting network heterogeneity through the introduction of nonlinear proximity constraints. To solve it, we propose applying a functional variant of stochastic primal-dual (Arrow-Hurwicz) method which yields a decentralized algorithm. To handle the fact that the RKHS parameterization has complexity comparable to the iteration index, we project the primal iterates onto Hilbert subspaces that are greedily constructed from the observation sequence of each node. The resulting proximal stochastic variant of Arrow-Hurwicz is shown to converge in expectation, both in terms of primal sub-optimality and constraint violation to a neighborhood that depends on a given constant step-size selection. Experiments on a correlated random field estimation problem validate our theoretical results.

[1]  Richard L. Wheeden Measure and integral , 1977 .

[2]  L. Eon Bottou Online Learning and Stochastic Approximations , 1998 .

[3]  Asuman E. Ozdaglar,et al.  Distributed Subgradient Methods for Multi-Agent Optimization , 2009, IEEE Transactions on Automatic Control.

[4]  Brian M. Sadler,et al.  Information Retrieval and Processing in Sensor Networks: Deterministic Scheduling Versus Random Access , 2007, IEEE Transactions on Signal Processing.

[5]  Alejandro Ribeiro,et al.  Ergodic Stochastic Optimization Algorithms for Wireless Communication and Networking , 2010, IEEE Transactions on Signal Processing.

[6]  Shahin Shahrampour,et al.  Distributed Online Optimization in Dynamic Environments Using Mirror Descent , 2016, IEEE Transactions on Automatic Control.

[7]  Brian M. Sadler,et al.  Proximity Without Consensus in Online Multiagent Optimization , 2016, IEEE Transactions on Signal Processing.

[8]  Léon Bottou,et al.  On-line learning and stochastic approximations , 1999 .

[9]  Sergios Theodoridis,et al.  Online Learning in Reproducing Kernel Hilbert Spaces , 2014 .

[10]  Angelia Nedic,et al.  Decentralized online optimization with global objectives and local communication , 2015, 2015 American Control Conference (ACC).

[11]  Saswati Sarkar,et al.  Pricing for Profit in Internet of Things , 2019, IEEE Transactions on Network Science and Engineering.

[12]  Michael Elad,et al.  Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[13]  Bernhard Schölkopf,et al.  A Generalized Representer Theorem , 2001, COLT/EuroCOLT.

[14]  Pascal Vincent,et al.  Kernel Matching Pursuit , 2002, Machine Learning.

[15]  Sudheendra Hangal,et al.  PrPl: a decentralized social networking infrastructure , 2010, MCS '10.

[16]  Sergios Theodoridis,et al.  Special Issue on Advances in Kernel-Based Learning for Signal Processing [From the Guest Editors] , 2013, IEEE Signal Process. Mag..

[17]  Ketan Rajawat,et al.  Asynchronous Incremental Stochastic Dual Descent Algorithm for Network Resource Allocation , 2017, IEEE Transactions on Signal Processing.

[18]  Martin J. Wainwright,et al.  Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling , 2010, IEEE Transactions on Automatic Control.

[19]  Vijay Kumar,et al.  A Multi-robot Control Policy for Information Gathering in the Presence of Unknown Hazards , 2011, ISRR.

[20]  Hao Zhu,et al.  Projected Stochastic Primal-Dual Method for Constrained Online Learning with Kernels , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[21]  Jeng-Shyang Pan,et al.  Kernel Learning Algorithms for Face Recognition , 2013 .

[22]  Furong Huang,et al.  Escaping From Saddle Points - Online Stochastic Gradient for Tensor Decomposition , 2015, COLT.

[23]  Ketan Rajawat,et al.  Beyond Consensus and Synchrony in Online Network Optimization via Saddle Point Method , 2017 .

[24]  Qing Ling,et al.  On the Linear Convergence of the ADMM in Decentralized Consensus Optimization , 2013, IEEE Transactions on Signal Processing.

[25]  Alejandro Ribeiro,et al.  Parsimonious Online Learning with Kernels via sparse projections in function space , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[26]  Angelia Nedic,et al.  Distributed Stochastic Subgradient Projection Algorithms for Convex Optimization , 2008, J. Optim. Theory Appl..

[27]  Vladimir I. Norkin,et al.  On Stochastic Optimization and Statistical Learning in Reproducing Kernel Hilbert Spaces by Support Vector Machines (SVM) , 2009, Informatica.

[28]  Cédric Richard,et al.  Decentralized Online Learning With Kernels , 2017, IEEE Transactions on Signal Processing.

[29]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[30]  Ketan Rajawat,et al.  Tracking Moving Agents via Inexact Online Gradient Descent Algorithm , 2017, IEEE Journal of Selected Topics in Signal Processing.

[31]  Mehran Mesbahi,et al.  Online distributed optimization via dual averaging , 2013, 52nd IEEE Conference on Decision and Control.

[32]  Alexander Shapiro,et al.  Stochastic Approximation approach to Stochastic Programming , 2013 .

[33]  Bahman Gharesifard,et al.  Distributed Online Convex Optimization on Time-Varying Directed Graphs , 2017, IEEE Transactions on Control of Network Systems.

[34]  H. Robbins A Stochastic Approximation Method , 1951 .

[35]  Alejandro Ribeiro,et al.  A Saddle Point Algorithm for Networked Online Convex Optimization , 2014, IEEE Transactions on Signal Processing.

[36]  Brian M. Sadler,et al.  Source localization with distributed sensor arrays and partial spatial coherence , 2000, IEEE Transactions on Signal Processing.

[37]  A. Zygmund,et al.  Measure and integral : an introduction to real analysis , 1977 .

[38]  Hanif D. Sherali,et al.  Algorithm design for femtocell base station placement in commercial building environments , 2012, 2012 Proceedings IEEE INFOCOM.

[39]  Prateek Jain,et al.  Non-convex Optimization for Machine Learning , 2017, Found. Trends Mach. Learn..

[40]  Michael M. Zavlanos,et al.  Distributed primal-dual methods for online constrained optimization , 2016, 2016 American Control Conference (ACC).