Paper Information - Tracking Moving Agents via Inexact Online Gradient Descent Algorithm

Multiagent systems are being increasingly deployed in challenging environments for performing complex tasks such as multitarget tracking, search-and-rescue, and intrusion detection. Notwithstanding the computational limitations of individual robots, such systems rely on collaboration to sense and react to the environment. This paper formulates the generic target tracking problem as a time-varying optimization problem and puts forth an inexact online gradient descent method for solving it sequentially. The performance of the proposed algorithm is studied by characterizing its dynamic regret, a notion common in the online learning literature. Building upon existing results, we provide improved regret rates that not only allow nonstrongly convex costs but also explain the role of the cumulative gradient error. Two distinct classes of problems are considered: one in which the objective function adheres to a quadratic growth condition, and another in which the objective function is convex but the variable belongs to a compact domain. For both cases, results are developed while allowing the error to be either adversarial or arising from a white noise process. Further, the generality of the proposed framework is demonstrated by developing online variants of existing stochastic gradient algorithms and interpreting them as special cases of the proposed inexact gradient method. The efficacy of the proposed inexact gradient framework is established on a multiagent multitarget tracking problem, while its flexibility is exemplified by generating online movie recommendations for the MovieLens 10M dataset.
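To make the setting in the abstract concrete, the following is a minimal sketch of an online gradient step driven by an inexact gradient, together with the dynamic regret, i.e. the accumulated gap between f_t(x_t) and the per-step optimum f_t(x_t^*). It is not the paper's implementation: the quadratic tracking cost, the Gaussian gradient error, the circular target trajectory, and the names `inexact_ogd`, `step`, and `noise_std` are all assumptions chosen for illustration.

```python
import math
import random


def inexact_ogd(targets, step=0.5, noise_std=0.1, seed=0):
    """Track a moving target using gradient steps with a noisy gradient.

    Assumed setup (illustrative only): the cost at time t is
    f_t(x) = 0.5 * ||x - targets[t]||^2, whose minimizer is targets[t],
    and the gradient error is zero-mean Gaussian noise.
    Returns the iterates and the dynamic regret
    sum_t [f_t(x_t) - f_t(x_t^*)].
    """
    rng = random.Random(seed)
    x = [0.0] * len(targets[0])
    iterates, regret = [], 0.0
    for target in targets:
        iterates.append(list(x))
        # f_t(x_t^*) = 0 for this cost, so the regret accumulates f_t(x_t).
        regret += 0.5 * sum((xi - ti) ** 2 for xi, ti in zip(x, target))
        # Inexact gradient: true gradient (x - target) plus an error term.
        grad = [xi - ti + rng.gauss(0.0, noise_std)
                for xi, ti in zip(x, target)]
        # One gradient step per time slot, as in a sequential solver.
        x = [xi - step * gi for xi, gi in zip(x, grad)]
    return iterates, regret


# Hypothetical trajectory: a target moving slowly along the unit circle.
targets = [(math.cos(0.01 * t), math.sin(0.01 * t)) for t in range(500)]
iterates, regret = inexact_ogd(targets)
print(f"dynamic regret over {len(targets)} steps: {regret:.3f}")
```

In this toy setup the regret grows with both the target's movement and the gradient error, which is the qualitative dependence the abstract attributes to the cumulative gradient error. Setting `noise_std=0.0` recovers exact online gradient descent for comparison.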
