论文信息 - Adversarial Multi-Agent Target Tracking with Inexact Online Gradient Descent

Adversarial Multi-Agent Target Tracking with Inexact Online Gradient Descent

Multi-agent systems are being increasingly deployed in challenging environments for performing complex tasks such as multi-target tracking, search-and-rescue, and intrusion detection. This paper formulates the generic target tracking problem as a time-varying optimization problem and puts forth an inexact online gradient descent method for solving it sequentially. The performance of the proposed algorithm is studied by characterizing its dynamic regret, a notion common to the online learning literature. Building upon the existing results, we provide improved regret rates that not only allow non-strongly convex costs but also explicating the role of the cumulative gradient error. The objective function is convex but the variable belongs to a compact domain. The efficacy of the proposed inexact gradient framework is established on a multi-agent multi-target tracking problem.

[1] Geert Leus,et al. On non-differentiable time-varying optimization , 2015, 2015 IEEE 6th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[2] P. Tseng,et al. On the linear convergence of descent methods for convex essentially smooth minimization , 1992 .

[3] Jinfeng Yi,et al. Tracking Slowly Moving Clairvoyant: Optimal Dynamic Regret of Online Learning with True and Noisy Gradient , 2016, ICML.

[4] M. Ani Hsieh,et al. An Optimal Approach to Collaborative Target Tracking with Performance Guarantees , 2009, J. Intell. Robotic Syst..

[5] Shahin Shahrampour,et al. Online Optimization : Competing with Dynamic Comparators , 2015, AISTATS.

[6] Anthony Man-Cho So,et al. Non-asymptotic convergence analysis of inexact gradient methods for machine learning without strong convexity , 2013, Optim. Methods Softw..

[7] Judy Kay,et al. Clustering and Sequential Pattern Mining of Online Collaborative Learning Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[8] Ali H. Sayed,et al. On the Influence of Informed Agents on Learning and Adaptation Over Networks , 2012, IEEE Transactions on Signal Processing.

[9] Rebecca Willett,et al. Online Convex Optimization in Dynamic Environments , 2015, IEEE Journal of Selected Topics in Signal Processing.

[10] Alfred O. Hero,et al. A Convergent Incremental Gradient Method with a Constant Step Size , 2007, SIAM J. Optim..

[11] Jinfeng Yi,et al. Improved Dynamic Regret for Non-degenerate Functions , 2016, NIPS.

[12] Stergios I. Roumeliotis,et al. Multirobot Active Target Tracking With Combinations of Relative Observations , 2011, IEEE Transactions on Robotics.

[13] L. Rosasco,et al. Convergence of Stochastic Proximal Gradient Algorithm , 2014, Applied Mathematics & Optimization.

[14] Benoît Champagne,et al. Estimation of Space-Time Varying Parameters Using a Diffusion LMS Algorithm , 2014, IEEE Transactions on Signal Processing.

[15] Martin Zinkevich,et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[16] Aryan Mokhtari,et al. Optimization in Dynamic Environments : Improved Regret Rates for Strongly Convex Problems , 2016 .

[17] Francis Bach,et al. SAGA: A Fast Incremental Gradient Method With Support for Non-Strongly Convex Composite Objectives , 2014, NIPS.

[18] Tong Zhang,et al. Accelerating Stochastic Gradient Descent using Predictive Variance Reduction , 2013, NIPS.

[19] Mark W. Schmidt,et al. A Stochastic Gradient Method with an Exponential Convergence Rate for Finite Training Sets , 2012, NIPS.

[20] Aryan Mokhtari,et al. Decentralized Prediction-Correction Methods for Networked Time-Varying Convex Optimization , 2016, IEEE Transactions on Automatic Control.

[21] John N. Tsitsiklis,et al. Gradient Convergence in Gradient methods with Errors , 1999, SIAM J. Optim..

[22] Mark W. Schmidt,et al. Hybrid Deterministic-Stochastic Methods for Data Fitting , 2011, SIAM J. Sci. Comput..

[23] Shahin Shahrampour,et al. Distributed Online Optimization in Dynamic Environments Using Mirror Descent , 2016, IEEE Transactions on Automatic Control.

[24] Omar Besbes,et al. Non-Stationary Stochastic Optimization , 2013, Oper. Res..

[25] Pieter Abbeel,et al. An Application of Reinforcement Learning to Aerobatic Helicopter Flight , 2006, NIPS.

[26] Hui Zhang,et al. Restricted strong convexity and its applications to convergence analysis of gradient-type methods in convex optimization , 2015, Optim. Lett..

[27] K.P. Valavanis,et al. Unmanned helicopter waypoint trajectory tracking using model predictive control , 2007, 2007 Mediterranean Conference on Control & Automation.