Online Trajectory Optimization Using Inexact Gradient Feedback for Time-Varying Environments

This article considers the problem of online trajectory design under time-varying environments. We formulate the general trajectory optimization problem within the framework of time-varying constrained convex optimization and propose a novel version of online gradient ascent algorithm (OGA) for such problems. Respecting the online nature, we carefully select the step size of OGA at each iteration so that the iterates stay feasible. Importantly, the proposed algorithm allows noisy gradients, expanding the range of practical applicability. In contrast to the most available literature, we present the offline sublinear regret of OGA up to the path length variations of the offline optimal solution, the cumulative gradient, and the error in the gradient variations. Furthermore, we establish a lower-bound on the offline dynamic regret, which defines the optimality of any trajectory. To show the efficacy of the proposed algorithm, we consider two practical problems of interest. First, a device to device (D2D) communications setting, where the goal is to design a user trajectory while maximizing its connectivity to the internet. Second, planning energy-efficient trajectories for unmanned surface vehicles (USV) under strong disturbances in ocean environments. Different from the state-of-the-art trajectory planning algorithms that entail planning and re-planning the full trajectory using the forecast data at each time instant, the proposed algorithm is entirely online and relies mostly on the ocean velocity measurements at the vehicle location. The detailed simulation results demonstrate the significance of the proposed algorithm on both synthetic and real data sets. Video result is available at https://tinyurl.com/y3ahmhsf.

[1]  Hyun Myung,et al.  Energy efficient path planning for a marine surface vehicle considering heading angle , 2015 .

[2]  Amit Konar,et al.  A Deterministic Improved Q-Learning for Path Planning of a Mobile Robot , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[3]  Gaurav S. Sukhatme,et al.  Planning and Implementing Trajectories for Autonomous Underwater Vehicles to Track Evolving Ocean Processes Based on Predictions from a Regional Ocean Model , 2010, Int. J. Robotics Res..

[4]  Hao Yu,et al.  A Low Complexity Algorithm with $O(\sqrt{T})$ Regret and Finite Constraint Violations for Online Convex Optimization with Long Term Constraints , 2016, ArXiv.

[5]  Ketan Rajawat,et al.  Tracking Moving Agents via Inexact Online Gradient Descent Algorithm , 2017, IEEE Journal of Selected Topics in Signal Processing.

[6]  Mohan Krishna Nutalapati,et al.  Online Utility-Optimal Trajectory Design for Time-Varying Ocean Environments , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[7]  Ryan N. Smith,et al.  Wind-energy based path planning for Unmanned Aerial Vehicles using Markov Decision Processes , 2013, 2013 IEEE International Conference on Robotics and Automation.

[8]  Geoffrey A. Hollinger,et al.  Planning Energy-Efficient Trajectories in Strong Disturbances , 2017, IEEE Robotics and Automation Letters.

[9]  Qing Wang,et al.  A Survey on Device-to-Device Communication in Cellular Networks , 2013, IEEE Communications Surveys & Tutorials.

[10]  Marceau Coupechoux,et al.  An Online Approach to D2D Trajectory Utility Maximization Problem , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[11]  Ketan Rajawat,et al.  Online Learning With Inexact Proximal Online Gradient Descent Algorithms , 2018, IEEE Transactions on Signal Processing.

[12]  Francesco Borrelli,et al.  A linear time varying model predictive control approach to the integrated vehicle dynamics control problem in autonomous systems , 2007, 2007 46th IEEE Conference on Decision and Control.

[13]  Siddhartha S. Srinivasa,et al.  CHOMP: Gradient optimization techniques for efficient motion planning , 2009, 2009 IEEE International Conference on Robotics and Automation.

[14]  Rebecca Willett,et al.  Online Convex Optimization in Dynamic Environments , 2015, IEEE Journal of Selected Topics in Signal Processing.

[15]  Robin De Keyser,et al.  Heuristic approaches in robot path planning: A survey , 2016, Robotics Auton. Syst..

[16]  Dong Sun,et al.  Coordinated Motion Planning for Multiple Mobile Robots Along Designed Paths With Formation Requirement , 2011, IEEE/ASME Transactions on Mechatronics.

[17]  Gabriel Oliver,et al.  Path Planning of Autonomous Underwater Vehicles in Current Fields with Complex Spatial Variability: an A* Approach , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[18]  Federico Pecora,et al.  Integrated Motion Planning and Coordination for Industrial Vehicles , 2014, ICAPS.

[19]  Qing Ling,et al.  An Online Convex Optimization Approach to Proactive Network Resource Allocation , 2017, IEEE Transactions on Signal Processing.

[20]  Stefan Schaal,et al.  STOMP: Stochastic trajectory optimization for motion planning , 2011, 2011 IEEE International Conference on Robotics and Automation.

[21]  Arindam Banerjee,et al.  Online Alternating Direction Method , 2012, ICML.

[22]  Giuseppe Caire,et al.  Wireless Device-to-Device Caching Networks: Basic Principles and System Performance , 2013, IEEE Journal on Selected Areas in Communications.

[23]  Lijun Zhang,et al.  Adaptive Online Learning in Dynamic Environments , 2018, NeurIPS.

[24]  Zhong Fan,et al.  Emerging technologies and research challenges for 5G wireless networks , 2014, IEEE Wireless Communications.

[25]  Nikos A. Vlassis,et al.  A point-based POMDP algorithm for robot planning , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[26]  Alexander F. Shchepetkin,et al.  The regional oceanic modeling system (ROMS): a split-explicit, free-surface, topography-following-coordinate oceanic model , 2005 .

[27]  Rustam Stolkin,et al.  Optimal AUV path planning for extended missions in complex, fast-flowing estuarine environments , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[28]  R. Takei,et al.  A practical path-planning algorithm for a simple car: a Hamilton-Jacobi approach , 2010, Proceedings of the 2010 American Control Conference.

[29]  A. Matveev,et al.  Algorithms for collision-free navigation of mobile robots in complex cluttered environments: a survey , 2014, Robotica.

[30]  Byron Boots,et al.  Continuous-time Gaussian process motion planning via probabilistic inference , 2017, Int. J. Robotics Res..

[31]  Lydia E. Kavraki,et al.  Motion Planning With Dynamics by a Synergistic Combination of Layers of Planning , 2010, IEEE Transactions on Robotics.

[32]  Martin Zinkevich,et al.  Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[33]  Mahdi Fakoor,et al.  Humanoid Robot Path Planning with Fuzzy Markov Decision Processes , 2016 .

[34]  Aryan Mokhtari,et al.  Optimization in Dynamic Environments : Improved Regret Rates for Strongly Convex Problems , 2016 .

[35]  Matthew Dunbabin,et al.  Go with the flow : optimal AUV path planning in coastal environments , 2009, ICRA 2009.

[36]  Thierry Fraichard,et al.  Trajectory planning in a dynamic workspace: a 'state-time space' approach , 1998, Adv. Robotics.

[37]  Ryan N. Smith,et al.  Predictive motion planning for AUVs subject to strong time-varying currents and forecasting uncertainties , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[38]  Arindam Banerjee,et al.  Online Alternating Direction Method (longer version) , 2013, ArXiv.

[39]  Hui X. Li,et al.  A probabilistic approach to optimal robust path planning with obstacles , 2006, 2006 American Control Conference.

[40]  Pierre FJ Lermusiaux,et al.  Time-optimal path planning in dynamic flows using level set equations: theory and schemes , 2014, Ocean Dynamics.

[41]  Yacine Ghamri-Doudane,et al.  On the feasibility of WiFi offloading in urban areas: The Paris case study , 2013, 2013 IFIP Wireless Days (WD).

[42]  Pieter Abbeel,et al.  Motion planning with sequential convex optimization and convex collision checking , 2014, Int. J. Robotics Res..

[43]  Anthony Stentz,et al.  Using interpolation to improve path planning: The Field D* algorithm , 2006, J. Field Robotics.

[44]  M. Diehl,et al.  Time-energy optimal path tracking for robots: a numerically efficient optimization approach , 2008, 2008 10th IEEE International Workshop on Advanced Motion Control.

[45]  Ketan Rajawat,et al.  Adversarial Multi-Agent Target Tracking with Inexact Online Gradient Descent , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[46]  Dizan Vasquez,et al.  Novel planning-based algorithms for human motion prediction , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[47]  Elad Hazan,et al.  Logarithmic regret algorithms for online convex optimization , 2006, Machine Learning.

[48]  Jinfeng Yi,et al.  Improved Dynamic Regret for Non-degenerate Functions , 2016, NIPS.

[49]  Ying-Dar Lin,et al.  Multihop cellular: a new architecture for wireless communications , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[50]  Cédric Archambeau,et al.  Adaptive Algorithms for Online Convex Optimization with Long-term Constraints , 2015, ICML.

[51]  Deepak N. Subramani,et al.  Energy-optimal path planning by stochastic dynamically orthogonal level-set optimization , 2016 .

[52]  Hao Yu,et al.  Constraint Violations for Online Convex Optimization with Long Term Constraints , 2020 .

[53]  Omar Besbes,et al.  Non-Stationary Stochastic Optimization , 2013, Oper. Res..

[54]  Paul J. Martin,et al.  Description of the Navy Coastal Ocean Model Version 1.0 , 2000 .

[55]  Geoffrey A. Hollinger,et al.  Learning Uncertainty in Ocean Current Predictions for Safe and Reliable Navigation of Underwater Vehicles , 2015, J. Field Robotics.

[56]  Derrick Wing Kwan Ng,et al.  Joint Trajectory and Resource Allocation Design for UAV Communication Systems , 2018, 2018 IEEE Globecom Workshops (GC Wkshps).

[57]  Karl Sammut,et al.  A survey on path planning for persistent autonomy of autonomous underwater vehicles , 2015 .

[58]  M. Ani Hsieh,et al.  Time and Energy Optimal Path Planning in General Flows , 2016, Robotics: Science and Systems.

[59]  E. Kim,et al.  Model predictive control strategy for smooth path tracking of autonomous vehicles with steering actuator dynamics , 2014 .

[60]  Pieter Abbeel,et al.  LQG-MP: Optimized path planning for robots with motion uncertainty and imperfect state information , 2010, Int. J. Robotics Res..

[61]  Maxim Likhachev,et al.  Planning Long Dynamically Feasible Maneuvers for Autonomous Vehicles , 2008, Int. J. Robotics Res..

[62]  Rosli Salleh,et al.  The Future of Mobile Wireless Communication Networks , 2009, 2009 International Conference on Communication Software and Networks.