Efficient Optimization of Control Libraries

A popular approach to high dimensional control problems in robotics uses a library of candidate "maneuvers" or "trajectories". The library is either evaluated on a fixed number of candidate choices at runtime (e.g. path set selection for planning) or by iterating through a sequence of feasible choices until success is achieved (e.g. grasp selection). The performance of the library relies heavily on the content and order of the sequence of candidates. We propose a provably efficient method to optimize such libraries, leveraging recent advances in optimizing sub-modular functions of sequences. This approach is demonstrated on two important problems: mobile robot navigation and manipulator grasp set selection. In the first case, performance can be improved by choosing a subset of candidates which optimizes the metric under consideration (cost of traversal). In the second case, performance can be optimized by minimizing the depth in the list that is searched before a successful candidate is found. Our method can be used in both on-line and batch settings with provable performance guarantees, and can be run in an anytime manner to handle real-time constraints.

[1]  Sebastian Thrun,et al.  Stanley: The robot that won the DARPA Grand Challenge: Research Articles , 2006 .

[2]  Peter Auer,et al.  The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..

[3]  Ross A. Knepper,et al.  Path diversity is only part of the problem , 2009, 2009 IEEE International Conference on Robotics and Automation.

[4]  Sebastian Thrun,et al.  Junior: The Stanford entry in the Urban Challenge , 2008, J. Field Robotics.

[5]  Daniel Sidobre,et al.  Efficient models for grasp planning with a multi-fingered hand , 2012, Robotics Auton. Syst..

[6]  J. Andrew Bagnell,et al.  Anytime online novelty detection for vehicle safeguarding , 2010, 2010 IEEE International Conference on Robotics and Automation.

[7]  Dmitry Berenson,et al.  Grasp planning in complex scenes , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[8]  David M. Bradley,et al.  Learning for Autonomous Navigation , 2010, IEEE Robotics & Automation Magazine.

[9]  Takeo Kanade,et al.  Automated Construction of Robotic Manipulation Programs , 2010 .

[10]  Steven M. LaValle,et al.  Survivability: Measuring and ensuring path diversity , 2009, 2009 IEEE International Conference on Robotics and Automation.

[11]  Uriel Feige A threshold of ln n for approximating set cover (preliminary version) , 1996, STOC '96.

[12]  Manfred K. Warmuth,et al.  The weighted majority algorithm , 1989, 30th Annual Symposium on Foundations of Computer Science.

[13]  Robert B. Fisher,et al.  Ranking planar grasp configurations for a three-finger hand , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[14]  Geoffrey J. Gordon,et al.  No-Regret Reductions for Imitation Learning and Structured Prediction , 2010, ArXiv.

[15]  E. Feron,et al.  Robust hybrid control for autonomous vehicle motion planning , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[16]  Siddhartha S. Srinivasa,et al.  Imitation learning for locomotion and manipulation , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[17]  Alonzo Kelly,et al.  State Space Sampling of Feasible Motions for High Performance Mobile Robot Navigation in Highly Constrained Environments , 2007, FSR.

[18]  Matei T. Ciocarlie,et al.  Data-driven grasping with partial sensor data , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Satoru Fujishige,et al.  Submodular functions and optimization , 1991 .

[20]  Claudio Gentile,et al.  On the generalization ability of on-line learning algorithms , 2001, IEEE Transactions on Information Theory.

[21]  William Whittaker,et al.  Autonomous driving in urban environments: Boss and the Urban Challenge , 2008, J. Field Robotics.

[22]  Matthew J. Streeter,et al.  An Online Algorithm for Maximizing Submodular Functions , 2008, NIPS.

[23]  Alonzo Kelly,et al.  Optimal Sampling In the Space of Paths: Preliminary Results , 2006 .

[24]  László Lovász,et al.  Approximating Min Sum Set Cover , 2004, Algorithmica.

[25]  Steven M. LaValle,et al.  RRT-connect: An efficient approach to single-query path planning , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[26]  Christopher G. Atkeson,et al.  Policies based on trajectory libraries , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[27]  Avrim Blum,et al.  On-line Algorithms in Machine Learning , 1996, Online Algorithms.

[28]  Matei T. Ciocarlie,et al.  On-Line Interactive Dexterous Grasping , 2008, EuroHaptics.

[29]  Alonzo Kelly,et al.  Toward Reliable Off Road Autonomous Vehicles Operating in Challenging Environments , 2006, Int. J. Robotics Res..

[30]  Larry D. Jackel,et al.  The DARPA LAGR program: Goals, challenges, methodology, and phase I results , 2006, J. Field Robotics.

[31]  William Whittaker,et al.  Autonomous driving in urban environments: Boss and the Urban Challenge , 2008 .

[32]  Ross A. Knepper,et al.  Path and trajectory diversity: Theory and algorithms , 2008, 2008 IEEE International Conference on Robotics and Automation.

[33]  William Whittaker,et al.  A robust approach to high‐speed navigation for unrehearsed desert terrain , 2006, J. Field Robotics.