Paper Information - Learning from Experience in Manipulation Planning: Setting the Right Goals

In this paper, we describe a method of improving trajectory optimization based on predicting good initial guesses from previous experiences. In order to generalize to new situations, we propose a paradigm shift: predicting qualitative attributes of the trajectory that place the initial guess in the basin of attraction of a low-cost solution. We start with a key such attribute, the choice of a goal within a goal set that describes the task, and show the generalization capabilities of our method in extensive experiments on a personal robotics platform.
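The idea of choosing a goal from a goal set before optimizing can be illustrated with a minimal sketch. This is not the authors' implementation: `predicted_cost` is a hypothetical stand-in for a learned predictor, the straight-line initialization is an assumed convention, and all function names are invented for illustration.

```python
import numpy as np

def straight_line_trajectory(start, goal, n_waypoints=10):
    """Linearly interpolate from start to goal in configuration space."""
    start = np.asarray(start, dtype=float)
    goal = np.asarray(goal, dtype=float)
    # Column of interpolation weights in [0, 1], one per waypoint.
    alphas = np.linspace(0.0, 1.0, n_waypoints)[:, None]
    return (1.0 - alphas) * start + alphas * goal

def choose_goal(goal_set, predicted_cost):
    """Return the goal with the lowest predicted cost.

    predicted_cost is hypothetical: any callable mapping a goal to a score,
    standing in for a model learned from previous experience.
    """
    costs = [predicted_cost(g) for g in goal_set]
    return goal_set[int(np.argmin(costs))]

def initial_guess(start, goal_set, predicted_cost, n_waypoints=10):
    """Pick a goal from the goal set, then build an initial trajectory to it."""
    goal = choose_goal(goal_set, predicted_cost)
    return goal, straight_line_trajectory(start, goal, n_waypoints)

# Toy usage: the "predictor" here is just distance to the start configuration.
start = [0.0, 0.0]
goal_set = [[1.0, 0.0], [0.0, 2.0], [3.0, 3.0]]
toy_cost = lambda g: float(np.linalg.norm(np.asarray(g) - np.asarray(start)))
goal, trajectory = initial_guess(start, goal_set, toy_cost)
```

The resulting trajectory would then be handed to a trajectory optimizer as its starting point. The sketch only shows where a goal prediction enters that pipeline.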

Siddhartha S. Srinivasa | Anca D. Dragan | Geoffrey J. Gordon
