论文信息 - Trajectory prediction: learning to map situations to robot trajectories

Trajectory prediction: learning to map situations to robot trajectories

Trajectory planning and optimization is a fundamental problem in articulated robotics. Algorithms used typically for this problem compute optimal trajectories from scratch in a new situation. In effect, extensive data is accumulated containing situations together with the respective optimized trajectories - but this data is in practice hardly exploited. The aim of this paper is to learn from this data. Given a new situation we want to predict a suitable trajectory which only needs minor refinement by a conventional optimizer. Our approach has two essential ingredients. First, to generalize from previous situations to new ones we need an appropriate situation descriptor - we propose a sparse feature selection approach to find such well-generalizing features of situations. Second, the transfer of previously optimized trajectories to a new situation should not be made in joint angle space - we propose a more efficient task space transfer of old trajectories to new situations. Experiments on a simulated humanoid reaching problem show that we can predict reasonable motion prototypes in new situations for which the refinement is much faster than an optimization from scratch.

Marc Toussaint | Nikolay Jetchev | Marc Toussaint | Nikolay Jetchev

[1] Yoshihiko Nakamura,et al. Optimal Redundancy Control of Robot Manipulators , 1987 .

[2] David G. Lowe,et al. Similarity Metric Learning for a Variable-Kernel Classifier , 1995, Neural Computation.

[3] Lydia E. Kavraki,et al. Randomized query processing in robot path planning , 1995, STOC '95.

[4] Jianwei Zhang,et al. An Enhanced Optimization Approach for Generating Smooth Robot Trajectories in the Presence of Obstacles , 1995 .

[5] Kazuo Hiraki,et al. From Egocentric to Allocentric Spatial Behavior: A Computational Model of Spatial Development , 1998, Adapt. Behav..

[6] Edwin D. de Jong,et al. Context-based policy search: transfer of experience across problems , 2002 .

[7] James Theiler,et al. Grafting: Fast, Incremental Feature Selection by Gradient Descent in Function Space , 2003, J. Mach. Learn. Res..

[8] Ubbo Visser,et al. Egocentric qualitative spatial knowledge representation for physical robots , 2004, Robotics Auton. Syst..

[9] Ronan Boulic,et al. An inverse kinematics architecture enforcing an arbitrary number of strict priority levels , 2004, The Visual Computer.

[10] E. Todorov,et al. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..

[11] Aude Billard,et al. Recognition and reproduction of gestures using a probabilistic framework combining PCA, ICA and HMM , 2005, ICML.

[12] A. Atiya,et al. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[13] Andrew G. Barto,et al. Autonomous shaping: knowledge transfer in reinforcement learning , 2006, ICML.

[14] Jian Li,et al. Iterative RELIEF for feature weighting , 2006, ICML.

[15] Tamim Asfour,et al. An integrated approach to inverse kinematics and path planning for redundant manipulators , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[16] Gerd Hirzinger,et al. Capturing robot workspace structure: representing robot capabilities , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17] Rajesh P. N. Rao,et al. Towards a Real-Time Bayesian Imitation System for a Humanoid Robot , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[18] Sean R. Martin,et al. Offline and Online Evolutionary Bi-Directional RRT Algorithms for Efficient Re-Planning in Dynamic Environments , 2007, 2007 IEEE International Conference on Automation Science and Engineering.

[19] M. Stolle,et al. Knowledge Transfer Using Local Features , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.

[20] Glenn Fung,et al. Feature Selection and Kernel Design via Linear Programming , 2007, IJCAI.

[21] Mark W. Schmidt,et al. Fast Optimization Methods for L1 Regularization: A Comparative Study and Two New Approaches , 2007, ECML.

[22] Ross A. Knepper,et al. Path and trajectory diversity: Theory and algorithms , 2008, 2008 IEEE International Conference on Robotics and Automation.

[23] Konrad Paul Kording,et al. Estimating the sources of motor errors for adaptation and generalization , 2008, Nature Neuroscience.

[24] Sean R. Martin,et al. Evolutionary Bi-Directional RRT Algorithms for Efficient RePlanning in Dynamic Environments , .