Active Training Trajectory Generation for Inverse Dynamics Model Learning with Deep Neural Networks

Inverse dynamics models have been used in robot control algorithms to realize a desired motion or to enhance a robot’s performance. As robot dynamics and their operating environments become more complex, there is a growing trend of learning uncertain or unknown dynamics from data. While techniques such as deep neural networks (DNNs) have been successfully used to learn inverse dynamics, it is usually implicitly assumed that the learning modules are trained on sufficiently rich datasets. In practical implementations, this assumption typically results in a trial-and-error training process, which can be inefficient or unsafe for robot applications. In this paper, we present an active trajectory generation framework that allows us to systematically design informative trajectories for training DNN inverse dynamics modules. In particular, we introduce an episode-based algorithm that integrates a spline trajectory optimization approach with DNN active learning for efficient data collection. We consider different DNN uncertainty estimation techniques and active learning heuristics in our work and illustrate the proposed active training trajectory generation approach in simulation. We show that the proposed active training trajectory generation outperforms adhoc, intuitive training approaches.

[1]  David A. Cohn,et al.  Neural Network Exploration Using Optimal Experiment Design , 1993, NIPS.

[2]  Chong-Ho Choi,et al.  Iterative learning control for discrete-time nonlinear systems , 1994 .

[3]  Christoforos Anagnostopoulos,et al.  The Automatic Neuroscientist: A framework for optimizing experimental design with closed-loop real-time fMRI , 2016, NeuroImage.

[4]  Biao Huang,et al.  System Identification , 2000, Control Theory for Physicists.

[5]  Angela P. Schoellig,et al.  Deep neural networks for improved, impromptu trajectory tracking of quadrotors , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[6]  Gerd Hirzinger,et al.  Robot excitation trajectories for dynamic parameter estimation using optimized B-splines , 2012, 2012 IEEE International Conference on Robotics and Automation.

[7]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[8]  Ajg Bert Schoofs Experimental Design and Structural Optimization , 1988 .

[9]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[10]  Angela P. Schoellig,et al.  Design of deep neural networks as add-on blocks for improving impromptu trajectory tracking , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[11]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[12]  David Cohn,et al.  Active Learning , 2010, Encyclopedia of Machine Learning.

[13]  W. J. Studden,et al.  Theory Of Optimal Experiments , 1972 .

[14]  Zoubin Ghahramani,et al.  Deep Bayesian Active Learning with Image Data , 2017, ICML.

[15]  Vijay Kumar,et al.  Minimum snap trajectory generation and control for quadrotors , 2011, 2011 IEEE International Conference on Robotics and Automation.

[16]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[17]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[18]  Giuseppe Carlo Calafiore,et al.  Robot dynamic calibration: Optimal excitation trajectories and experimental parameter estimation , 2001, J. Field Robotics.

[19]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[20]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[21]  Jan Swevers,et al.  Optimal robot excitation and identification , 1997, IEEE Trans. Robotics Autom..

[22]  Ross D. King,et al.  Active Learning for Regression Based on Query by Committee , 2007, IDEAL.

[23]  Peter J. Gawthrop,et al.  Neural networks for control systems - A survey , 1992, Autom..

[24]  Charles Blundell,et al.  Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.