Deep neural networks as add-on modules for enhancing robot performance in impromptu trajectory tracking

High-accuracy trajectory tracking is critical to many robotic applications, including search and rescue, advanced manufacturing, and industrial inspection, to name a few. Yet the unmodeled dynamics and parametric uncertainties of operating in such complex environments make it difficult to design controllers that are capable of accurately tracking arbitrary, feasible trajectories from the first attempt (i.e., impromptu trajectory tracking). This article proposes a platform-independent, learning-based “add-on” module to enhance the tracking performance of black-box control systems in impromptu tracking tasks. Our approach is to pre-cascade a deep neural network (DNN) to a stabilized baseline control system, in order to establish an identity mapping from the desired output to the actual output. Previous research involving quadrotors showed that, for 30 arbitrary hand-drawn trajectories, the DNN-enhancement control architecture reduces tracking errors by 43% on average, as compared with the baseline controller. In this article, we provide a platform-independent formulation and practical design guidelines for the DNN-enhancement approach. In particular, we: (1) characterize the underlying function of the DNN module; (2) identify necessary conditions for the approach to be effective; (3) provide theoretical insights into the stability of the overall DNN-enhancement control architecture; (4) derive a condition that supports data-efficient training of the DNN module; and (5) compare the novel theory-driven DNN design with the prior trial-and-error design using detailed quadrotor experiments. We show that, as compared with the prior trial-and-error design, the novel theory-driven design allows us to reduce the input dimension of the DNN by two thirds while achieving similar tracking performance.

[1]  Angela P. Schoellig,et al.  Provably Robust Learning-Based Approach for High-Accuracy Tracking Control of Lagrangian Systems , 2019, IEEE Robotics and Automation Letters.

[2]  David J. C. MacKay,et al.  Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.

[3]  Jessy W. Grizzle,et al.  Supervised learning for stabilizing underactuated bipedal robot locomotion, with outdoor experiments on the wave field , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Peter J. Gawthrop,et al.  Neural networks for control systems - A survey , 1992, Autom..

[5]  Jin Zhang,et al.  Neural-network control of nonaffine nonlinear system with zero dynamics by state and output feedback , 2003, IEEE Trans. Neural Networks.

[6]  A.G. Alleyne,et al.  A survey of iterative learning control , 2006, IEEE Control Systems.

[7]  R. Hirschorn Invertibility of multivariable nonlinear control systems , 1979 .

[8]  Goldie Nejat,et al.  Robotic Urban Search and Rescue: A Survey from the Control Perspective , 2013, J. Intell. Robotic Syst..

[9]  K. Åström,et al.  Revisiting the Ziegler-Nichols step response method for PID control , 2004 .

[10]  Munther A. Dahleh,et al.  Lectures on Dynamic Systems and Control , 2004 .

[11]  Stefan Schaal,et al.  Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning , 2002, Applied Intelligence.

[12]  H. Sussmann Limitations on the stabilizability of globally-minimum-phase systems , 1990 .

[13]  Angela P. Schoellig,et al.  On the construction of safe controllable regions for affine systems with applications to robotics , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[14]  Nahum Shimkin,et al.  Nonlinear Control Systems , 2008 .

[15]  R. Siegwart,et al.  A UAV system for inspection of industrial facilities , 2013, 2013 IEEE Aerospace Conference.

[16]  Raffaello D'Andrea,et al.  Optimization-based iterative learning for precise quadrocopter trajectory tracking , 2012, Autonomous Robots.

[17]  Jesse B. Hoagg,et al.  Nonminimum-phase zeros - much to do about nothing - classical control - revisited part II , 2007, IEEE Control Systems.

[18]  James M. Rehg,et al.  Aggressive Deep Driving: Combining Convolutional Neural Networks and Model Predictive Control , 2017, CoRL.

[19]  Xiao-gang Jia,et al.  Adaptive iterative learning control for robot manipulators , 2010, 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[20]  Angela P. Schoellig,et al.  Design of deep neural networks as add-on blocks for improving impromptu trajectory tracking , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[21]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[22]  Hassan K. Khalil,et al.  Adaptive control of a class of nonlinear discrete-time systems using neural networks , 1995, IEEE Trans. Autom. Control..

[23]  Benyamin Kusumoputro,et al.  Optimized direct inverse control to control altitude of a small helicopter , 2015 .

[24]  Sergey Levine,et al.  Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[25]  Gang Tao,et al.  Adaptive Neural Network Based Control of Noncanonical Nonlinear Systems , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[26]  Claire J. Tomlin,et al.  Learning quadrotor dynamics using neural network for flight control , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[27]  Tone Magister,et al.  The Autonomous Flight , 2012 .

[28]  Mitsuo Kawato,et al.  Feedback-Error-Learning Neural Network for Supervised Motor Learning , 1990 .

[29]  Pramod P. Khargonekar,et al.  Robust control theory , 1995 .

[30]  Jan Peters,et al.  Local Gaussian process regression for real-time model-based robot control , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[31]  Angela P. Schoellig,et al.  An Inversion-Based Learning Approach for Improving Impromptu Trajectory Tracking of Robots With Non-Minimum Phase Dynamics , 2018, IEEE Robotics and Automation Letters.

[32]  Wei He,et al.  Adaptive Neural Network Control of an Uncertain Robot With Full-State Constraints , 2016, IEEE Transactions on Cybernetics.

[33]  Jean-Jacques E. Slotine,et al.  The Robust Control of Robot Manipulators , 1985 .

[34]  Jan Peters,et al.  Using model knowledge for learning inverse dynamics , 2010, 2010 IEEE International Conference on Robotics and Automation.

[35]  B. Paden,et al.  Nonlinear inversion-based output tracking , 1996, IEEE Trans. Autom. Control..

[36]  Robert Babuska,et al.  Actor-critic reinforcement learning for tracking control in robotics , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[37]  Danwei W. Wang,et al.  Analysis of Nonlinear Discrete-Time Systems with Higher-Order Iterative Learning Control , 2001 .

[38]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[39]  Raffaello D'Andrea,et al.  Iterative learning of feed-forward corrections for high-performance tracking , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[40]  Bruce A. Francis,et al.  The internal model principle of control theory , 1976, Autom..

[41]  Lakmal Seneviratne,et al.  Adaptive Control Of Robot Manipulators , 1992, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems.

[42]  Zhong-Ping Jiang,et al.  Input-to-state stability for discrete-time nonlinear systems , 1999 .

[43]  Frank L. Lewis,et al.  Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics , 2014, Autom..

[44]  Bruce A. Francis,et al.  A vision-based robotic follower vehicle , 2009, Defense + Commercial Sensing.

[45]  Serdar Iplikci,et al.  Support vector machines‐based generalized predictive control , 2006 .

[46]  Chong-Ho Choi,et al.  Iterative learning control for discrete-time nonlinear systems , 1994 .

[47]  Nicolas Petit,et al.  The Navigation and Control technology inside the AR.Drone micro UAV , 2011 .

[48]  Angela P. Schoellig,et al.  Deep neural networks for improved, impromptu trajectory tracking of quadrotors , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[49]  Finale Doshi-Velez,et al.  Decomposition of Uncertainty in Bayesian Deep Learning for Efficient and Risk-sensitive Learning , 2017, ICML.

[50]  Zheng Yan,et al.  Robust Model Predictive Control of Nonlinear Systems With Unmodeled Dynamics and Bounded Uncertainties Based on Neural Networks , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[51]  Manfred Morari,et al.  Optimization‐based autonomous racing of 1:43 scale RC cars , 2015, ArXiv.

[52]  Nolan Wagener,et al.  Learning contact-rich manipulation skills with guided policy search , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[53]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[54]  Torgny Brogårdh,et al.  Present and future robot control development - An industrial perspective , 2007, Annu. Rev. Control..

[55]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[56]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[57]  K. Astr,et al.  Revisiting the Ziegler – Nichols step response method for PID control , 2004 .

[58]  P. Antsaklis Special issue on hybrid systems: theory and applications a brief introduction to the theory and applications of hybrid systems , 2000, Proc. IEEE.

[59]  Peter E. Caines,et al.  Epsilon controllability of nonlinear systems on polytopes , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).