Modeling Dynamic Systems for Multi-Step Prediction with Recurrent Neural Networks

This thesis investigates the applicability of Recurrent Neural Networks (RNNs) and Deep Learning methods for multi-step prediction of robotic systems. The unmodeled dynamics and simplifying assumptions in classic modeling methods result in models that yield rapidly diverging predictions when the model is used in an iterative fashion, i.e., for multi-step prediction. However, the effect of the unmodeled dynamics can be captured by collecting datasets of the system. Deep Learning provides a strong set of tools to extract patterns from data, however, large datasets are commonly required for the methods to work well. Collecting a large amount of data from a robotic system can be a cumbersome and expensive approach. In this work, Deep Learning methods, particularly RNNs, are studied and employed for the purpose of learning models of two aerial vehicles from experimental data. The feasibility of employing RNNs is first studied to learn a model of a quadrotor based on a simulated dataset, which yields a Multi Layer Fully Connected (MLFC) architecture. Models can be learned for multi-step prediction, recovering excellent predictions over 500 timesteps in the presence of simulated disturbances to the robot and noise on the measurements. To learn models from experimental data, the RNN state initialization problem is defined and formulated. It is shown that the RNN state initialization problem can be addressed by creating and training an initialization network jointly with the multi-step prediction network, and the combination can be used in a black-box modeling approach such that the model produces predictions which are immediately accurate. The RNN based black-box methods are trained on an experimental dataset gathered from a quadrotor and a publicly available helicopter dataset. The quadrotor dataset, which encompasses approximately 4

[1]  Sergey Levine,et al.  Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Ranjan Ganguli,et al.  Identification of Helicopter Dynamics Using Recurrent Neural Networks and Flight Data , 2006 .

[3]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[4]  Ken-ichi Funahashi,et al.  On the approximate realization of continuous mappings by neural networks , 1989, Neural Networks.

[5]  Gade Pandu Rangaiah,et al.  Identification and predictive control of a multistage evaporator , 2010 .

[6]  Steve B. Jiang,et al.  Nonlinear Systems Identification Using Deep Dynamic Neural Networks , 2016, ArXiv.

[7]  Claire J. Tomlin,et al.  Learning-based model predictive control on a quadrotor: Onboard implementation and experimental results , 2012, 2012 IEEE International Conference on Robotics and Automation.

[8]  Vincent A Akpan,et al.  Nonlinear model identification and adaptive model predictive control using neural networks. , 2011, ISA transactions.

[9]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[10]  Peter Tiño,et al.  Learning long-term dependencies in NARX recurrent neural networks , 1996, IEEE Trans. Neural Networks.

[11]  Barak A. Pearlmutter Gradient calculations for dynamic recurrent neural networks: a survey , 1995, IEEE Trans. Neural Networks.

[12]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[13]  Abdelhakim Deboucha,et al.  Small-scale helicopter system identification model using recurrent neural networks , 2010, TENCON 2010 - 2010 IEEE Region 10 Conference.

[14]  Sean R. Anderson,et al.  Nonlinear Dynamic Modeling of Isometric Force Production in Primate Eye Muscle , 2010, IEEE Transactions on Biomedical Engineering.

[15]  Hana Boudjedir Neural Network Control Based on Adaptive Observer for Quadrotor Helicopter , 2012 .

[16]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[17]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[20]  Hans-Georg Zimmermann,et al.  Recurrent Neural Networks are Universal approximators , 2007, Int. J. Neural Syst..

[21]  Ieroham S. Baruch,et al.  A levenberg–marquardt learning applied for recurrent neural identification and control of a wastewater treatment bioprocess , 2009, HIS 2009.

[22]  Steven Lake Waslander,et al.  Modelling a Quadrotor Vehicle Using a Modular Deep Recurrent Neural Network , 2015, 2015 IEEE International Conference on Systems, Man, and Cybernetics.

[23]  Zhan Li,et al.  Muscle Fatigue Tracking with Evoked EMG via Recurrent Neural Network: Toward Personalized Neuroprosthetics , 2014, IEEE Computational Intelligence Magazine.

[24]  D. T. Greenwood Principles of dynamics , 1965 .

[25]  Stephen A. Billings,et al.  Non-linear system identification using neural networks , 1990 .

[26]  Steven Lake Waslander,et al.  Deep Learning a Quadrotor Dynamic Model for Multi-Step Prediction , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[27]  Jan M. Maciejowski,et al.  Predictive control : with constraints , 2002 .

[28]  Peter Sollich,et al.  Theory of Neural Information Processing Systems , 2005 .

[29]  Victor M. Becerra,et al.  SYSTEM IDENTIFICATION USING DYNAMIC NEURAL NETWORKS: TRAINING AND INITIALIZATION ASPECTS , 2002 .

[30]  John F. Kolen,et al.  Dynamical Recurrent Networks , 2001 .

[31]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[32]  Noboru Murata,et al.  Neural Network with Unbounded Activation Functions is Universal Approximator , 2015, 1505.03654.

[33]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[34]  Pieter Abbeel,et al.  Deep learning helicopter dynamics models , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[35]  Yoshua Bengio,et al.  Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[36]  Steven Lake Waslander,et al.  Modular deep Recurrent Neural Network: Application to quadrotors , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[37]  K. Warwick,et al.  Dynamic recurrent neural network for system identification and control , 1995 .

[38]  Jürgen Schmidhuber,et al.  Learning Precise Timing with LSTM Recurrent Networks , 2003, J. Mach. Learn. Res..

[39]  James P. Sethna,et al.  Improvements to the Levenberg-Marquardt algorithm for nonlinear least-squares minimization , 2012, 1201.5885.

[40]  Chris J. B. Macnab,et al.  Robust adaptive control of a quadrotor helicopter , 2011 .

[41]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[42]  Ross A. Knepper,et al.  DeepMPC: Learning Deep Latent Features for Model Predictive Control , 2015, Robotics: Science and Systems.

[43]  Claire J. Tomlin,et al.  Precision flight control for a multi-vehicle quadrotor helicopter testbed , 2011 .

[44]  Péter András,et al.  The Equivalence of Support Vector Machine and Regularization Neural Networks , 2002, Neural Processing Letters.

[45]  Ilya Sutskever,et al.  Training Deep and Recurrent Networks with Hessian-Free Optimization , 2012, Neural Networks: Tricks of the Trade.

[46]  O. Nelles Nonlinear System Identification: From Classical Approaches to Neural Networks and Fuzzy Models , 2000 .

[47]  Chris Eliasmith,et al.  Neural Engineering: Computation, Representation, and Dynamics in Neurobiological Systems , 2004, IEEE Transactions on Neural Networks.

[48]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[49]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[50]  Kumpati S. Narendra,et al.  Identification and control of dynamical systems using neural networks , 1990, IEEE Trans. Neural Networks.

[51]  M. Gupta,et al.  Approximation of discrete-time state-space trajectories using dynamic recurrent neural networks , 1995, IEEE Trans. Autom. Control..

[52]  Steven Lake Waslander,et al.  State initialization for recurrent neural network modeling of time-series data , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[53]  Pierre Roussel-Ragot,et al.  Training recurrent neural networks: why and how? An illustration in dynamical process modeling , 1994, IEEE Trans. Neural Networks.

[54]  Vincent Wertz,et al.  Fuzzy Logic, Identification and Predictive Control , 2004 .

[55]  Simon Haykin,et al.  Neural networks , 1994 .

[56]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[57]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[58]  M. McCall,et al.  Rigid Body Dynamics , 2008 .

[59]  D. W. Jordan,et al.  Nonlinear ordinary differential equations : an introduction to dynamical systems , 1999 .

[60]  Sauro Longhi,et al.  A Feedback Linearization Approach to Fault Tolerance in Quadrotor Vehicles , 2011 .

[61]  Lennart Ljung,et al.  Perspectives on system identification , 2010, Annu. Rev. Control..

[62]  Biao Huang,et al.  System Identification , 2000, Control Theory for Physicists.

[63]  Amir F. Atiya,et al.  Application of the recurrent multilayer perceptron in modeling complex process dynamics , 1994, IEEE Trans. Neural Networks.

[64]  Pieter Abbeel,et al.  Autonomous Helicopter Aerobatics through Apprenticeship Learning , 2010, Int. J. Robotics Res..

[65]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[66]  Jun Wu,et al.  Modeling and control approach to a distinctive quadrotor helicopter. , 2014, ISA transactions.

[67]  Laura Giarré,et al.  NARX models of an industrial power plant gas turbine , 2005, IEEE Transactions on Control Systems Technology.

[68]  Andrew W. Senior,et al.  Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.

[69]  Yuichi Nakamura,et al.  Approximation of dynamical systems by continuous time recurrent neural networks , 1993, Neural Networks.

[70]  C. L. Philip Chen,et al.  The equivalence between fuzzy logic systems and feedforward neural networks , 2000, IEEE Trans. Neural Networks Learn. Syst..

[71]  Danilo P. Mandic,et al.  Recurrent Neural Networks for Prediction: Learning Algorithms, Architectures and Stability , 2001 .

[72]  M. Crucianu,et al.  Multi-step-ahead Prediction with Neural Networks : a Review , 2002 .

[73]  Bart Kosko,et al.  Fuzzy Systems as Universal Approximators , 1994, IEEE Trans. Computers.

[74]  Shuhui Li,et al.  Wind power prediction using recurrent multilayer perceptron neural networks , 2003, 2003 IEEE Power Engineering Society General Meeting (IEEE Cat. No.03CH37491).

[75]  Yi Cao,et al.  Nonlinear system identification for predictive control using continuous time recurrent neural networks and automatic differentiation , 2008 .

[76]  Lennart Ljung,et al.  Estimation of grey box and black box models for non-linear circuit data , 2004 .

[77]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[78]  Thor I. Fossen,et al.  Marine Control Systems Guidance, Navigation, and Control of Ships, Rigs and Underwater Vehicles , 2002 .

[79]  L. Zadeh,et al.  An Introduction to Fuzzy Logic Applications in Intelligent Systems , 1992 .

[80]  Holger Voos,et al.  Nonlinear control of a quadrotor micro-UAV using feedback-linearization , 2009, 2009 IEEE International Conference on Mechatronics.

[81]  G. Lewicki,et al.  Approximation by Superpositions of a Sigmoidal Function , 2003 .

[82]  Xiaoou Li,et al.  Dynamic system identification via recurrent multilayer perceptron , 2002, Inf. Sci..

[83]  Hans-Georg Zimmermann,et al.  Forecasting with Recurrent Neural Networks: 12 Tricks , 2012, Neural Networks: Tricks of the Trade.

[84]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[85]  Hava T. Siegelmann,et al.  Computational capabilities of recurrent NARX neural networks , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[86]  Amir F. Atiya,et al.  Multi-step-ahead prediction using dynamic recurrent neural networks , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[87]  Sarangapani Jagannathan,et al.  Output Feedback Control of a Quadrotor UAV Using Neural Networks , 2010, IEEE Transactions on Neural Networks.

[88]  James D. Keeler,et al.  Layered Neural Networks with Gaussian Hidden Units as Universal Approximations , 1990, Neural Computation.

[89]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[90]  T. Madani,et al.  Adaptive Control via Backstepping Technique and Neural Networks of a Quadrotor Helicopter , 2008 .