Forced Variational Integrator Networks for Prediction and Control of Mechanical Systems

As deep learning becomes more prevalent for prediction and control of real physical systems, it is important that these overparameterized models are consistent with physically plausible dynamics. This elicits a problem with how much inductive bias to impose on the model through known physical parameters and principles to reduce complexity of the learning problem to give us more reliable predictions. Recent work employs discrete variational integrators parameterized as a neural network architecture to learn conservative Lagrangian systems. The learned model captures and enforces global energy preserving properties of the system from very few trajectories. However, most real systems are inherently non-conservative and, in practice, we would also like to apply actuation. In this paper we extend this paradigm to account for general forcing (e.g. control input and damping) via discrete d'Alembert's principle which may ultimately be used for control applications. We show that this forced variational integrator networks (FVIN) architecture allows us to accurately account for energy dissipation and external forcing while still capturing the true underlying energy-based passive dynamics. We show that in application this can result in highly-data efficient model-based control and can predict on real non-conservative systems.

[1]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[2]  J. Marsden,et al.  Discrete mechanics and optimal control , 2005 .

[3]  Marc Peter Deisenroth,et al.  Variational Integrator Networks for Physically Structured Embeddings , 2020, AISTATS.

[4]  Jan Peters,et al.  Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning , 2019, ICLR.

[5]  J. Marsden,et al.  Mechanical integrators derived from a discrete variational principle , 1997 .

[6]  Ramitha Sundar,et al.  Blue River Controls: A toolkit for Reinforcement Learning Control Systems on Hardware , 2020, ArXiv.

[7]  David Duvenaud,et al.  Neural Ordinary Differential Equations , 2018, NeurIPS.

[8]  Jason Yosinski,et al.  Hamiltonian Neural Networks , 2019, NeurIPS.

[9]  J. Marsden,et al.  Discrete mechanics and variational integrators , 2001, Acta Numerica.

[10]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[11]  J. Marsden,et al.  DISCRETE MECHANICS AND OPTIMAL CONTROL: AN ANALYSIS ∗ , 2008, 0810.1386.

[12]  Reuven Y. Rubinstein,et al.  Optimization of computer simulation models with rare events , 1997 .

[13]  L. Verlet Computer "Experiments" on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules , 1967 .

[14]  Jérémie Guiochet,et al.  Safety-critical advanced robots: A survey , 2017, Robotics Auton. Syst..

[15]  G. Dullerud,et al.  A Course in Robust Control Theory: A Convex Approach , 2005 .

[16]  Sergey Levine,et al.  When to Trust Your Model: Model-Based Policy Optimization , 2019, NeurIPS.

[17]  Demis Hassabis,et al.  A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play , 2018, Science.

[18]  Marc Peter Deisenroth,et al.  Learning Contact Dynamics using Physically Structured Neural Networks , 2021, AISTATS.

[19]  B. Vujanović Conservation laws of dynamical systems via d'alembert's principle , 1978 .