Neural learning of vector fields for encoding stable dynamical systems

Abstract The data-driven approximation of vector fields that encode dynamical systems is a persistently hard task in machine learning. If data is sparse and given in the form of velocities derived from few trajectories only, state-space regions exist, where no information on the vector field and its induced dynamics is available. Generalization towards such regions is meaningful only if strong biases are introduced, for instance assumptions on global stability properties of the to-be-learned dynamics. We address this issue in a novel learning scheme that represents vector fields by means of neural networks, where asymptotic stability of the induced dynamics is explicitly enforced through utilizing knowledge from Lyapunov׳s stability theory, in a predefined workspace. The learning of vector fields is constrained through point-wise conditions, derived from a suitable Lyapunov function candidate, which is first adjusted towards the training data. We point out the significance of optimized Lyapunov function candidates and analyze the approach in a scenario where trajectories are learned and generalized from human handwriting motions. In addition, we demonstrate that learning from robotic data obtained by kinesthetic teaching of the humanoid robot iCub leads to robust motion generation.

[1]  Yasuaki Kuroe,et al.  Vector Field Approximation by Model Inclusive Learning of Neural Networks , 2007, ICANN.

[2]  Klaus Neumann,et al.  Neural learning of stable dynamical systems based on data-driven Lyapunov candidates , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  VerriA.,et al.  Motion Field and Optical Flow , 1989 .

[4]  Ferdinando A. Mussa-Ivaldi,et al.  Vector field approximation: a computational paradigm for motor control and learning , 1992, Biological Cybernetics.

[5]  Ferdinando A. Mussa-Ivaldi,et al.  From basis functions to basis fields: vector field approximation from sparse data , 1992, Biological Cybernetics.

[6]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[7]  Eduardo Sontag A universal construction of Artstein's theorem on nonlinear stabilization , 1989 .

[8]  Mory Gharib,et al.  Quantitative Flow Visualization , 2002 .

[9]  Klaus Neumann,et al.  Neurally imprinted stable vector fields , 2013, ESANN.

[10]  Alexander Zelinsky,et al.  Quantitative Safety Guarantees for Physical Human-Robot Interaction , 2003, Int. J. Robotics Res..

[11]  김용수,et al.  Extreme Learning Machine 기반 퍼지 패턴 분류기 설계 , 2015 .

[12]  Mokhtar S. Bazaraa,et al.  Nonlinear Programming: Theory and Algorithms , 1993 .

[13]  Stefan Schaal,et al.  Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.

[14]  Helge J. Ritter,et al.  The dynamic wave expansion neural network model for robot motion planning in time-varying environments , 2005, Neural Networks.

[15]  William J. Kargo,et al.  Modeling of dynamic controls in the frog wiping reflex: Force-field level controls , 2001, Neurocomputing.

[16]  Estimation Method of Motion Fields from Images by Model Inclusive Learning of Neural Networks , 2009, ICANN.

[17]  Nikolaos G. Tsagarakis,et al.  iCub: the design and realization of an open humanoid platform for cognitive and neuroscience research , 2007, Adv. Robotics.

[18]  Darwin G. Caldwell,et al.  On improving the extrapolation capability of task-parameterized movement models , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Aude Billard,et al.  Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models , 2011, IEEE Transactions on Robotics.

[20]  Giulio Sandini,et al.  Joint torque sensing for the upper-body of the iCub humanoid robot , 2009, 2009 9th IEEE-RAS International Conference on Humanoid Robots.

[21]  E. Yaz Linear Matrix Inequalities In System And Control Theory , 1998, Proceedings of the IEEE.

[22]  Luis Moreno,et al.  Kinesthetic teaching via Fast Marching Square , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23]  Jun Nakanishi,et al.  Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[24]  Jochen J. Steil,et al.  Representation and generalization of bi-manual skills from kinesthetic teaching , 2012, 2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012).

[25]  Giulio Sandini,et al.  Computing robot internal/external wrenches by means of inertial, tactile and F/T sensors: Theory and implementation on the iCub , 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots.

[26]  採編典藏組 Society for Industrial and Applied Mathematics(SIAM) , 2008 .

[27]  Tomaso A. Poggio,et al.  Motion Field and Optical Flow: Qualitative Properties , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  IjspeertAuke Jan,et al.  Dynamical movement primitives , 2013 .

[29]  Jean-Claude Latombe,et al.  Numerical potential field techniques for robot path planning , 1991, Fifth International Conference on Advanced Robotics 'Robots in Unstructured Environments.

[30]  Klaus Neumann,et al.  RELIABLE INTEGRATION OF CONTINUOUS CONSTRAINTS INTO EXTREME LEARNING MACHINES , 2013 .

[31]  Hendrik Van Brussel,et al.  Human-inspired robot assistant for fast point-to-point movements , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[32]  Seyed Mohammad,et al.  A Dynamical System-based Approach to Modeling Stable Robot Control Policies via Imitation Learning , 2012 .

[33]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[34]  Stefan Schaal,et al.  http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained , 2007 .

[35]  Katta G. Murty,et al.  Nonlinear Programming Theory and Algorithms , 2007, Technometrics.