论文信息 - Learning Image-Conditioned Dynamics Models for Control of Underactuated Legged Millirobots

Learning Image-Conditioned Dynamics Models for Control of Underactuated Legged Millirobots

Millirobots are a promising robotic platform for many applications due to their small size and low manufacturing costs. Legged millirobots, in particular, can provide increased mobility in complex environments and improved scaling of obstacles. However, controlling these small, highly dynamic, and underactuated legged systems is difficult. Hand-engineered controllers can sometimes control these legged millirobots, but they have difficulties with dynamic maneuvers and complex terrains. We present an approach for controlling a real-world legged millirobot that is based on learned neural network models. Using less than 17 minutes of data, our method can learn a predictive model of the robot's dynamics that can enable effective gaits to be synthesized on the fly for following user-specified waypoints on a given terrain. Furthermore, by leveraging expressive, high-capacity neural network models, our approach allows for these predictions to be directly conditioned on camera images, endowing the robot with the ability to predict how different terrains might affect its dynamics. This enables sample-efficient and effective learning for locomotion of a dynamic legged millirobot on various terrains, including gravel, turf, carpet, and styrofoam. Videos and further details can be found at https://sites.google.com/view/imageconddyn

[1] Duncan W. Haldane,et al. Automatic identification of dynamic piecewise affine models for a running robot , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2] Florentin Wörgötter,et al. Neuromechanical control for hexapedal robot walking on challenging surfaces and surface classification , 2014, Robotics Auton. Syst..

[3] Michael L. Littman,et al. Perception-based generalization in model-based reinforcement learning , 2009 .

[4] Katie Byl,et al. Metastable legged-robot locomotion , 2008 .

[5] Jan Peters,et al. An experimental comparison of Bayesian optimization for bipedal locomotion , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[6] Daniel E. Koditschek,et al. A Physical Model for Dynamical Arthropod Running on Level Ground , 2008, ISER.

[7] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[8] Sergey Levine,et al. Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[9] Ruben Grandia,et al. Contact Invariant Model Learning for Legged Robot Locomotion , 2018, IEEE Robotics and Automation Letters.

[10] Jonathan E. Clark,et al. iSprawl: Design and Tuning for High-speed Autonomous Open-loop Running , 2006, Int. J. Robotics Res..

[11] Kikuo Fujimura,et al. The intelligent ASIMO: system overview and integration , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.

[13] Tao Wang,et al. Automatic Gait Optimization with Gaussian Process Regression , 2007, IJCAI.

[14] Sebastian Thrun,et al. Stanley: The robot that won the DARPA Grand Challenge , 2006, J. Field Robotics.

[15] H. Benjamin Brown,et al. c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. RHex: A Biologically Inspired Hexapod Runner ∗ , 2022 .

[16] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[17] Dieter Fox,et al. GP-BayesFilters: Bayesian filtering using Gaussian process prediction and observation models , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18] Thomas Kindermann,et al. Walknet--a biologically inspired network to control six-legged walking , 1998, Neural Networks.

[19] Philip Holmes,et al. Stability Analysis of a Clock-Driven Rigid-Body SLIP Model for RHex , 2004, Int. J. Robotics Res..

[20] Manuela M. Veloso,et al. An evolutionary approach to gait learning for four-legged robots , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[21] Daniel E. Koditschek,et al. Characterization of Dynamic Behaviors in a Hexapod Robot , 2010, ISER.

[22] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[23] Sergey Levine,et al. Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[24] Duncan W. Haldane,et al. Performance analysis and terrain classification for a legged robot over rough terrain , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25] Samuel Burden,et al. Bio-inspired design and dynamic maneuverability of a minimally actuated six-legged robot , 2010, 2010 3rd IEEE RAS & EMBS International Conference on Biomedical Robotics and Biomechatronics.

[26] Christopher G. Atkeson,et al. Optimization and learning for rough terrain legged locomotion , 2011, Int. J. Robotics Res..

[27] David Zarrouk,et al. Dynamic legged locomotion for palm-size robots , 2015, Defense + Security Symposium.

[28] Stefan Schaal,et al. Fast, robust quadruped locomotion over challenging terrain , 2010, 2010 IEEE International Conference on Robotics and Automation.

[29] Howie Choset,et al. Using response surfaces and expected improvement to optimize snake robot gait parameters , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[30] Mark R. Cutkosky,et al. Integrated Ground Reaction Force Sensing and Terrain Classification for Small Legged Robots , 2016, IEEE Robotics and Automation Letters.

[31] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.

[32] H. Sebastian Seung,et al. Learning to Walk in 20 Minutes , 2005 .

[33] Jonathan E. Clark,et al. Biomimetic design and fabrication of a hexapedal running robot , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[34] Duncan W. Haldane,et al. Roll oscillation modulated turning in dynamic millirobots , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[35] Kevin Blankespoor,et al. BigDog, the Rough-Terrain Quadruped Robot , 2008 .

[36] Carl E. Rasmussen,et al. PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.

[37] Stefan Carlsson,et al. CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[38] Ronald S. Fearing,et al. Fast scale prototyping for folded millirobots , 2008, 2008 IEEE International Conference on Robotics and Automation.

[39] Michael C. Yip,et al. Model-Less Feedback Control of Continuum Manipulators in Constrained Environments , 2014, IEEE Transactions on Robotics.

[40] Auke Jan Ijspeert,et al. Learning robot gait stability using neural networks as sensory feedback function for Central Pattern Generators , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[41] Jan Peters,et al. Toward fast policy search for learning legged locomotion , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[42] Duncan W. Haldane,et al. Animal-inspired design and aerodynamic stabilization of a hexapedal millirobot , 2013, 2013 IEEE International Conference on Robotics and Automation.

[43] Pieter Abbeel,et al. Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion , 2007, NIPS.

[44] Anil V. Rao,et al. ( Preprint ) AAS 09-334 A SURVEY OF NUMERICAL METHODS FOR OPTIMAL CONTROL , 2009 .

[45] Roland Siegwart,et al. Haptic terrain classification for legged robots , 2010, 2010 IEEE International Conference on Robotics and Automation.

[46] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[47] Jessy W. Grizzle,et al. Supervised learning for stabilizing underactuated bipedal robot locomotion, with outdoor experiments on the wave field , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[48] Peter Fankhauser,et al. ANYmal - a highly mobile and dynamic quadrupedal robot , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[49] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.