Hao Zhang: Robot Adaptation to Unstructured Terrains by Joint Representation and Apprenticeship Learning

When a mobile robot is deployed in a field environment, e.g., during a disaster response application, the capability of adapting its navigational behaviors to unstructured terrains is essential for effective and safe robot navigation. In this paper, we introduce a novel joint terrain representation and apprenticeship learning approach to implement robot adaptation to unstructured terrains. Different from conventional learning-based adaptation techniques, our approach provides a unified problem formulation that integrates representation and apprenticeship learning under a unified regularized optimization framework, instead of treating them as separate and independent procedures. Our approach also has the capability to automatically identify discriminative feature modalities, which can improve the robustness of robot adaptation. In addition, we implement a new optimization algorithm to solve the formulated problem, which provides a theoretical guarantee to converge to the global optimal solution. In the experiments, we extensively evaluate the proposed approach in real-world scenarios, in which a mobile robot navigates on familiar and unfamiliar unstructured terrains. Experimental results have shown that the proposed approach is able to transfer human expertise to robots with small errors, achieve superior performance compared with previous and baseline methods, and provide intuitive insights on the importance of terrain feature modalities.

[1]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[2]  Farhi Marir,et al.  Case-based reasoning: A review , 1994, The Knowledge Engineering Review.

[3]  Andreas Zell,et al.  Vibration-based Terrain Classification Using Support Vector Machines , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Carl E. Rasmussen,et al.  PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.

[5]  Laura E. Ray,et al.  Mobility characterization for autonomous mobile robots using machine learning , 2011, Auton. Robots.

[6]  Nicolas Pouliot,et al.  Transmission line maintenance robots capable of crossing obstacles: State-of-the-art review and challenges ahead , 2009, J. Field Robotics.

[7]  Lynne E. Parker,et al.  Case study for life-long learning and adaptation in coopertive robot teams , 1999, Optics East.

[8]  M. G. Bekker Introduction to Terrain-Vehicle Systems , 1969 .

[9]  Sebastian Thrun,et al.  Lifelong robot learning , 1993, Robotics Auton. Syst..

[10]  Hua Wang,et al.  Robust Multimodal Sequence-Based Loop Closure Detection via Structured Sparsity , 2016, Robotics: Science and Systems.

[11]  Navinda Kottege,et al.  Terrain characterisation and gait adaptation by a hexapod robot , 2016 .

[12]  Duncan W. Haldane,et al.  Automatic identification of dynamic piecewise affine models for a running robot , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Yoji Kuroda,et al.  Potential Field Navigation of High Speed Unmanned Ground Vehicles on Uneven Terrain , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[14]  David W. Aha,et al.  Trust-Guided Behavior Adaptation Using Case-Based Reasoning , 2015, IJCAI.

[15]  Lynne E. Parker,et al.  Task-oriented multi-robot learning in behavior-based systems , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[16]  Paul Filitchkin,et al.  Feature-based terrain classification for LittleDog , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  Gary Witus,et al.  Terrain characterization and classification with a mobile robot , 2006, J. Field Robotics.

[18]  Hiroshi Kimura,et al.  Realization of Dynamic Walking and Running of the Quadruped Using Neural Oscillator , 1999, Auton. Robots.

[19]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[20]  Sebastian Thrun,et al.  Probabilistic Terrain Analysis For High-Speed Desert Driving , 2006, Robotics: Science and Systems.

[21]  Duncan W. Haldane,et al.  Performance analysis and terrain classification for a legged robot over rough terrain , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  Michael Happold,et al.  Enhancing Supervised Terrain Classification with Predictive Unsupervised Learning , 2006, Robotics: Science and Systems.

[23]  Pietro Perona,et al.  Learning to predict slip for ground robots , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[24]  Bernhard Nebel,et al.  Towards a Life-Long Learning Soccer Agent , 2002, RoboCup.

[25]  R. Siegwart,et al.  ROBOT-CENTRIC ELEVATION MAPPING WITH UNCERTAINTY ESTIMATES , 2014 .

[26]  James S. Albus,et al.  Learning traversability models for autonomous mobile vehicles , 2008, Auton. Robots.

[27]  Yu Zhang,et al.  Sequence-based multimodal apprenticeship learning for robot perception and decision making , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[28]  Andreas Zell,et al.  Terrain classification with conditional random fields on fused 3D LIDAR and camera data , 2013, 2013 European Conference on Mobile Robots.

[29]  Siddhartha S. Srinivasa,et al.  Imitation learning for locomotion and manipulation , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[30]  Hiroshi Kimura,et al.  Biologically-inspired adaptive dynamic walking of the quadruped on irregular terrain , 1999, Proceedings of the 1999 IEEE International Conference on Control Applications (Cat. No.99CH36328).

[31]  Alonzo Kelly,et al.  Optimal Rough Terrain Trajectory Generation for Wheeled Mobile Robots , 2007, Int. J. Robotics Res..

[32]  Roberto Manduchi,et al.  Obstacle Detection and Terrain Classification for Autonomous Off-Road Navigation , 2005, Auton. Robots.

[33]  Dieter Fox,et al.  GP-BayesFilters: Bayesian filtering using Gaussian process prediction and observation models , 2008, IROS.

[34]  Yann LeCun,et al.  Adaptive long range vision in unstructured terrain , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[35]  Sang-Ho Hyon Compliant Terrain Adaptation for Biped Humanoids Without Measuring Ground Surface and Contact Forces , 2009, IEEE Transactions on Robotics.

[36]  Aude Billard,et al.  Incremental learning of gestures by imitation in a humanoid robot , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[37]  Miaoliang Zhu,et al.  Parameter adaptation by case-based mission-planning of outdoor autonomous mobile robot , 2005, IEEE Proceedings. Intelligent Vehicles Symposium, 2005..

[38]  Luis E. Navarro-Serment,et al.  Robot Navigation from Human Demonstration: Learning Control Behaviors , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[39]  David Silver,et al.  Learning from Demonstration for Autonomous Navigation in Complex Unstructured Terrain , 2010, Int. J. Robotics Res..

[40]  Manuel A. Armada,et al.  Multiple Terrain Adaptation Approach Using Ultrasonic Sensors for Legged Robots , 2005, CLAWAR.

[41]  Homayoun Seraji,et al.  Vision-based terrain characterization and traversability assessment , 2001, J. Field Robotics.

[42]  Karl Iagnemma,et al.  Vibration-based terrain classification for planetary exploration rovers , 2005, IEEE Transactions on Robotics.

[43]  Gentaro Taga,et al.  A model of the neuro-musculo-skeletal system for human locomotion , 1995, Biological Cybernetics.

[44]  Xue Yang,et al.  SRAL: Shared Representative Appearance Learning for Long-Term Visual Place Recognition , 2017, IEEE Robotics and Automation Letters.

[45]  Cheng Wang,et al.  Integrating LiDAR Intensity and Elevation Data for Terrain Characterization in a Forested Area , 2009, IEEE Geoscience and Remote Sensing Letters.

[46]  Ross A. Knepper,et al.  DeepMPC: Learning Deep Latent Features for Model Predictive Control , 2015, Robotics: Science and Systems.

[47]  Michael Antolovich,et al.  Learning from Demonstration Using GMM, CHMM and DHMM: A Comparison , 2015, Australasian Conference on Artificial Intelligence.

[48]  Yoon-Gu Kim,et al.  Autonomous terrain adaptation and user-friendly tele-operation of wheel-track hybrid mobile robot , 2012 .

[49]  Sebastian Thrun,et al.  Reverse Optical Flow for Self-Supervised Adaptive Autonomous Robot Navigation , 2007, International Journal of Computer Vision.

[50]  Sergey Levine,et al.  Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics , 2014, NIPS.

[51]  KimuraHiroshi,et al.  Realization of Dynamic Walking and Running of the Quadruped Using Neural Oscillator , 1999 .

[52]  Shinji Kawatsuma,et al.  Emergency response by robots to Fukushima-Daiichi accident: summary and lessons learned , 2012, Ind. Robot.

[53]  Wolfram Burgard,et al.  Learning predictive terrain models for legged robot locomotion , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[54]  William Whittaker,et al.  A robust approach to high‐speed navigation for unrehearsed desert terrain , 2007 .

[55]  David Zhang,et al.  A Survey of Sparse Representation: Algorithms and Applications , 2015, IEEE Access.

[56]  Panagiotis Papadakis,et al.  Terrain traversability analysis methods for unmanned ground vehicles: A survey , 2013, Eng. Appl. Artif. Intell..

[57]  Steven Dubowsky,et al.  Online terrain parameter estimation for wheeled mobile robots with application to planetary rovers , 2004, IEEE Transactions on Robotics.

[58]  Wolfram Burgard,et al.  The dynamic window approach to collision avoidance , 1997, IEEE Robotics Autom. Mag..

[59]  Lynne E. Parker,et al.  Lifelong Adaptation in Heterogeneous Multi-Robot Teams: Response to Continual Variation in Individual Robot Performance , 2000, Auton. Robots.

[60]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[61]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).