Hierarchical Interpretable Imitation Learning for End-to-End Autonomous Driving

End-to-end autonomous driving provides a simple and efficient framework for autonomous driving systems, which can directly obtain control commands from raw perception data. However, it fails to address stability and interpretability problems in complex urban scenarios. In this paper, we construct a two-stage end-to-end autonomous driving model for complex urban scenarios, named HIIL (Hierarchical Interpretable Imitation Learning), which integrates interpretable BEV mask and steering angle to solve the problems shown above. In Stage One, we propose a pretrained Bird's Eye View (BEV) model which leverages a BEV mask to present an interpretation of the surrounding environment. In Stage Two, we construct an Interpretable Imitation Learning (IIL) model that fuses BEV latent feature from Stage One with an additional steering angle from Pure-Pursuit (PP) algorithm. In the HIIL model, visual information is converted to semantic images by the semantic segmentation network, and the semantic images are encoded to extract the BEV latent feature, which are decoded to predict BEV masks and fed to the IIL as perception data. In this way, the BEV latent feature bridges the BEV and IIL models. Visual information can be supplemented by the calculated steering angle for PP algorithm, speed vector, and location information, thus it could have better performance in complex and terrible scenarios. Our HIIL model meets an urgent requirement for interpretability and robustness of autonomous driving. We validate the proposed model in the CARLA simulator with extensive experiments which show remarkable interpretability, generalization, and robustness capability in unknown scenarios for navigation tasks.

[1]  Chen Lv,et al.  Milestones in Autonomous Driving and Intelligent Vehicles: Survey of Surveys , 2023, IEEE Transactions on Intelligent Vehicles.

[2]  Feiyue Wang,et al.  Parallel Driving OS: A Ubiquitous Operating System for Autonomous Driving in CPSS , 2022, IEEE Transactions on Intelligent Vehicles.

[3]  D. Cao,et al.  Verification and Validation Methods for Decision-Making and Planning of Automated Vehicles: A Review , 2022, IEEE Transactions on Intelligent Vehicles.

[4]  Xin Xu,et al.  Receding-Horizon Reinforcement Learning Approach for Kinodynamic Motion Planning of Autonomous Vehicles , 2022, IEEE Transactions on Intelligent Vehicles.

[5]  Bai Li,et al.  Online Trajectory Replanning for Sudden Environmental Changes During Automated Parking: A Parallel Stitching Method , 2022, IEEE Transactions on Intelligent Vehicles.

[6]  Baopu Li,et al.  GMR-RRT*: Sampling-Based Path Planning Using Gaussian Mixture Regression , 2022, IEEE Transactions on Intelligent Vehicles.

[7]  Changliu Liu,et al.  Social Interactions for Autonomous Driving: A Review and Perspective , 2022, Found. Trends Robotics.

[8]  Yunfeng Ai,et al.  AutoMine: An Unmanned Mine Dataset , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Weida Wang,et al.  An Optimization-Based Path Planning Approach for Autonomous Vehicles Using the DynEFWA-Artificial Potential Field , 2022, IEEE Transactions on Intelligent Vehicles.

[10]  Long Chen,et al.  Conditional DQN-Based Motion Planning With Fuzzy Logic for Autonomous Driving , 2022, IEEE Transactions on Intelligent Transportation Systems.

[11]  Xiao Wang,et al.  Modeling and Simulation of Crowd Evacuation With Signs at Subway Platform: A Case Study of Beijing Subway Stations , 2022, IEEE Transactions on Intelligent Transportation Systems.

[12]  Long Chen,et al.  Deep Learning for Image and Point Cloud Fusion in Autonomous Driving: A Review , 2020, IEEE Transactions on Intelligent Transportation Systems.

[13]  Masayoshi Tomizuka,et al.  Interpretable End-to-End Urban Autonomous Driving With Latent Deep Reinforcement Learning , 2020, IEEE Transactions on Intelligent Transportation Systems.

[14]  Long Chen,et al.  Learning a Deep Cascaded Neural Network for Multiple Motion Commands Prediction in Autonomous Driving , 2021, IEEE Transactions on Intelligent Transportation Systems.

[15]  Hangbin Wu,et al.  Deep Neural Network Based Vehicle and Pedestrian Detection for Autonomous Driving: A Survey , 2021, IEEE Transactions on Intelligent Transportation Systems.

[16]  R. Urtasun,et al.  Perceive, Attend, and Drive: Learning Spatial Attention for Safe Self-Driving , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[17]  Min Zhou,et al.  Parallel Urban Rail Transit Stations for Passenger Emergency Management , 2020, IEEE Intelligent Systems.

[18]  Long Chen,et al.  A survey on deep learning methods for scene flow estimation , 2020, Pattern Recognit..

[19]  Fenghua Zhu,et al.  Parallel Transportation Systems: Toward IoT-Enabled Smart Urban Traffic Control and Management , 2020, IEEE Transactions on Intelligent Transportation Systems.

[20]  Long Chen,et al.  A Reinforcement Learning-Based Adaptive Path Tracking Approach for Autonomous Driving , 2020, IEEE Transactions on Vehicular Technology.

[21]  Bingbing Zhuang,et al.  Understanding Road Layout From Videos as a Whole , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Lutz Eckstein,et al.  A Sim2Real Deep Learning Approach for the Transformation of Images from Multiple Vehicle-Mounted Cameras to a Semantically Segmented Image in Bird’s Eye View , 2020, 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC).

[23]  Chen Sun,et al.  Imitative Reinforcement Learning Fusing Vision and Pure Pursuit for Self-driving , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Roberto Cipolla,et al.  Predicting Semantic Map Representations From Images Using Pyramid Occupancy Networks , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Fei-Yue Wang,et al.  Learning Driving Models From Parallel End-to-End Driving Data Set , 2020, Proceedings of the IEEE.

[26]  Wenshuo Wang,et al.  Learning V2V interactive driving patterns at signalized intersections , 2019, Transportation Research Part C: Emerging Technologies.

[27]  Dongpu Cao,et al.  End-to-End Autonomous Driving: An Angle Branched Network Approach , 2019, IEEE Transactions on Vehicular Technology.

[28]  Eder Santana,et al.  Exploring the Limitations of Behavior Cloning for Autonomous Driving , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29]  Long Chen,et al.  High-Speed Scene Flow on Embedded Commercial Off-the-Shelf Systems , 2019, IEEE Transactions on Industrial Informatics.

[30]  Nanning Zheng,et al.  Parallel testing of vehicle intelligence via virtual-real interaction , 2019, Science Robotics.

[31]  Roberto Cipolla,et al.  Orthographic Feature Transform for Monocular 3D Object Detection , 2018, BMVC.

[32]  David Janz,et al.  Learning to Drive in a Day , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[33]  Hong Wang,et al.  Parallel planning: a new motion planning framework for autonomous driving , 2019, IEEE/CAA Journal of Automatica Sinica.

[34]  Qiao Li,et al.  Consensus-Based Distributed Economic Dispatch Control Method in Power Systems , 2019, IEEE Transactions on Smart Grid.

[35]  Dongpu Cao,et al.  Optimization of Pure Pursuit Controller based on PID Controller and Low-pass Filter , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[36]  Long Chen,et al.  A Fast and Efficient Double-Tree RRT$^*$-Like Sampling-Based Planner Applying on Mobile Robotic Systems , 2018, IEEE/ASME Transactions on Mechatronics.

[37]  Long Chen,et al.  Dynamic path planning for autonomous driving on various roads with avoidance of static and moving obstacles , 2018 .

[38]  Alexey Dosovitskiy,et al.  End-to-End Driving Via Conditional Imitation Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[39]  Germán Ros,et al.  CARLA: An Open Urban Driving Simulator , 2017, CoRL.

[40]  Hesham M. Eraqi,et al.  End-to-End Deep Learning for Steering Autonomous Vehicles Considering Temporal Dependencies , 2017, ArXiv.

[41]  James M. Rehg,et al.  Parallel vision for perception and understanding of complex scenes: methods, framework, and perspectives , 2017, Artificial Intelligence Review.

[42]  Nanning Zheng,et al.  Parallel learning: a perspective and a framework , 2017, IEEE/CAA Journal of Automatica Sinica.

[43]  Andrea Palazzi,et al.  Learning to Map Vehicles into Bird's Eye View , 2017, ICIAP.

[44]  Yang Gao,et al.  End-to-End Learning of Driving Models from Large-Scale Video Datasets , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Xin Zhang,et al.  End to End Learning for Self-Driving Cars , 2016, ArXiv.

[46]  Junqiang Xi,et al.  Study of semi-active suspension control strategy based on driving behaviour characteristics , 2015 .

[47]  Tom Schaul,et al.  Universal Value Function Approximators , 2015, ICML.

[48]  Andrew Zisserman,et al.  Spatial Transformer Networks , 2015, NIPS.

[49]  Peter Englert,et al.  Multi-task policy search for robotics , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[50]  Philip H. S. Torr,et al.  Automatic dense visual semantic mapping from street-level imagery , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[51]  Christopher M. Bishop,et al.  A New Framework for Machine Learning , 2008, WCCI.