Paper information - Explainability of vision-based autonomous driving systems: Review and challenges

Explainability of vision-based autonomous driving systems: Review and challenges

This survey reviews explainability methods for vision-based self-driving systems. Explainability has several facets, and the need for it is strong in driving, a safety-critical application. Gathering contributions from several research fields, namely computer vision, deep learning, autonomous driving, and explainable AI (X-AI), this survey tackles several points. First, it discusses definitions, context, and motivation for gaining more interpretability and explainability from self-driving systems. Second, major recent state-of-the-art approaches to developing self-driving systems are briefly presented. Third, methods that provide post-hoc explanations of a black-box self-driving system are comprehensively organized and detailed. Fourth, approaches from the literature that aim at building self-driving systems that are more interpretable by design are presented and discussed in detail. Finally, remaining open challenges and potential future research directions are identified and examined.

Patrick Pérez | Matthieu Cord | Hedi Ben-Younes | Éloi Zablocki
