论文信息 - CausalAgents: A Robustness Benchmark for Motion Forecasting using Causal Relationships

CausalAgents: A Robustness Benchmark for Motion Forecasting using Causal Relationships

As machine learning models become increasingly prevalent in motion forecasting for autonomous vehicles (AVs), it is critical to ensure that model predictions are safe and reliable. However, exhaustively collecting and labeling the data necessary to fully test the long tail of rare and challenging scenarios is difficult and expensive. In this work, we construct a new benchmark for evaluating and improving model robustness by applying perturbations to existing data. Specifically, we conduct an extensive labeling effort to identify causal agents, or agents whose presence influences human drivers' behavior in any format, in the Waymo Open Motion Dataset (WOMD), and we use these labels to perturb the data by deleting non-causal agents from the scene. We evaluate a diverse set of state-of-the-art deep-learning model architectures on our proposed benchmark and find that all models exhibit large shifts under even non-causal perturbation: we observe a 25-38% relative change in minADE as compared to the original. We also investigate techniques to improve model robustness, including increasing the training dataset size and using targeted data augmentations that randomly drop non-causal agents throughout training. Finally, we release the causal agent labels (at https://github.com/google-research/causal-agents) as an additional attribute to WOMD and the robustness benchmarks to aid the community in building more reliable and safe deep-learning models for motion forecasting.

[1] Gerard de Melo,et al. NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation , 2021, Northern European Journal of Language Technology.

[2] Khaled S. Refaat,et al. Wayformer: Motion Forecasting via Simple & Efficient Attention Networks , 2022, 2023 IEEE International Conference on Robotics and Automation (ICRA).

[3] Benjamin Sapp,et al. MultiPath++: Efficient Information Fusion and Trajectory Aggregation for Behavior Prediction , 2021, 2022 International Conference on Robotics and Automation (ICRA).

[4] Benjamin Sapp,et al. Identifying Driver Interactions via Conditional Behavior Prediction , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[5] Jiquan Ngiam,et al. Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[6] T. Brox,et al. On Exposing the Challenging Long Tail in Future Prediction of Traffic Actors , 2021, IEEE International Conference on Computer Vision.

[7] Anca D. Dragan,et al. On complementing end-to-end human behavior predictors with planning , 2021, Robotics: Science and Systems.

[8] B. Recht,et al. Do Image Classifiers Generalize Across Time? , 2019, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[9] Jiquan Ngiam,et al. Scene Transformer: A unified multi-task model for behavior prediction and planning , 2021, ArXiv.

[10] Deva Ramanan,et al. What-If Motion Prediction for Autonomous Driving , 2020, ArXiv.

[11] R. Urtasun,et al. Learning Lane Graph Representations for Motion Forecasting , 2020, ECCV.

[12] Benjamin Recht,et al. Measuring Robustness to Natural Distribution Shifts in Image Classification , 2020, NeurIPS.

[13] Ahmet Yazici,et al. See, Attend and Brake: An Attention-based Saliency Map Prediction Model for End-to-End Driving , 2020, ArXiv.

[14] Marco Pavone,et al. Trajectron++: Dynamically-Feasible Trajectory Forecasting with Heterogeneous Data , 2020, ECCV.

[15] Renjie Liao,et al. SpAGNN: Spatially-Aware Graph Neural Networks for Relational Behavior Forecasting from Sensor Data , 2019, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[16] Wolfram Burgard,et al. Multimodal interaction-aware motion prediction for autonomous street crossing , 2018, Int. J. Robotics Res..

[17] Benjamin Sapp,et al. MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction , 2019, CoRL.

[18] Kai Ding,et al. Agent Prioritization for Autonomous Navigation , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19] Gita Alaghband,et al. Trajectory Prediction by Coupling Scene-LSTM with Human Movement LSTM , 2019, ISVC.

[20] Benjamin Sapp,et al. Rules of the Road: Predicting Driving Behavior With a Convolutional Model of Semantic Interactions , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21] Sergey Levine,et al. PRECOG: PREdiction Conditioned on Goals in Visual Multi-Agent Settings , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[22] Quoc V. Le,et al. Using Videos to Evaluate Image Model Robustness , 2019, ArXiv.

[23] Ying Nian Wu,et al. Multi-Agent Tensor Fusion for Contextual Trajectory Prediction , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Vincent Aravantinos,et al. The Simpler the Better: Constant Velocity for Pedestrian Motion Prediction , 2019, ArXiv.

[25] Benjamin Recht,et al. Do ImageNet Classifiers Generalize to ImageNet? , 2019, ICML.

[26] Yutao Han,et al. Pedestrian Motion Model Using Non-Parametric Trajectory Clustering and Discrete Transition Points , 2019, IEEE Robotics and Automation Letters.

[27] Henggang Cui,et al. Multimodal Trajectory Predictions for Autonomous Driving using Deep Convolutional Networks , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[28] Thomas G. Dietterich,et al. Benchmarking Neural Network Robustness to Common Corruptions and Perturbations , 2018, ICLR.

[29] Aleksander Madry,et al. Exploring the Landscape of Spatial Robustness , 2017, ICML.

[30] Matthias Bethge,et al. Generalisation in humans and deep neural networks , 2018, NeurIPS.

[31] Dariu Gavrila,et al. Context-Based Path Prediction for Targets with Switching Dynamics , 2018, International Journal of Computer Vision.

[32] Kate Saenko,et al. Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[33] Fabio Roli,et al. Wild Patterns: Ten Years After the Rise of Adversarial Machine Learning , 2017, Pattern Recognit..

[34] Philip H. S. Torr,et al. DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Dinesh Manocha,et al. GLMP- realtime pedestrian path prediction using global and local movement patterns , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[36] Pascal Frossard,et al. Manitest: Are classifiers really invariant? , 2015, BMVC.

[37] Joan Bruna,et al. Intriguing properties of neural networks , 2013, ICLR.

[38] Fabio Roli,et al. Evasion Attacks against Machine Learning at Test Time , 2013, ECML/PKDD.