论文信息 - Improving Movement Predictions of Traffic Actors in Bird's-Eye View Models using GANs and Differentiable Trajectory Rasterization

Improving Movement Predictions of Traffic Actors in Bird's-Eye View Models using GANs and Differentiable Trajectory Rasterization

One of the most critical pieces of the self-driving puzzle is the task of predicting future movement of surrounding traffic actors, which allows the autonomous vehicle to safely and effectively plan its future route in a complex world. Recently, a number of algorithms have been proposed to address this important problem, spurred by a growing interest of researchers from both industry and academia. Methods based on top-down scene rasterization on one side and Generative Adversarial Networks (GANs) on the other have shown to be particularly successful, obtaining state-of-the-art accuracies on the task of traffic movement prediction. In this paper we build upon these two directions and propose a raster-based conditional GAN architecture, powered by a novel differentiable rasterizer module at the input of the conditional discriminator that maps generated trajectories into the raster space in a differentiable manner. This simplifies the task for the discriminator as trajectories that are not scene-compliant are easier to discern, and allows the gradients to flow back forcing the generator to output better, more realistic trajectories. We evaluated the proposed method on a large-scale, real-world data set, showing that it outperforms state-of-the-art GAN-based baselines.

[1] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 2015 .

[2] Dean A. Pomerleau,et al. Neural Network Perception for Mobile Robot Guidance , 1993 .

[3] Dimitar Filev,et al. Intelligent systems in the automotive industry: applications and trends , 2007, Knowledge and Information Systems.

[4] William Whittaker,et al. Tartan Racing: A multi-modal approach to the DARPA Urban Challenge , 2007 .

[5] William Whittaker,et al. Self-Driving Cars and the Urban Challenge , 2008, IEEE Intelligent Systems.

[6] Michael Fleming,et al. Team Cornell's Skynet: Robust perception and planning in an urban environment , 2008 .

[7] Andrei Z. Broder,et al. Search advertising using web relevance feedback , 2008, CIKM '08.

[8] Peter King,et al. Odin: Team VictorTango's Entry in the DARPA Urban Challenge , 2009, The DARPA Urban Challenge.

[9] Dong Ryeol Shin,et al. A Survey of Intelligent Transportation Systems , 2011, 2011 Third International Conference on Computational Intelligence, Communication Systems and Networks.

[10] Giuseppe Nuti,et al. Algorithmic Trading , 2011, Computer.

[11] Adnan Shaout,et al. Advanced Driver Assistance Systems - Past, present and future , 2011, 2011 Seventh International Computer Engineering Conference (ICENCO'2011).

[12] Abdul Hanan Abdullah,et al. A Survey on Intelligent Transportation Systems , 2013 .

[13] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[14] Julius Ziegler,et al. Making Bertha Drive—An Autonomous Journey on a Historic Route , 2014, IEEE Intelligent Transportation Systems Magazine.

[15] Santokh Singh,et al. Critical Reasons for Crashes Investigated in the National Motor Vehicle Crash Causation Survey , 2015 .

[16] Dhananjay Singh,et al. Internet of vehicles for smart and safe driving , 2015, 2015 International Conference on Connected Vehicles and Expo (ICCVE).

[17] J. Parish. The Patient Will See You Now: The Future of Medicine is in Your Hands. , 2015, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[18] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19] Eun-Young Kim. Patient Will See You Now: The Future of Medicine is in Your Hands , 2015, Healthcare Informatics Research.

[20] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[21] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[22] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[23] Silvio Savarese,et al. Social LSTM: Human Trajectory Prediction in Crowded Spaces , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Zhi Chen,et al. Adversarial Feature Matching for Text Generation , 2017, ICML.

[25] Gregory D. Hager,et al. Temporal Convolutional Networks for Action Segmentation and Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Sanja Fidler,et al. Towards Diverse and Natural Image Descriptions via a Conditional GAN , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[27] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.

[28] Demis Hassabis,et al. Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm , 2017, ArXiv.

[29] Dumitru Erhan,et al. Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] Yuval Noah Harari,et al. Reboot for the AI revolution , 2017, Nature.

[31] Thomas Bäck,et al. Artificial Intelligence and Data Science in the Automotive Industry , 2017, ArXiv.

[32] Akansel Cosgun,et al. Towards full automated drive in urban environments: A demonstration in GoMentum Station, California , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[33] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.

[34] Andrew G. Howard,et al. Inverted Residuals and Linear Bottlenecks: Mobile Networks for Classification, Detection and Segmentation , 2018, ArXiv.

[35] Silvio Savarese,et al. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[36] Raquel Urtasun,et al. Matching Adversarial Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37] Bin Yang,et al. Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[38] Henggang Cui,et al. Short-term Motion Prediction of Traffic Actors for Autonomous Driving using Deep Convolutional Networks , 2018 .

[39] Mark Sandler,et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[40] Renjie Liao,et al. Discrete Residual Flow for Probabilistic Pedestrian Behavior Prediction , 2019, CoRL.

[41] Ying Nian Wu,et al. Multi-Agent Tensor Fusion for Contextual Trajectory Prediction , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[42] Julien Pettré,et al. Social Ways: Learning Multi-Modal Distributions of Pedestrian Trajectories With GANs , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[43] Benjamin Sapp,et al. MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction , 2019, CoRL.

[44] Henggang Cui,et al. Improving Movement Prediction of Traffic Actors using Off-road Loss and Bias Mitigation , 2019 .

[45] Mayank Bansal,et al. ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst , 2018, Robotics: Science and Systems.

[46] Silvio Savarese,et al. SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[47] Henggang Cui,et al. Multimodal Trajectory Predictions for Autonomous Driving using Deep Convolutional Networks , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[48] Silvio Savarese,et al. Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks , 2019, NeurIPS.

[49] Carlos Vallespi-Gonzalez,et al. LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[50] David Bradley,et al. Deep Kinematic Models for Physically Realistic Prediction of Vehicle Trajectories , 2019, ArXiv.

[51] Benjamin Sapp,et al. Rules of the Road: Predicting Driving Behavior With a Convolutional Model of Semantic Interactions , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[52] Henggang Cui,et al. Predicting Motion of Vulnerable Road Users using High-Definition Maps and Efficient ConvNets , 2019, 2020 IEEE Intelligent Vehicles Symposium (IV).

[53] J. Schneider,et al. Uncertainty-aware Short-term Motion Prediction of Traffic Actors for Autonomous Driving , 2018, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[54] Nemanja Djuric,et al. Long-term Prediction of Vehicle Behavior using Short-term Uncertainty-aware Trajectories and High-definition Maps , 2020, 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC).

[55] Henggang Cui,et al. Deep Kinematic Models for Kinematically Feasible Vehicle Trajectory Predictions , 2019, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[56] Denis Wolf,et al. Scene Compliant Trajectory Forecast With Agent-Centric Spatio-Temporal Grids , 2019, IEEE Robotics and Automation Letters.