Dynamic Occupancy Grid Prediction for Urban Autonomous Driving: A Deep Learning Approach with Fully Automatic Labeling

Long-term situation prediction plays a crucial role for intelligent vehicles. A major challenge still to overcome is the prediction of complex downtown scenarios with multiple road users, e.g., pedestrians, bikes, and motor vehicles, interacting with each other. This contribution tackles this challenge by combining a Bayesian filtering technique for environment representation, and machine learning as long-term predictor. More specifically, a dynamic occupancy grid map is utilized as input to a deep convolutional neural network. This yields the advantage of using spatially distributed velocity estimates from a single time step for prediction, rather than a raw data sequence, alleviating common problems dealing with input time series of multiple sensors. Furthermore, convolutional neural networks have the inherent characteristic of using context information, enabling the implicit modeling of road user interaction. Pixel-wise balancing is applied in the loss function counteracting the extreme imbalance between static and dynamic cells. One of the major advantages is the unsupervised learning character due to fully automatic label generation. The presented algorithm is trained and evaluated on multiple hours of recorded sensor data and compared to Monte-Carlo simulation. Experiments show the ability to model complex interactions.

[1]  Graham W. Taylor,et al.  Adaptive deconvolutional networks for mid and high level feature learning , 2011, 2011 International Conference on Computer Vision.

[2]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Arthur P. Dempster,et al.  A Generalization of Bayesian Inference , 1968, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[4]  Thomas Schamm,et al.  Understanding interactions between traffic participants based on learned behaviors , 2016, 2016 IEEE Intelligent Vehicles Symposium (IV).

[5]  Amy Loutfi,et al.  A review of unsupervised feature learning and deep learning for time-series modeling , 2014, Pattern Recognit. Lett..

[6]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[7]  Yaakov Bar-Shalom,et al.  Update with out-of-sequence measurements in tracking: exact solution , 2000, SPIE Defense + Commercial Sensing.

[8]  Jurgen Wiest,et al.  Statistical long-term motion prediction , 2017 .

[9]  Dirk Wollherr,et al.  Object tracking based on evidential dynamic occupancy grids in urban environments , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[10]  Alberto Elfes,et al.  Using occupancy grids for mobile robot perception and navigation , 1989, Computer.

[11]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Chih-Fong Tsai,et al.  Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches , 2010, Decis. Support Syst..

[13]  Christoph Stiller,et al.  Map-based long term motion prediction for vehicles in traffic environments , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[14]  Torsten Bertram,et al.  Track-to-Track Fusion With Asynchronous Sensors Using Information Matrix Fusion for Surround Environment Perception , 2012, IEEE Transactions on Intelligent Transportation Systems.

[15]  E. Fama The Behavior of Stock-Market Prices , 1965 .

[16]  David J. Cole,et al.  Models of driver speed choice in curves , 2004 .

[17]  Klaus C. J. Dietmayer,et al.  A random finite set approach for dynamic occupancy grid maps with real-time application , 2016, Int. J. Robotics Res..

[18]  Kai Oliver Arras,et al.  People tracking with human motion predictions from social forces , 2010, 2010 IEEE International Conference on Robotics and Automation.

[19]  Klaus C. J. Dietmayer,et al.  Probabilistic long-term prediction for autonomous vehicles , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[20]  Seunghoon Hong,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[21]  Ramanathan V. Guha,et al.  The predictive power of online chatter , 2005, KDD '05.

[22]  LI X.RONG,et al.  Survey of maneuvering target tracking. Part I. Dynamic models , 2003 .

[23]  Klaus C. J. Dietmayer,et al.  A learning concept for behavior prediction at intersections , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[24]  J. Agrawal,et al.  State-of-the-Art in Stock Prediction Techniques , 2013 .

[25]  Wolfram Burgard,et al.  Probabilistic Robotics (Intelligent Robotics and Autonomous Agents) , 2005 .

[26]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[27]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[28]  Yi Zheng,et al.  Time Series Classification Using Multi-Channels Deep Convolutional Neural Networks , 2014, WAIM.

[29]  Klaus C. J. Dietmayer,et al.  Fusion of laser and radar sensor data with a sequential Monte Carlo Bayesian occupancy filter , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[30]  Martin Treiber,et al.  Calibrating Car-Following Models by Using Trajectory Data , 2008, 0803.4063.