High-speed highway scene prediction based on driver models learned from demonstrations

One of the key factors to ensure the safe operation of autonomous and semi-autonomous vehicles in dynamic environments is the ability to accurately predict the motion of the dynamic obstacles in the scene. In this work, we show how to use a realistic driver model learned from demonstrations via Inverse Reinforcement Learning to predict the long-term evolution of highway traffic scenes. We model each traffic participant as a Markov Decision Process in which the cost function is a linear combination of static and dynamic features. In particular, the static features capture the preferences of the driver while the dynamic features, which change over time depending on the actions of the other traffic participants, capture the driver's risk-aversive behavior. Using such a model for prediction enables us to explicitly consider the interactions between traffic participants while keeping the computational complexity quadratic in the number of vehicles in the scene. Preliminary experiments in simulated and real scenarios show the capability of our approach to produce reliable, human-like scene predictions.

[1]  Siddhartha S. Srinivasa,et al.  Planning-based prediction for pedestrians , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Dizan Vasquez,et al.  Novel planning-based algorithms for human motion prediction , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[3]  Wilko Schwarting,et al.  Recursive conflict resolution for cooperative motion planning in dynamic highway traffic , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[4]  Andrew Y. Ng,et al.  Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[5]  Pieter Abbeel,et al.  Apprenticeship learning via inverse reinforcement learning , 2004, ICML.

[6]  Anind K. Dey,et al.  Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.

[7]  Kris M. Kitani,et al.  Predicting wide receiver trajectories in American football , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[8]  Lukas Rummelhard,et al.  Conditional Monte Carlo Dense Occupancy Tracker , 2015, 2015 IEEE 18th International Conference on Intelligent Transportation Systems.

[9]  Katja Vogel,et al.  A comparison of headway and time to collision as safety indicators. , 2003, Accident; analysis and prevention.

[10]  Rüdiger Dillmann,et al.  Learning Driver Behavior Models from Traffic Observations for Decision Making and Planning , 2015, IEEE Intelligent Transportation Systems Magazine.

[11]  Luke Fletcher,et al.  The MIT - Cornell Collision and Why It Happened , 2009, The DARPA Urban Challenge.

[12]  Christian Vollmer,et al.  Learning to navigate through crowded environments , 2010, 2010 IEEE International Conference on Robotics and Automation.

[13]  Sergey Levine,et al.  Continuous Inverse Optimal Control with Locally Optimal Examples , 2012, ICML.

[14]  Martin Buss,et al.  Interactive scene prediction for automotive applications , 2013, 2013 IEEE Intelligent Vehicles Symposium (IV).

[15]  Kai Oliver Arras,et al.  Inverse Reinforcement Learning algorithms and features for robot navigation in crowds: An experimental comparison , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.