Interaction Detection Between Vehicles and Vulnerable Road Users: A Deep Generative Approach with Attention

Intersections where vehicles are permitted to turn and interact with vulnerable road users (VRUs), such as pedestrians and cyclists, are among the most challenging locations for automated and accurate recognition of road users’ behavior. In this paper, we propose a deep conditional generative model for interaction detection at such locations. It aims to automatically analyze massive amounts of video data while accounting for the continuity of road users’ behavior. This task is essential for many intelligent transportation systems, such as traffic safety control and self-driving cars, that depend on understanding road users’ locomotion. A Conditional Variational Auto-Encoder (CVAE) based model with Gaussian latent variables is trained to encode road users’ behavior and to perform probabilistic and diverse predictions of interactions. As input, the model takes road users’ type, position, and motion, which are automatically extracted from videos by a deep learning object detector and optical flow. It generates frame-wise probabilities that represent the dynamics of interactions between a turning vehicle and any VRUs involved. The model’s efficacy was validated by testing on real-world datasets acquired from two different intersections. It achieved an F1-score above 0.96 at a right-turn intersection in Germany and 0.89 at a left-turn intersection in Japan, both with very busy traffic flows.
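
To make the role of the Gaussian latent variables more concrete, the following is a minimal NumPy sketch of three standard CVAE ingredients: the reparameterized sampling step, the KL divergence to a standard normal prior, and frame-wise probabilities averaged over several latent samples. It is an illustration under stated assumptions, not the authors' implementation. The function names, the toy decoder, the feature layout (one feature row per frame), and the choice to sample from a standard normal prior at prediction time are all hypothetical.

```python
import numpy as np


def reparameterize(mu, log_var, rng):
    """Draw z ~ N(mu, diag(exp(log_var))) as mu + sigma * eps, eps ~ N(0, I)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(exp(log_var))) || N(0, I)), summed over latent dimensions."""
    return -0.5 * np.sum(1.0 + log_var - mu**2 - np.exp(log_var), axis=-1)


def frame_probabilities(decoder, condition, latent_dim, n_samples, rng):
    """Average decoder outputs over latent samples drawn from N(0, I).

    condition: array of shape (frames, features), the observed road-user
    information per frame (assumed layout).
    Returns an array of shape (frames,) with one probability per frame.
    """
    outputs = [
        decoder(rng.standard_normal(latent_dim), condition)
        for _ in range(n_samples)
    ]
    return np.mean(outputs, axis=0)


def toy_decoder(z, condition):
    """Hypothetical stand-in for a trained decoder network.

    Sums each frame's features, shifts by the mean of z, and applies a
    sigmoid so that every frame receives a value in (0, 1).
    """
    logits = condition.sum(axis=1) + z.mean()
    return 1.0 / (1.0 + np.exp(-logits))


rng = np.random.default_rng(0)
features = rng.standard_normal((5, 3))  # 5 frames, 3 made-up features each
probs = frame_probabilities(toy_decoder, features, latent_dim=8, n_samples=20, rng=rng)
print(probs.shape)
```

Each latent sample yields a different sequence of frame-wise probabilities, which is one way to read the "probabilistic and diverse predictions" mentioned above. Averaging the samples is only one possible aggregation and is assumed here for simplicity.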
