Analyzing the relationship between head pose and gaze to model driver visual attention

Monitoring driver behavior is crucial in the design of advanced driver assistance systems (ADAS) that can detect driver actions, providing necessary warnings when not attentive to driving tasks. The visual attention of a driver is an important aspect to consider, as most driving tasks require visual resources. Previous work has investigated algorithms to detect driver visual attention by tracking the head or eye movement. While tracking pupil can give an accurate direction of visual attention, estimating gaze on vehicle environment is a challenging problem due to changes in illumination, head rotations, and occlusions (e.g. hand, glasses). Instead, this paper investigates the use of the head pose as a coarse estimate of the driver visual attention. The key challenge is the non-trivial relation between head and eye movements while glancing to a target object, which depends on the driver, the underlying cognitive and visual demand, and the environment. First, we evaluate the performance of a state-of-the-art head pose detection algorithm over natural driving recordings, which are compared with ground truth estimations derived from AprilTags attached to a headband. Then, the study proposes regression models to estimate the drivers' gaze based on the head position and orientation, which are built with data from natural driving recordings. The proposed system achieves high accuracy over the horizontal direction, but moderate/low performance over the vertical direction. We compare results while our participants were driving, and when the vehicle was parked.

[1]  Carlos Busso,et al.  Driver mirror-checking action detection using multi-modal signals , 2013 .

[2]  Carlos Busso,et al.  Predicting Perceived Visual and Cognitive Distractions of Drivers With Multimodal Features , 2015, IEEE Transactions on Intelligent Transportation Systems.

[3]  Qiang Ji,et al.  Real-Time Eye, Gaze, and Face Pose Tracking for Monitoring Driver Vigilance , 2002, Real Time Imaging.

[4]  L. Stark,et al.  Types of gaze movement: Variable interactions of eye and head movements , 1982, Experimental Neurology.

[5]  M. Trivedi,et al.  Head and eye gaze dynamics during visual attention shifts in complex environments. , 2012, Journal of vision.

[6]  Mohan M. Trivedi,et al.  On the Roles of Eye Gaze and Head Dynamics in Predicting Driver's Intent to Change Lanes , 2009, IEEE Transactions on Intelligent Transportation Systems.

[7]  Carlos Busso,et al.  Detecting Drivers' Mirror-Checking Actions and Its Application to Maneuver and Secondary Task Recognition , 2016, IEEE Transactions on Intelligent Transportation Systems.

[8]  Reinhard Klette,et al.  Look at the Driver, Look at the Road: No Distraction! No Accident! , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  M.M. Trivedi,et al.  HyHOPE: Hybrid Head Orientation and Position Estimation for vision-based driver head tracking , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[10]  G H Robinson,et al.  Visual Search by Automobile Drivers , 1972, Human factors.

[11]  Carlos Busso,et al.  Modeling of Driver Behavior in Real World Scenarios Using Multiple Noninvasive Sensors , 2013, IEEE Transactions on Multimedia.

[12]  Raja Bala,et al.  Estimating Gaze Direction of Vehicle Drivers Using a Smartphone Camera , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[13]  Luke Fletcher,et al.  Driver Inattention Detection based on Eye Gaze—Road Event Correlation , 2009, Int. J. Robotics Res..

[14]  Peter Robinson,et al.  Constrained Local Neural Fields for Robust Facial Landmark Detection in the Wild , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[15]  Mohan M. Trivedi,et al.  Robust and continuous estimation of driver gaze zone by dynamic analysis of multiple face videos , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[16]  Jiann-Der Lee,et al.  A Fuzzy Rules-Based Driver Assistance System , 2015 .

[17]  Teruhisa Misu,et al.  Visual Saliency and Crowdsourcing-based Priors for an In-car Situated Dialog System , 2015, ICMI.

[18]  John D Lee,et al.  Combining cognitive and visual distraction: less than the sum of its parts. , 2010, Accident; analysis and prevention.

[19]  Edwin Olson,et al.  AprilTag: A robust and flexible visual fiducial system , 2011, 2011 IEEE International Conference on Robotics and Automation.

[20]  Kang Ryoung Park,et al.  Real-Time Gaze Estimator Based on Driver's Head Orientation for Forward Collision Warning System , 2011, IEEE Transactions on Intelligent Transportation Systems.