Real-world Mapping of Gaze Fixations Using Instance Segmentation for Road Construction Safety Applications

Research studies have shown that a large proportion of hazards remain unrecognized, which expose construction workers to unanticipated safety risks. Recent studies have also found that a strong correlation exists between viewing patterns of workers, captured using eye-tracking devices, and their hazard recognition performance. Therefore, it is important to analyze the viewing patterns of workers to gain a better understanding of their hazard recognition performance. This paper proposes a method that can automatically map the gaze fixations collected using a wearable eye-tracker to the predefined areas of interests. The proposed method detects these areas or objects (i.e., hazards) of interests through a computer vision-based segmentation technique and transfer learning. The mapped fixation data is then used to analyze the viewing behaviors of workers and compute their attention distribution. The proposed method is implemented on an under construction road as a case study to evaluate the performance of the proposed method.

[1]  Tariq S. Abdelhamid,et al.  Identifying Root Causes of Construction Accidents , 2001 .

[2]  A F Kramer,et al.  Age differences in visual search for feature, conjunction, and triple-conjunction targets. , 1997, Psychology and aging.

[3]  Edgar J. Lobaton,et al.  Building an Integrated Mobile Robotic System for Real-Time Applications in Construction , 2018, Proceedings of the 35th International Symposium on Automation and Robotics in Construction (ISARC).

[4]  Alex Albert,et al.  Automating and scaling personalized safety training using eye-tracking data , 2018, Automation in Construction.

[5]  Guang-Zhong Yang,et al.  Eye tracking for skills assessment and training: a systematic review. , 2014, The Journal of surgical research.

[6]  Edgar Lobaton,et al.  Vision-based integrated mobile robotic system for real-time applications in construction , 2018, Automation in Construction.

[7]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[9]  Natalie V Schwatka,et al.  Safety climate and safety behaviors in the construction industry: The importance of co-workers commitment to safety. , 2016, Work.

[10]  Sathyanarayanan Rajendran,et al.  Impact of Green Building Design and Construction on Worker Safety and Health , 2009 .

[11]  Khashayar Asadi,et al.  Real-Time Image-to-BIM Registration Using Perspective Alignment for Automated Construction Monitoring , 2018 .

[12]  John Tran,et al.  cuDNN: Efficient Primitives for Deep Learning , 2014, ArXiv.

[13]  Edgar Lobaton,et al.  LNSNet: Lightweight Navigable Space Segmentation for Autonomous Robots on Construction Sites , 2019, Data.

[14]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15]  Gregory A. Howell,et al.  Systems Model of Construction Accident Causation , 2005 .

[16]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[18]  Kevin Skadron,et al.  Scalable parallel programming , 2008, 2008 IEEE Hot Chips 20 Symposium (HCS).

[19]  Chin-Teng Lin,et al.  Using eye-tracker to compare search patterns between experienced and novice workers for site hazard identification , 2016 .

[20]  Alex Albert,et al.  Scaling Personalized Safety Training Using Automated Feedback Generation , 2018 .

[21]  Christopher D. Wickens,et al.  Modeling the Control of Attention in Visual Workspaces , 2011, Hum. Factors.

[22]  Alex Albert,et al.  Are Visual Search Patterns Predictive of Hazard Recognition Performance? Empirical Investigation Using Eye-Tracking Technology , 2019, Journal of Construction Engineering and Management.

[23]  Roger Azevedo,et al.  Development and Testing of a Personalized Hazard-Recognition Training Intervention , 2017 .

[24]  Matthew R. Hallowell,et al.  Experimental field testing of a real-time construction hazard identification and transmission technique , 2014 .

[25]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[26]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[27]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  S. Asadi,et al.  Advancing Safety by In-Depth Assessment of Workers Attention and Perception , 2017 .

[29]  Mojtaba Noghabaei,et al.  Real-Time Image Localization and Registration with BIM Using Perspective Alignment for Indoor Monitoring of Construction , 2019, J. Comput. Civ. Eng..

[30]  Susanne Bahn,et al.  Workplace hazard identification and management: The case of an underground mining operation , 2013 .

[31]  Mani Golparvar-Fard,et al.  Potential of big visual data and building information modeling for construction performance analytics: An exploratory study , 2017 .

[32]  Roberto Cabeza,et al.  Age-related preservation of top-down attentional guidance during visual search. , 2004, Psychology and aging.

[33]  Edgar J. Lobaton,et al.  Real-time Scene Segmentation Using a Light Deep Neural Network Architecture for Autonomous Robot Navigation on Construction Sites , 2019, Computing in Civil Engineering 2019.

[34]  Paul J Carlson,et al.  Use of Fixation Heat Maps to Evaluate Visual Behavior of Unfamiliar Drivers on Horizontal Curves , 2014 .

[35]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Kaiming He,et al.  Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour , 2017, ArXiv.

[37]  Aga Bojko,et al.  Eye Tracking the User Experience: A Practical Guide to Research , 2013 .

[38]  Vahid Balali,et al.  Improved Stakeholder Communication and Visualizations: Real-Time Interaction and Cost Estimation within Immersive Virtual Environments , 2018 .

[39]  Mani Golparvar Fard,et al.  Formalized knowledge of construction sequencing for visual monitoring of work-in-progress via incomplete point clouds and low-LoD 4D BIMs , 2015, Adv. Eng. Informatics.

[40]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Koen Lamberts,et al.  The time course of similarity effects in visual search. , 2011, Journal of experimental psychology. Human perception and performance.

[42]  Kevin K. Han,et al.  Perspective-Based Image-to-BIM Alignment for Automated Visual Data Collection and Construction Performance Monitoring , 2017 .

[43]  Helen Lingard,et al.  Occupational health and safety in the construction industry , 2013 .

[44]  Andry Rakotonirainy,et al.  Driver’s Visual Performance in Rear-End Collision Avoidance Process under the Influence of Cell Phone Use , 2018, Transportation Research Record: Journal of the Transportation Research Board.

[45]  Yu Zhang,et al.  Simulator Study of Driver Responses to Pedestrian Treatments at Multilane Roundabouts , 2012, Transportation research record.

[46]  Alex Albert,et al.  Development of Immersive Personalized Training Environment for Construction Workers , 2017 .

[47]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Alexander Pollatsek,et al.  Empirical Evaluation of Hazard Anticipation Behaviors in the Field and on Driving Simulator Using Eye Tracker , 2007 .

[49]  Simon Smith,et al.  Safety hazard identification on construction projects , 2006 .

[50]  Eugenio Culurciello,et al.  ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation , 2016, ArXiv.