Context-based information generation for managing UAV-acquired data using image captioning

Abstract Unmanned aerial vehicles (UAVs) can efficiently collect image data representing various situations at construction sites, however, it requires a lot of time and cost to analyze it manually to retrieve useful information for on-site management. The paper proposes a methodology to generate time-spatial and visual context-based information from UAV-acquired data. This methodology generates textual information on the position, status, movement, color, and quantity of construction resources from site images using image captioning. Then, construction site images, text generated from them, and UAV flight data containing the latitude, longitude, date, and time of day, are systemized into a database. For evaluating the proposed methodology, data obtained by UAV at actual construction sites was used. Our methodology could predict textual information with a mean average precision of 43.52%, which is superior to those of existing methods.

[1]  Brenda McCabe,et al.  Automated computer vision-based detection of components of under-construction indoor partitions , 2017 .

[2]  Seokho Chi,et al.  Action recognition of earthmoving excavators based on sequential pattern analysis of visual features and operation cycles , 2019, Automation in Construction.

[3]  Mani Golparvar-Fard,et al.  Potential of big visual data and building information modeling for construction performance analytics: An exploratory study , 2017 .

[4]  Jie Gong,et al.  Computer Vision-Based Video Interpretation Model for Automated Productivity Analysis of Construction Operations , 2010 .

[5]  John C. Dalton,et al.  Hierarchical browsing and search of large image databases , 2000, IEEE Trans. Image Process..

[6]  Amir Hossein Alavi,et al.  Big data in civil engineering , 2017 .

[7]  Edward Y. Chang,et al.  CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines , 2003, IEEE Trans. Circuits Syst. Video Technol..

[8]  Youngjib Ham,et al.  Vision-based nonintrusive context documentation for earthmoving productivity simulation , 2019, Automation in Construction.

[9]  Mani Golparvar-Fard,et al.  Appearance-based material classification for monitoring of operation-level construction progress using 4D BIM and site photologs , 2015 .

[10]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[11]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[12]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Xiaowei Luo,et al.  Transfer learning and deep convolutional neural networks for safety guardrail detection in 2D images , 2018 .

[14]  Kinam Kim,et al.  Image-based construction hazard avoidance system using augmented reality in wearable device , 2017 .

[15]  Markus König,et al.  Interior construction state recognition with 4D BIM registered image sequences , 2018 .

[16]  Youngjib Ham,et al.  Analyzing context and productivity of tunnel earthmoving processes using imaging and simulation , 2018, Automation in Construction.

[17]  QadirJunaid,et al.  Big Data in the construction industry , 2016 .

[18]  Patricio A. Vela,et al.  Fusion of Photogrammetry and Video Analysis for Productivity Assessment of Earthwork Processes , 2017, Comput. Aided Civ. Infrastructure Eng..

[19]  Michael S. Bernstein,et al.  Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations , 2016, International Journal of Computer Vision.

[20]  Peter E. D. Love,et al.  Automated detection of workers and heavy equipment on construction sites: A convolutional neural network approach , 2018, Adv. Eng. Informatics.

[21]  Hiam Khoury,et al.  Vision-Based Framework for Intelligent Monitoring of Hardhat Wearing on Construction Sites , 2019, J. Comput. Civ. Eng..

[22]  Lukumon O. Oyedele,et al.  Big Data in the construction industry: A review of present status, opportunities, and future trends , 2016, Adv. Eng. Informatics.

[23]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Frédéric Bosché,et al.  Tracking of secondary and temporary objects in structural concrete work , 2014 .

[25]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[26]  Vineet R. Kamat,et al.  Remote proximity monitoring between mobile construction resources using camera-mounted UAVs , 2019, Automation in Construction.

[27]  Hyoungkwan Kim,et al.  UAV-based automatic generation of high-resolution panorama at a construction site with a focus on preprocessing for image stitching , 2017 .

[28]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.