Image dataset development for measuring construction equipment recognition performance

Abstract The recognition of construction operational resources (equipment, workers, materials, etc.) has played an important role in achieving fully automated construction. So far, many object recognition methods have been developed in computer vision; however, they have been tested with a few categories of objects in natural scenes. Therefore, their performance on the recognition of construction operational resources is unclear, especially considering construction sites are typically dirty, disorderly, and cluttered. This paper proposes a standard dataset of construction site images to measure the construction equipment recognition performance of existing object recognition methods. Thousands of images have been collected and compiled, which cover 5 classes of construction equipment (excavator, loader, dozer, roller and backhoe). Each image has been annotated with the equipment type, location, orientation, occlusion, and labeling of equipment components (bucket, stick, boom etc.). The effectiveness of the dataset has been evaluated with two well-known object recognition methods in computer vision. The results show that the dataset could successfully identify the performance of these methods in terms of correctness, robustness, and speed of recognizing construction equipment.

[1]  Joseph L. Mundy,et al.  Object Recognition in the Geometric Era: A Retrospective , 2006, Toward Category-Level Object Recognition.

[2]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  G. Griffin,et al.  Caltech-256 Object Category Dataset , 2007 .

[4]  Sinisa Todorovic,et al.  From contours to 3D object detection and pose estimation , 2011, 2011 International Conference on Computer Vision.

[5]  Vincent Lepetit,et al.  Gradient Response Maps for Real-Time Detection of Textureless Objects , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Mark D. Semon,et al.  POSTUSE REVIEW: An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements , 1982 .

[7]  John E. Hummel Object Recognition , 2014, Computer Vision, A Reference Guide.

[8]  Brenda McCabe,et al.  Vision-based recognition of dirt loading cycles in construction sites , 2012 .

[9]  Peter V. Gehler,et al.  Teaching 3D geometry to deformable part models , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[11]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Jeffrey S. Bohn,et al.  Construction Project Monitoring Using High-Resolution Automated Cameras , 2010 .

[13]  Ioannis Brilakis,et al.  Construction worker detection in video frames for initializing vision trackers , 2012 .

[14]  S. M. S. Elattar,et al.  AUTOMATION AND ROBOTICS IN CONSTRUCTION: OPPORTUNITIES AND CHALLENGES , 2008 .

[15]  Cordelia Schmid,et al.  Multi-view object class detection with a 3D geometric model , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Antonio Torralba,et al.  Sharing features: efficient boosting procedures for multiclass object detection , 2004, CVPR 2004.

[17]  Cordelia Schmid,et al.  Dataset Issues in Object Recognition , 2006, Toward Category-Level Object Recognition.

[18]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[19]  David L. Olson,et al.  Advanced Data Mining Techniques , 2008 .

[20]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[21]  Yaser Sheikh,et al.  Monocular Object Detection Using 3D Geometric Primitives , 2012, ECCV.

[22]  ZissermanAndrew,et al.  The Pascal Visual Object Classes Challenge , 2015 .

[23]  John K. Tsotsos,et al.  50 Years of object recognition: Directions forward , 2013, Comput. Vis. Image Underst..

[24]  Markus Ulrich,et al.  Performance Comparison of 2D Object Recognition Techniques , 2002 .

[25]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[26]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[28]  Bastian Leibe,et al.  Visual Object Recognition , 2011, Visual Object Recognition.

[29]  Barbara Caputo,et al.  A new kernel method for object recognition:spin glass-Markov random fields , 2004 .

[30]  Dan Roth,et al.  Learning to detect objects in images via a sparse, part-based representation , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.