Using Classification Techniques for Assigning Work Descriptions to Task Groups on the Basis of Construction Vocabulary

Construction project management produces a huge amount of documents in a variety of formats. The efficient use of the data contained in these documents is crucial to enhance control and to improve performance. A central pillar throughout the project life cycle is the Bill of Quantities (BoQ) document. It provides economic information and details a collection of work descriptions describing the nature of the different works needed to be done to achieve the project goal. In this work, we focus on the problem of automatically classifying such work descriptions into a predefined task organization hierarchy, so that it can be possible to store them in a common data repository. We describe a methodology for preprocessing the text associated to work descriptions to build training and test data sets and carry out a complete experimentation with several well-known machine learning

[1]  Nicolás Marín,et al.  An Approach for the Automatic Classification of Work Descriptions in Construction Projects , 2015, Comput. Aided Civ. Infrastructure Eng..

[2]  John M. Kamara,et al.  A framework for using mobile computing for information management on construction sites , 2011 .

[3]  Hojjat Adeli,et al.  Neural Networks in Civil Engineering: 1989–2000 , 2001 .

[4]  Jung-Ho Yu,et al.  BIM and ontology-based approach for building cost estimation , 2014 .

[5]  Carlos H. Caldas,et al.  Automating hierarchical document classification for construction management information systems , 2003 .

[6]  Ioannis Brilakis Content Based Integration of Construction Site Images in Architecture, Engineering, Construction, and Facilities Management (AEC/FM) Model based Systems , 2009 .

[7]  Abid Nadeem,et al.  Bill of Quantities with 3D Views Using Building Information Modeling , 2015 .

[8]  Vladimir Vapnik,et al.  Support-vector networks , 2004, Machine Learning.

[9]  SangUk Han,et al.  A Machine-Learning Classification Approach to Automatic Detection of Workers' Actions for Behavior-Based Safety Analysis , 2012 .

[10]  Nicolás Marín,et al.  An intelligent system for the acquisition and management of information from bill of quantities in building projects , 2016, Expert Syst. Appl..

[11]  Nicolás Marín,et al.  The Role of Information Technologies to Address Data Handling in Construction Project Management , 2016, J. Comput. Civ. Eng..

[12]  Saeed Karshenas,et al.  Integrating Distributed Sources of Information for Construction Cost Estimating using Semantic Web and Semantic Web Service technologies , 2015 .

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Asim Karim,et al.  CONSCOM: An OO Construction Scheduling and Change Management System , 1999 .

[15]  Wen Yi,et al.  Mixed‐Integer Linear Programming on Work‐Rest Schedule Design for Construction Sites in Hot Weather , 2017, Comput. Aided Civ. Infrastructure Eng..

[16]  Lukumon O. Oyedele,et al.  Big Data in the construction industry: A review of present status, opportunities, and future trends , 2016, Adv. Eng. Informatics.

[17]  Jia-Rui Lin,et al.  A Natural‐Language‐Based Approach to Intelligent Data Retrieval and Representation for Cloud BIM , 2016, Comput. Aided Civ. Infrastructure Eng..

[18]  Nuria Yagües Pérez Víctor del Moral, Consejero de Fomento, Vivienda, Ordenación del Territorio y Turismo de la Comunidad Autónoma de Extremadura: "siento que Extremadura sigue siendo la gran desconocida" , 2011 .

[19]  Asim Karim,et al.  Construction scheduling, cost optimization, and management : a new model based on neurocomputing and object technologies , 2001 .

[20]  John Weissmann,et al.  What a machine , 2004 .

[21]  Hojjat Adeli,et al.  Pseudospectra, MUSIC, and dynamic wavelet neural network for damage detection of highrise buildings , 2007 .

[22]  Nora El-Gohary,et al.  Domain Ontology for Processes in Infrastructure and Construction , 2010 .

[23]  Shih-Hsu Wang,et al.  Neuro‐Fuzzy Cost Estimation Model Enhanced by Fast Messy Genetic Algorithms for Semiconductor Hookup Construction , 2012, Comput. Aided Civ. Infrastructure Eng..

[24]  Abdelrahman Osman Elfaki,et al.  Using Intelligent Techniques in Construction Project Cost Estimation: 10-Year Survey , 2014 .

[25]  Wen Yi,et al.  Multi-Objective Mathematical Programming Approach to Construction Laborer Assignment with Equity Consideration , 2016, Comput. Aided Civ. Infrastructure Eng..

[26]  N. Altman An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .

[27]  Ronen Feldman,et al.  The Text Mining Handbook by Ronen Feldman , 2006 .

[28]  João Pedro Poças Martins,et al.  A survey on modeling guidelines for quantity takeoff-oriented BIM-based design , 2013 .

[29]  Mingchao Li,et al.  A multidimensional information model for managing construction information , 2015 .

[30]  Zhiliang Ma,et al.  Formalized Representation of Specifications for Construction Cost Estimation by Using Ontology , 2016, Comput. Aided Civ. Infrastructure Eng..

[31]  Hojjat Adeli,et al.  Regularization neural network for construction cost estimation , 1998 .

[32]  Laura Florez,et al.  Crew Allocation System for the Masonry Industry , 2017, Comput. Aided Civ. Infrastructure Eng..

[33]  Asim Karim,et al.  Construction Scheduling, Cost Optimization and Management , 2001 .

[34]  Georgios Dounias,et al.  Evolutionary computation for resource leveling optimization in project management , 2016, Integr. Comput. Aided Eng..

[35]  David Arditi,et al.  An Advanced Stochastic Time‐Cost Tradeoff Analysis Based on a CPM‐Guided Genetic Algorithm , 2015, Comput. Aided Civ. Infrastructure Eng..

[36]  Samuel Labi,et al.  Predicting Cost Escalation Pathways and Deviation Severities of Infrastructure Projects Using Risk‐Based Econometric Models and Monte Carlo Simulation , 2017, Comput. Aided Civ. Infrastructure Eng..

[37]  Carlos H. Caldas,et al.  Management and analysis of unstructured construction data types , 2008, Adv. Eng. Informatics.

[38]  Ronen Feldman,et al.  Book Reviews: The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data by Ronen Feldman and James Sanger , 2008, CL.

[39]  Burcu Akinci,et al.  Workflow-Based Construction Research Data Management and Dissemination , 2014 .

[40]  Claudio Mourgues,et al.  Modeling Virtual Design and Construction Implementation Strategies Considering Lean Management Impacts , 2017, Comput. Aided Civ. Infrastructure Eng..

[41]  Charles M. Eastman,et al.  A Comparison of Construction Classification Systems Used for Classifying Building Product Models , 2016 .

[42]  Eugenio Pellicer,et al.  A Parallel Branch and Bound Algorithm for the Resource Leveling Problem with Minimal Lags , 2017, Comput. Aided Civ. Infrastructure Eng..

[43]  Amr Kandil,et al.  Document Discourse for Managing Construction Project Documents , 2013, J. Comput. Civ. Eng..

[44]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.