BIM log mining: Learning and predicting design commands

Abstract This paper develops a framework to learn and predict design commands based upon building information modeling (BIM) event log data stored in Autodesk Revit journal files, which has the potential to improve the modeling efficiency. BIM design logs, which automatically keep detailed records on the modeling process, are the basis of data acquisition and data mining. Long Short-Term Memory Neural Network (LSTM NN), as a probabilistic deep learning model for learning sequential data with varying lengths from logs, is established to provide designers with predictions about the possible design command class in the next step. To demonstrate the feasibility of this method, a case study runs at large design logs over 4 GB from an international design firm for command class prediction. To begin with, useful data retrieved from logs is cleaned and saved in a 320 MB Comma Separated Values (CSV) file with totally 352,056 lines of commands over 289 projects. Subsequently, various design commands are categorized into 14 classes according to their effects and given numerical labels, which are then fed into LSTM NN for training and testing. As a result, the overall accuracy of this particular case study can reach 70.5% in the test set, which outperforms some classical machine learning methods, like k nearest neighbor, random forest and support vector machine. This research contributes to applying a probabilistic LSTM NN with optimal parameters to learn features from designers' subjective behaviors effectively and predict the next possible design command class intelligently towards automation of the design process. Moreover, the three most possible command classes will be offered as the recommendations under the assumption that the correct class tends to appear owning the top three highest probabilities, which can possibly enhance the reliability of predictions.

[1]  Baabak Ashuri,et al.  BIM Log Mining: Measuring Design Productivity , 2018, J. Comput. Civ. Eng..

[2]  Kamal Suprabhas,et al.  Integration of BIM and Utility Sensor Data for Facilities Management , 2017 .

[3]  Rongxing Duan,et al.  Reliability evaluation for complex systems based on interval-valued triangular fuzzy weighted mean and evidence network , 2018 .

[4]  Jeff Haberl,et al.  Building Information Modeling (BIM)-based daylighting simulation and analysis , 2014 .

[5]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[6]  Bidisha Roy,et al.  Dynamic Recommendation System Using Web Usage Mining for E-commerce Users , 2015 .

[7]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[8]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[9]  Mathias Géry,et al.  Evaluation of web usage mining approaches for user's next request prediction , 2003, WIDM '03.

[10]  Hyun Kang The prevention and handling of the missing data , 2013, Korean journal of anesthesiology.

[11]  Pieter Pauwels,et al.  In Search of Sustainable Design Patterns: Combining Data Mining and Semantic Data Modelling on Disparate Building Data , 2018, Advances in Informatics and Computing in Civil and Construction Engineering.

[12]  Aitor Almeida,et al.  Embedding-Level Attention and Multi-Scale Convolutional Neural Networks for Behaviour Modelling , 2018, 2018 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI).

[13]  Limao Zhang,et al.  Improved Fuzzy Bayesian Network-Based Risk Analysis With Interval-Valued Fuzzy Sets and D–S Evidence Theory , 2020, IEEE Transactions on Fuzzy Systems.

[14]  Peter C. Y. Chen,et al.  LSTM network: a deep learning approach for short-term traffic forecast , 2017 .

[15]  Sepp Hochreiter,et al.  The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[16]  Álvar Arnaiz-González,et al.  MR-DIS: democratic instance selection for big data by MapReduce , 2017, Progress in Artificial Intelligence.

[17]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Aitor Almeida,et al.  Predicting Human Behaviour with Recurrent Neural Networks , 2018 .

[19]  J. Gonzalez-Dominguez,et al.  Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks , 2016, PloS one.

[20]  Issa M. Khalil,et al.  Prediction of User's Web-Browsing Behavior: Application of Markov Model , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[21]  Xiangyu Wang,et al.  BIM-Enabled Structural Design: Impacts and Future Developments in Structural Modelling, Analysis and Optimisation Processes , 2015 .

[22]  Małgorzata Wesoły,et al.  Comparison of various data analysis techniques applied for the classification of pharmaceutical samples by electronic tongue , 2018, Sensors and Actuators B: Chemical.

[23]  Sham Tickoo,et al.  Autodesk Revit Architecture 2014 for Architects and Designers , 2011 .

[24]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[25]  Peter E.D. Love,et al.  Design Error Costs in Construction Projects , 2012 .

[26]  Dejing Dou,et al.  A deep learning approach for human behavior prediction with explanations in health social networks: social restricted Boltzmann machine (SRBM+) , 2016, Social Network Analysis and Mining.

[27]  Sanyam Shukla,et al.  Analysis of k-Fold Cross-Validation over Hold-Out Validation on Colossal Datasets for Quality Classification , 2016, 2016 IEEE 6th International Conference on Advanced Computing (IACC).

[28]  Dong Zhao,et al.  Simultaneous Data Exchange between BIM and VR for Collaborative Decision Making , 2017 .

[29]  V. S. Subrahmanian,et al.  Predicting human behavior: The next frontiers , 2017, Science.

[30]  Yong Du,et al.  Hierarchical recurrent neural network for skeleton based action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Xin Yao,et al.  A Systematic Study of Online Class Imbalance Learning With Concept Drift , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[32]  Takeshi Nishida,et al.  Robot Path Planning by LSTM Network Under Changing Environment , 2018, Advances in Intelligent Systems and Computing.

[33]  Young-Jin Kim,et al.  An architecture for emergency event prediction using LSTM recurrent neural networks , 2018, Expert Syst. Appl..

[34]  Jitendra Kumar,et al.  Long Short Term Memory Recurrent Neural Network (LSTM-RNN) Based Workload Forecasting Model For Cloud Datacenters , 2018 .

[35]  I-Chen Wu,et al.  A BIM-based monitoring system for urban deep excavation projects , 2015 .

[36]  Limao Zhang,et al.  BIM log mining: Exploring design productivity characteristics , 2020 .

[37]  Lior Rokach,et al.  Choosing the right word: Using bidirectional LSTM tagger for writing support systems , 2019, Eng. Appl. Artif. Intell..

[38]  Sergey Zotkin,et al.  The extraction and processing of BIM data , 2018 .

[39]  Li Feng,et al.  Deep Learning for Fall Detection: Three-Dimensional CNN Combined With LSTM on Video Kinematic Data , 2019, IEEE Journal of Biomedical and Health Informatics.

[40]  Razvan Pascanu,et al.  Advances in optimizing recurrent networks , 2012, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[41]  Pan Zhao,et al.  A Scenario-Adaptive Driving Behavior Prediction Approach to Urban Autonomous Driving , 2017 .

[42]  Zhao Yang Dong,et al.  Research on Unstructured Text Data Mining and Fault Classification Based on RNN-LSTM with Malfunction Inspection Report , 2017 .

[43]  Yunpeng Wang,et al.  Long short-term memory neural network for traffic speed prediction using remote microwave sensor data , 2015 .

[44]  Edmundas Kazimieras Zavadskas,et al.  Multiple criteria decision-making techniques in transportation systems: a systematic review of the state of the art literature , 2015 .

[45]  B. Ashuri,et al.  BIM log mining: Discovering social networks , 2018, Automation in Construction.

[46]  Najah AbuAli,et al.  Driver Behavior Modeling: Developments and Future Directions , 2016 .

[47]  Juan José Rodríguez Diez,et al.  Local sets for multi-label instance selection , 2018, Appl. Soft Comput..

[48]  Jürgen Schmidhuber,et al.  LSTM: A Search Space Odyssey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[49]  Rick Kazman,et al.  Mining Big Data for Detecting, Extracting and Recommending Architectural Design Concepts , 2015, 2015 IEEE/ACM 1st International Workshop on Big Data Software Engineering.

[50]  João Pedro Poças Martins,et al.  A survey on modeling guidelines for quantity takeoff-oriented BIM-based design , 2013 .

[51]  Alaa El. Sagheer,et al.  Time series forecasting of petroleum production using deep LSTM recurrent networks , 2019, Neurocomputing.

[52]  Robert Eadie,et al.  BIM implementation throughout the UK construction project lifecycle: An analysis , 2013 .

[53]  Daniel Castro-Lacouture,et al.  Mining implicit 3D modeling patterns from unstructured temporal BIM log text data , 2017 .

[54]  Feniosky Peña-Mora,et al.  Understanding and managing iterative error and change cycles in construction , 2007 .

[55]  Reda Alhajj,et al.  Effective web log mining and online navigational pattern prediction , 2013, Knowl. Based Syst..

[56]  Saleh Mesbah Elkaffas,et al.  A Framework for Collaborative Human–Computer Interaction E-learning , 2016 .