Machine learning and BIM visualization for maintenance issue classification and enhanced data collection

Abstract Occupant-generated work orders are recognized as a good potential data to support Facility Management (FM) activities, however they are unstructured and rarely contain the specific information engineers require to resolve the reported issues. Instead, this often requires multiple trips are often needed to identify the required trade, identify the problem and required parts/tools, and resolve. A key challenge is data quality: free-form (unstructured) text is collected that frequently lacks necessary detail for problem diagnosis. Machine Learning provides new opportunities within the FM domain to improve the quality of information collected through online work order reporting systems by automatically classifying WOs and prompting building occupants with appropriate FM team-developed questions in real time to gather the required specific information in structured form. This paper presents the development, comparison, and application of two sets of supervised machine learning models to perform this classification for WOs generated from occupant complaints. A set of ∼150,000 historical WOs was used for model development and textual classification using with various term and itemset frequency approaches was tested. Classifier prediction accuracies ranged from 46.6% to 81.3% for classification by detailed subcategory; this increased to between 68% (simple term frequency) to 90% (random forest) when the dataset only included the ten most common (accounting for 70% of all WOs) subcategories. Hierarchical classification decreased performance. An FM-BIM integration approach is finally presented using the resultant classifiers to provide facilities management teams with spatio-temporal visualization of the work order categories across a series of buildings to help prioritize and streamline operations and maintenance task assignments.

[1]  Frank Schultmann,et al.  Building Information Modeling (BIM) for existing buildings — Literature review and future needs , 2014 .

[2]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[3]  Fan Min,et al.  Three-way recommender systems based on random forests , 2016, Knowl. Based Syst..

[4]  Melinda Hodkiewicz,et al.  Cleaning historical maintenance work order data for reliability analysis , 2016 .

[5]  Christian Borgelt,et al.  Induction of Association Rules: Apriori Implementation , 2002, COMPSTAT.

[6]  Mithra Moezzi,et al.  Linking occupant complaints to building performance , 2013 .

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  László Györfi,et al.  A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[9]  David Bryde,et al.  The project benefits of Building Information Modelling (BIM) , 2013 .

[10]  Zhenmin Li,et al.  PR-Miner: automatically extracting implicit programming rules and detecting violations in large software code , 2005, ESEC/FSE-13.

[11]  H. B. Barlow,et al.  Unsupervised Learning , 1989, Neural Computation.

[12]  MengChu Zhou,et al.  Bilevel Feature Extraction-Based Text Mining for Fault Diagnosis of Railway Systems , 2017, IEEE Transactions on Intelligent Transportation Systems.

[13]  J. J. McArthur,et al.  Optimizing BIM Metadata Manipulation Using Parametric Tools , 2016 .

[14]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[15]  Tianzhen Hong,et al.  Occupancy schedules learning process through a data mining framework , 2015 .

[16]  Lin Ma,et al.  Extracting failure time data from industrial maintenance records using text mining , 2017, Adv. Eng. Informatics.

[17]  Arthur Zimek,et al.  A Study of Hierarchical and Flat Classification of Proteins , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[18]  George Karypis,et al.  Hierarchical Clustering Algorithms for Document Datasets , 2005, Data Mining and Knowledge Discovery.

[19]  Curt Burgess,et al.  From simple associations to the building blocks of language: Modeling meaning in memory with the HAL model , 1998 .

[20]  Robert Eadie,et al.  BIM implementation throughout the UK construction project lifecycle: An analysis , 2013 .

[21]  Charu C. Aggarwal,et al.  A Survey of Text Classification Algorithms , 2012, Mining Text Data.

[22]  Esin Ergen,et al.  BIM for building refurbishment and maintenance: current status and research directions , 2015 .

[23]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[24]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[25]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[26]  María N. Moreno García,et al.  Making use of associative classifiers in order to alleviate typical drawbacks in recommender systems , 2012, Expert Syst. Appl..

[27]  Amin Hammad,et al.  Knowledge-assisted BIM-based visual analytics for failure root cause detection in facilities management , 2014 .

[28]  Yuan-Chun Jiang,et al.  Maximizing customer satisfaction through an online recommendation system: A novel associative classification model , 2010, Decis. Support Syst..