Ontology-based automated information extraction from building energy conservation codes

Abstract An ontology-based information extraction algorithm for automatically extracting energy requirements from energy conservation codes is proposed. The proposed algorithm aims to support fully-automated energy compliance checking in the construction domain by allowing automated extraction of the requirements from the codes instead of the status quo which relies on manual extraction of requirements from codes and manual formalization of those requirements in a computer-processable format. Automated information extraction from energy conservation codes, compared to other building codes, is a far complex task because many code provisions are long, hierarchically-complex, and with exceptions. A combination of text classification methods, domain-specific preprocessing techniques, ontology-based pattern-matching extraction techniques, sequential dependency-based extraction methods, and cascaded extraction methods is proposed to deal with such complexity in extraction. The proposed algorithm was tested in extracting energy requirements from Chapter 4 of the 2012 International Energy Conservation Code, and the results showed 97.4% recall and 98.5% precision.

[1]  Eilif Hjelseth,et al.  EXPLORING SEMANTIC BASED MODEL CHECKING , 2010 .

[2]  Jimmie Hinze,et al.  Integration of Safety in Design through the Use of Building Information Modeling , 2011 .

[3]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[4]  Jochen Teizer,et al.  A case study on automated safety compliance checking to assist fall protection design and planning in building information models , 2013 .

[5]  Nora El-Gohary,et al.  Semantic NLP-Based Information Extraction from Construction Regulatory Documents for Automated Compliance Checking , 2016, J. Comput. Civ. Eng..

[6]  Nora El-Gohary,et al.  Automated Compliance Checking of Construction Operation Plans Using a Deontology for the Construction Domain , 2013, J. Comput. Civ. Eng..

[7]  Vangelis Karkaletsis,et al.  Ontology Based Information Extraction from Text , 2011, Knowledge-Driven Multimedia Information Extraction and Ontology Evolution.

[8]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[9]  C Eastman,et al.  A Knowledge Representation Approach to Capturing BIM Based Rule Checking Requirements Using Conceptual Graph , 2015 .

[10]  Halil Kilicoglu,et al.  Constructing a semantic predication gold standard from the biomedical literature , 2011, BMC Bioinformatics.

[11]  Nora El-Gohary,et al.  Automated Information Transformation for Automated Regulatory Compliance Checking in Construction , 2015, J. Comput. Civ. Eng..

[12]  João Pedro Poças Martins,et al.  LicA: A BIM based automated code-checking application for water distribution systems , 2013 .

[13]  Arnold Robbins,et al.  Learning the vi and Vim Editors , 2008 .

[14]  Amin Hammad,et al.  Automated Code Compliance Checking for Building Envelope Design , 2010, J. Comput. Civ. Eng..

[15]  Tong Shu Li,et al.  Exposing ambiguities in a relation-extraction gold standard with crowdsourcing , 2015, ArXiv.

[16]  Fabio Ciravegna,et al.  Evaluating machine learning for information extraction , 2005, ICML.

[17]  Nawari O. Nawari Automating Codes Conformance , 2012 .

[18]  Yacine Rezgui,et al.  A rule-based semantic approach for automated regulatory compliance in the construction sector , 2015, Expert Syst. Appl..

[19]  Charles M. Eastman,et al.  Classification of rules for automated BIM rule checking development , 2015 .

[20]  Karthik Ramani,et al.  Ontology-based design information extraction and retrieval , 2007, Artificial Intelligence for Engineering Design, Analysis and Manufacturing.

[21]  Nora El-Gohary,et al.  Ontology-Based Multilabel Text Classification of Construction Regulatory Documents , 2016, J. Comput. Civ. Eng..

[22]  Yousef Abuzir,et al.  Constructing the Civil Engineering Thesaurus (CET) Using ThesWB , 2002 .

[23]  Ergin Soysal,et al.  Design and evaluation of an ontology based information extraction system for radiological reports , 2010, Comput. Biol. Medicine.

[24]  Lieyun Ding,et al.  Ontology-based semantic modeling of regulation constraint for automated construction quality compliance checking , 2012 .

[25]  Antonio Moreno,et al.  Ontology-based information extraction of regulatory networks from scientific articles with case studies for Escherichia coli , 2013, Expert Syst. Appl..

[26]  Li Jiang,et al.  Automated Rule-Based Constructability Checking: Case Study of Formwork , 2015 .

[27]  Amr Kandil,et al.  Concept Relation Extraction from Construction Documents Using Natural Language Processing , 2010 .

[28]  Peter E.D. Love,et al.  Development of an object model for automated compliance checking , 2015 .

[29]  Dejing Dou,et al.  Ontology-based information extraction: An introduction and a survey of current approaches , 2010, J. Inf. Sci..

[30]  Marie-Francine Moens,et al.  Information Extraction: Algorithms and Prospects in a Retrieval Context , 2006, The Information Retrieval Series.

[31]  Amélie Marian,et al.  URSA - User Review Structure Analysis: Understanding Online Reviewing Trends , 2010 .

[32]  Judith W. Dexheimer,et al.  Natural Language Processing – The Basics , 2012 .

[33]  Yusuke Miyao,et al.  Annotation of Computer Science Papers for Semantic Relation Extrac-tion , 2014, LREC.

[34]  Nora El-Gohary,et al.  Domain Ontology for Processes in Infrastructure and Construction , 2010 .

[35]  Jakub Piskorski,et al.  Information Extraction: Past, Present and Future , 2013, Multi-source, Multilingual Information Extraction and Summarization.

[36]  Charles M. Eastman,et al.  Automatic rule-based checking of building designs , 2009 .

[37]  Diana Maynard,et al.  Metrics for Evaluation of Ontology-based Information Extraction , 2006, EON@WWW.

[38]  Tuomo Kakkonen,et al.  Ontology-Based Information and Event Extraction for Business Intelligence , 2012, AIMSA.

[39]  Junho Choi,et al.  Development of BIM-based evacuation regulation checking system for high-rise and complex buildings , 2014 .

[40]  Robert Amor,et al.  Regulatory Knowledge Encoding Guidelines for Automated Compliance Audit of Building Engineering Design , 2014 .

[41]  Zhipeng Zhou,et al.  Overview and Analysis of Ontology Studies Supporting Development of the Construction Industry , 2016, J. Comput. Civ. Eng..

[42]  Omar F. El-Gayar,et al.  An Ontology-Based Information Extraction (OBIE) Framework for Analyzing Initial Public Offering (IPO) Prospectus , 2014, 2014 47th Hawaii International Conference on System Sciences.