An efficient binary storage format for IFC building models using HDF5 hierarchical data format

Abstract The Industry Foundation Classes (IFC) are a prevalent data model in which Building Information Models can be exchanged, typically with a file-based nature. Processing the full extent of these models can be time-consuming. Considering the multi-disciplinary nature of the construction industry, stakeholders will typically only be interested in a small subset, depending on the purpose of the exchange. Therefore, the retrieval of relevant subsets, whether spatially, based on discipline, or others, is necessary to effectively consume such datasets in downstream applications. Prevalent encoding forms of IFC models are text-based and do not facilitate random-access seeking within the file and do not impose an ordering on the definition of elements within the file. Therefore, typically, the entire file needs to be read in order to find the data of interest. Furthermore, text-based data is slower to parse in comparison to binary data. This paper assesses a binary serialization format originating from the family of EXPRESS standards. It is based on an existing open, binary, hierarchical data format called HDF5 that allows random access to specific instances and therefore efficient retrieval of relevant subsets. The block-level, transparent compression yields a reduction of file sizes as compared to traditional serializations. Fully specified datatypes embedded in the exchange guarantee interoperable use. In this paper, several serialization profiles are introduced that cater to specific use cases by governing storage settings. Advanced functionality from the HDF5 library is applied to offer novel paradigms for fine-grained access rights, varying level of detail, revision management and aggregation of aspect models.

[1]  D. C. Price,et al.  HDFITS: Porting the FITS data model to HDF5 , 2015, Astron. Comput..

[2]  Charles M. Eastman,et al.  Modularized rule-based validation of a BIM model pertaining to model views , 2016 .

[3]  Thierry Bertin-Mahieux,et al.  The Million Song Dataset , 2011, ISMIR.

[4]  Tf Thomas Krijnen,et al.  Assessing implicit knowledge in BIM models with machine learning , 2015 .

[5]  Gerd Heber,et al.  An overview of the HDF5 technology suite and its applications , 2011, AD '11.

[6]  Erez Zadok,et al.  Unifying biological image formats with HDF5 , 2009, CACM.

[7]  Burcu Akinci,et al.  Requirements and Evaluation of Standards for Integration of Sensor Data with Building Information Models , 2009 .

[8]  Thomas Krijnen,et al.  An IFC schema extension and binary serialization format to efficiently integrate point cloud data into building models , 2017, Adv. Eng. Informatics.

[9]  Jakob Beetz,et al.  BIMQL - An open query language for building information models , 2013, Adv. Eng. Informatics.

[10]  Mike Folk,et al.  Balancing performance and preservation lessons learned with HDF5 , 2010, US-DPIF '10.

[11]  Tf Thomas Krijnen,et al.  A SPARQL query engine for binary-formatted IFC building models , 2018, Automation in Construction.

[12]  Kereshmeh Afsari,et al.  JavaScript Object Notation (JSON) data serialization for IFC schema in web-based BIM data exchange , 2017 .

[13]  D. Huffman A Method for the Construction of Minimum-Redundancy Codes , 1952 .

[14]  Wojciech Rytter,et al.  Text Algorithms , 1994 .

[15]  Jakob Beetz,et al.  Collaborative engineering with IFC : new insights and technology , 2012 .

[16]  Filip Biljecki,et al.  Towards an integration of GIS and BIM data : what are the geometric and topological issues? , 2017 .

[17]  David J. DeWitt,et al.  Scientific data management in the coming decade , 2005, SGMD.

[18]  Abraham Lempel,et al.  Compression of individual sequences via variable-rate coding , 1978, IEEE Trans. Inf. Theory.

[19]  Charles M. Eastman,et al.  Validations for ensuring the interoperability of data exchange of a building information model , 2015 .

[20]  Pieter Pauwels,et al.  EXPRESS to OWL for construction industry: Towards a recommendable and usable ifcOWL ontology , 2016 .

[21]  Seppo Törmä,et al.  Semantic Linking of Building Information Models , 2013, 2013 IEEE Seventh International Conference on Semantic Computing.

[22]  Pieter Pauwels,et al.  Enhancing the ifcOWL ontology with an alternative representation for geometric data , 2017 .

[23]  Raoul Wessel,et al.  Enrichment and Preservation of Architectural Knowledge , 2016, 3D Research Challenges in Cultural Heritage.

[24]  André Borrmann,et al.  Processing of Topological BIM Queries using Boundary Representation Based Methods , 2014, Adv. Eng. Informatics.

[25]  P Katranuschkov,et al.  GENERALISED MODEL SUBSET DEFINITION SCHEMA , 2003 .

[26]  Thomas Krijnen,et al.  Working with Open BIM Standards to Source Legal Spaces for a 3D Cadastre , 2017, ISPRS Int. J. Geo Inf..