Semantic Mapping of Energy Simulation Data Using Bag of Words and Graph Matching

Energy simulation is essential to urban planning and environment protection. It can be used to compare the effectiveness of different investment or construction strategies for decision makers. However, in the city level, the energy consumption simulation is difficult to implement because of the lack of datasets. In this paper, we propose an algorithm of automatic semantic mapping of energy simulation data from CityGML to data format of existing energy simulation software such as EnergyPlus. The proposed method makes use of bag of words algorithm to detect the semantic similarity between two objects in different data format and then the graph matching method is employed for overall mapping and information transformation. The experimental results indicate that the proposed method can effectively extract semantic information from CityGML for energy simulation.

[1]  Lukasz A. Kurgan,et al.  Semantic Mapping of XML Tags Using Inductive Machine Learning , 2002, ICMLA.

[2]  Francesc Serratosa,et al.  A new graph matching method for point-set correspondence using the EM algorithm and Softassign , 2012, Comput. Vis. Image Underst..

[3]  Alberto Sanfeliu,et al.  Second-Order Random Graphs For Modeling Sets Of Attributed Graphs And Their Application To Object Learning And Recognition , 2004, Int. J. Pattern Recognit. Artif. Intell..

[4]  Richard Chbeir,et al.  Building semantic trees from XML documents , 2016, J. Web Semant..

[5]  Nektarios Gioldasis,et al.  SPARQL2XQuery 2.0: Supporting Semantic-based queries over XML data , 2010, 2010 Fifth International Workshop Semantic Media Adaptation and Personalization.

[6]  Tova Milo,et al.  Using Schema Matching to Simplify Heterogeneous Data Translation , 1998, VLDB.

[7]  Lawrence B. Holder,et al.  Using a Graph-Based Data Mining System to Perform Web Search , 2003, Int. J. Pattern Recognit. Artif. Intell..

[8]  Mario Vento,et al.  Graph Matching and Learning in Pattern Recognition in the Last 10 Years , 2014, Int. J. Pattern Recognit. Artif. Intell..

[9]  Pedro M. Domingos,et al.  Reconciling schemas of disparate data sources: a machine-learning approach , 2001, SIGMOD '01.

[10]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[11]  Tong Heng Lee,et al.  Shape classification using invariant features and contextual information in the bag-of-words model , 2015, Pattern Recognit..

[12]  Sergio Greco,et al.  Semantic clustering of XML documents , 2010, TOIS.

[13]  Sören Auer,et al.  Mapping XML to OWL Ontologies , 2005, Leipziger Informatik-Tage.

[14]  Jean Ponce,et al.  A tensor-based algorithm for high-order graph matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Ramez Elmasri,et al.  XCDSearch: An XML Context-Driven Search Engine , 2010, IEEE Transactions on Knowledge and Data Engineering.

[16]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[17]  Nektarios Gioldasis,et al.  The SPARQL2XQuery interoperability framework , 2013, World Wide Web.

[18]  Rong Jin,et al.  Understanding bag-of-words model: a statistical framework , 2010, Int. J. Mach. Learn. Cybern..

[19]  M. Zaslavskiy,et al.  A Path Following Algorithm for the Graph Matching Problem , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Mohamed Bahaj,et al.  Semantic mapping at the schema level of XML documents to ontologies , 2014, 2014 International Conference on Next Generation Networks and Services (NGNS).