Semantic Heterogeneity Reduction for Big Data in Industrial Automation

The large amounts of diverse data collected in industrial au- tomation domain, such as sensor measurements together with informa- tion in MES/ERP 1 systems need special handling that was not possible in past. The Big Data technologies contribute a lot to the possibility of analyzing such amounts of data. However, we need to handle not only data volume, which is usually the major focus of Big Data research, but we also need to focus on variety of data. In this paper, we primarily focus on variety of industrial automation data and present and discuss a possible approach of handling the semantic heterogeneity of them. We show the process of heterogeneity reduction that exploits Semantic Web technologies. The steps include construction of upper ontology describ- ing all data sources, transformation of data according to this ontology and finally the analysis with the help of Big Data paradigm. The pro- posed approach is demonstrated on data measured by sensors in a passive house.

[1]  Alon Y. Halevy,et al.  Enterprise information integration: successes, challenges and controversies , 2005, SIGMOD '05.

[2]  Ian Horrocks,et al.  Optique: OBDA Solution for Big Data , 2013, ESWC.

[3]  Melnned M. Kantardzic Big Data Analytics , 2013, Lecture Notes in Computer Science.

[4]  Jérôme Euzenat,et al.  Ontology matching , 2007 .

[5]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[6]  Hamish Cunningham,et al.  GATE-a General Architecture for Text Engineering , 1996, COLING.

[7]  Jan Smid,et al.  Ontology Design with Formal Concept Analysis , 2004, CLA.

[8]  Ryutaro Ichise,et al.  Discovering Relationships Among Catalogs , 2004, Discovery Science.

[9]  Ryutaro Ichise,et al.  MAPSOM: User Involvement in Ontology Matching , 2013, JIST.

[10]  Marek Obitko,et al.  Big data analysis for sensor time-series in automation , 2014, Proceedings of the 2014 IEEE Emerging Technology and Factory Automation (ETFA).

[11]  Petr Novák,et al.  Design and verification of simulation models of passive houses , 2012, Proceedings of 2012 IEEE 17th International Conference on Emerging Technologies & Factory Automation (ETFA 2012).

[12]  Stuart E. Madnick,et al.  Representing and reasoning about semantic conflicts in heterogeneous information systems , 1997 .

[13]  Ioannis Konstantinou,et al.  H2RDF: adaptive query processing on RDF data in the cloud. , 2012, WWW.