Semantically Enriched Multi-level Sequential Pattern Mining for Exploring Heterogeneous Event Log Data

Photovoltaic (PV) event log data are typically underexploited, mainly because of the heterogeneity of the events. To unlock these data, we propose an explorative methodology that overcomes two main constraints: (1) the rampant variability in event labelling, and (2) the lack of a clear methodology for traversing the large number of generated event sequences. To address the former constraint, we propose to integrate heterogeneous event logs from PV plants with a semantic model of the events. However, since different manufacturers report events at different levels of granularity, and since the finest granularity is not always the right level of detail for exploitable insights, we propose to explore PV event logs with Multi-level Sequential Pattern Mining. The patterns retrieved across taxonomic levels can be used to optimize several event-related processes, e.g. by predicting PV inverter failures. The methodology is validated on real-life data from two PV plants.
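The idea of mining patterns across taxonomic levels can be illustrated with a minimal Python sketch. It is not the method of the paper: the event labels, the two-level taxonomy, the `generalize` and `frequent_pairs` helpers, and the restriction to ordered pairs of events are all assumptions made for illustration. Each raw, manufacturer-specific label is mapped to a path of concepts, every log is rewritten at the chosen level, and the support of each ordered pair "a occurs before b" is counted over the logs.

```python
from collections import Counter
from itertools import combinations

# Hypothetical taxonomy (an assumption, not taken from the paper): each raw
# label maps to a path of concepts, from most general (index 0) to most
# specific (index 1).
TAXONOMY = {
    "E031 Grid overvolt": ("GridFault", "OverVoltage"),
    "AC voltage high": ("GridFault", "OverVoltage"),
    "F12 Freq out of range": ("GridFault", "FrequencyDeviation"),
    "Fan failure": ("InverterFault", "CoolingFault"),
    "Overtemp shutdown": ("InverterFault", "OverTemperature"),
}


def generalize(sequence, level):
    """Rewrite a sequence of raw labels as concepts at one taxonomy level."""
    return [TAXONOMY[label][level] for label in sequence]


def frequent_pairs(sequences, level, min_support):
    """Return ordered pairs (a, b), with a occurring before b, whose support
    (number of sequences containing the pair) is at least min_support."""
    support = Counter()
    for sequence in sequences:
        concepts = generalize(sequence, level)
        # combinations() preserves input order, so each tuple is an ordered
        # subsequence; the set counts a pair at most once per sequence.
        support.update(set(combinations(concepts, 2)))
    return {pair: count for pair, count in support.items() if count >= min_support}


# Hypothetical event logs, one sequence per inverter, using labels from
# different (made-up) manufacturers.
logs = [
    ["E031 Grid overvolt", "Fan failure"],
    ["AC voltage high", "Fan failure"],
    ["F12 Freq out of range", "Overtemp shutdown"],
]

coarse_patterns = frequent_pairs(logs, level=0, min_support=2)
fine_patterns = frequent_pairs(logs, level=1, min_support=2)
```

With these toy logs, the pair (OverVoltage, CoolingFault) reaches support 2 at the fine level only because two differently labelled events share one concept, while the pair (GridFault, InverterFault) at the coarse level is supported by all three logs. A full multi-level miner would handle patterns of arbitrary length and mixed levels; this sketch only shows why the level of granularity changes which patterns are found.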