Analytics over Multi-sensor Time Series Data – A Case-Study on Prediction of Mining Hazards

Mining of high-dimensional time series data that represent readings of multiple sensors is a challenging task. We focus on several important aspects of analytics over such data. We describe a methodology for extracting informative features from multidimensional data streams, as well as algorithms for finding compact representations of such data, in order to facilitate the construction of prediction models. We pay special attention to designing new approaches to dimensionality reduction and interchangeability of features that such representations comprise of. We validate our algorithms on data sets obtained from coal mines and we demonstrate how their results can be applied for a construction of a decision support system. We show that such system is efficient and that its outcomes can be easily interpreted by subject matter experts.

[1]  Dominik Slezak,et al.  Rough Set Methods for Attribute Clustering and Selection , 2014, Appl. Artif. Intell..

[2]  Marek Sikora,et al.  Modern ICT and Mechatronic Systems in Contemporary Mining Industry , 2016, IJCRS.

[3]  John L. Ellenberger,et al.  Coal Mine Seismicity And Bumps: Historical Case Studies And Current Field Activity , 1900 .

[4]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[5]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Jan Komorowski,et al.  Random Reducts: A Monte Carlo Rough Set-based Method for Feature Selection in Large Datasets , 2013, Fundam. Informaticae.

[7]  Tiago F. S. Miranda,et al.  Rockburst laboratory tests database — Application of data mining techniques , 2015 .

[8]  Marcin Michalak,et al.  Predicting seismic events in coal mines based on underground sensor measurements , 2017, Eng. Appl. Artif. Intell..

[9]  Rodney X. Sturdivant,et al.  Applied Logistic Regression: Hosmer/Applied Logistic Regression , 2005 .

[10]  Jakub Wroblewski,et al.  Ensembles of Classifiers Based on Approximate Reducts , 2001, Fundam. Informaticae.

[11]  Andrzej Skowron,et al.  Rough sets and Boolean reasoning , 2007, Inf. Sci..

[12]  Dominik Slezak,et al.  A framework for learning and embedding multi-sensor forecasting models into a decision support system: A case study of methane concentration in coal mines , 2018, Inf. Sci..

[13]  Dominik Slezak,et al.  Computation of Approximate Reducts with Dynamically Adjusted Approximation Threshold , 2015, ISMIS.

[14]  Dominik Slezak,et al.  Utilizing Hybrid Information Sources to Learn Representations of Cards in Collectible Card Video Games , 2018, 2018 IEEE International Conference on Data Mining Workshops (ICDMW).

[15]  Francisco Herrera,et al.  Implementing algorithms of rough set theory and fuzzy rough set theory in the R package "RoughSets" , 2014, Inf. Sci..