Predicting seismic events in coal mines based on underground sensor measurements

Abstract In this paper, we address the problem of safety monitoring in underground coal mines. In particular, we investigate and compare practical methods for the assessment of seismic hazards using analytical models constructed based on sensory data and domain knowledge. For our case study, we use a rich data set collected during a period of over five years from several active Polish coal mines. We focus on comparing the prediction quality between expert methods which serve as a standard in the coal mining industry and state-of-the-art machine learning methods for mining high-dimensional time series data. We describe an international data mining challenge organized to facilitate our study. We also demonstrate a technique which we employed to construct an ensemble of regression models able to outperform other approaches used by participants of the challenge. Finally, we explain how we utilized the data obtained during the competition for the purpose of research on the cold start problem in deploying decision support systems at new mining sites.

[1]  Eamonn J. Keogh,et al.  Scaling up dynamic time warping for datamining applications , 2000, KDD '00.

[2]  A. Gomes Correia,et al.  Bayesian methodology for updating geomechanical parameters and uncertainty quantification , 2009 .

[3]  Marc Boullé Predicting dangerous seismic events in coal mines under distribution drift , 2016, 2016 Federated Conference on Computer Science and Information Systems (FedCSIS).

[4]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[5]  A. Zagorecki Application of Sensor Fusion and Data Mining for Prediction of Methane Concentration in Coal Mines , 2015 .

[6]  Andrzej Lesniak,et al.  Space–time clustering of seismic events and hazard assessment in the Zabrze-Bielszowice coal mine, Poland , 2009 .

[7]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[8]  Marek Grzegorowski,et al.  Massively parallel feature extraction framework application in predicting dangerous seismic events , 2016, 2016 Federated Conference on Computer Science and Information Systems (FedCSIS).

[9]  Wentian Zhao,et al.  Fault diagnosis network design for vehicle on-board equipments of high-speed railway: A deep learning approach , 2016, Eng. Appl. Artif. Intell..

[10]  Dominik Slezak,et al.  Knowledge Pit - A Data Challenge Platform , 2015, CS&P.

[11]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[12]  Le Hoang Son Dealing with the new user cold-start problem in recommender systems: A comparative review , 2016, Inf. Syst..

[13]  Stanislaw Lasocki,et al.  Seismicity induced by mining: Ten years later , 2001 .

[14]  Min Han,et al.  Joint mutual information-based input variable selection for multivariate time series modeling , 2015, Eng. Appl. Artif. Intell..

[15]  Marek Gagolewski,et al.  The winning solution to the AAIA'15 data mining competition: Tagging Firefighter Activities at a Fire Scene , 2015, 2015 Federated Conference on Computer Science and Information Systems (FedCSIS).

[16]  J. Kornowski LINEAR PREDICTION OF AGGREGATED SEISMIC AND SEISMOACOUSTIC ENERGY EMITTED FROM A MINING LONGWALL , 2000 .

[17]  Dominik Slezak,et al.  Predicting Dangerous Seismic Events: AAIA'16 Data Mining Challenge , 2016, 2016 Federated Conference on Computer Science and Information Systems (FedCSIS).

[18]  Karol Kurach,et al.  Predicting dangerous seismic activity with Recurrent Neural Networks , 2016, 2016 Federated Conference on Computer Science and Information Systems (FedCSIS).

[19]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[20]  Wojciech Moczulski,et al.  Identification of dynamic diagnostic models with the use of methodology of knowledge discovery in databases , 2007, Eng. Appl. Artif. Intell..

[21]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[22]  Tiago F. S. Miranda,et al.  Rockburst laboratory tests database — Application of data mining techniques , 2015 .

[23]  Mateusz Kalisch,et al.  Application of selected classification schemes for fault diagnosis of actuator systems , 2014, 2014 Federated Conference on Computer Science and Information Systems.

[24]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[25]  Marek Sikora,et al.  Modern ICT and Mechatronic Systems in Contemporary Mining Industry , 2016, IJCRS.

[26]  Marek Sikora,et al.  Application of rule-based models for seismic hazard prediction in coal mines , 2014 .

[27]  Marek Grzegorowski,et al.  Mining Data from Coal Mines: IJCRS'15 Data Challenge , 2015, RSFDGrC.

[28]  Jean-Christophe Popieul,et al.  Multivariate analysis of human behavior data using fuzzy windowing: Example with driver-car-environment system , 2012, Eng. Appl. Artif. Intell..

[29]  Krzysztof Siminski,et al.  Rough subspace neuro-fuzzy system , 2015, Fuzzy Sets Syst..

[30]  Cristiano Hora Fontes,et al.  Pattern recognition in multivariate time series - A case study applied to fault detection in a gas turbine , 2016, Eng. Appl. Artif. Intell..

[31]  Francisco Herrera,et al.  Implementing algorithms of rough set theory and fuzzy rough set theory in the R package "RoughSets" , 2014, Inf. Sci..

[32]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[33]  Stanislaw Lasocki Probabilistic Analysis of Seismic Hazard Posed by Mining Induced Events , 2005 .

[34]  J. A. Pérez-Benitez,et al.  A system for classification of time-series data from industrial non-destructive device , 2013, Eng. Appl. Artif. Intell..

[35]  Clement T. Yu,et al.  Haar Wavelets for Efficient Similarity Search of Time-Series: With and Without Time Warping , 2003, IEEE Trans. Knowl. Data Eng..

[36]  Jan Kanty Milczek,et al.  Early warning system for seismic events in Coal Mines using machine learning , 2016, 2016 Federated Conference on Computer Science and Information Systems (FedCSIS).

[37]  Bertalan Bodri A neural-network model for earthquake occurrence , 2001 .

[38]  Erhan Kozan,et al.  A demand-responsive decision support system for coal transportation , 2012, Decis. Support Syst..

[39]  Lijuan Mu,et al.  Integrated Coal Mine Safety Monitoring System , 2012 .

[40]  Marek Grzegorowski,et al.  Governance of the Redundancy in the Feature Selection Based on Rough Sets' Reducts , 2016, IJCRS.

[41]  Józef Kabiesz,et al.  Effect of the form of data on the quality of mine tremors hazard forecasting using neural networks , 2006 .

[42]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[43]  Marcin Michalak,et al.  Outlier Detection and Elimination in Stream Data - An Experimental Approach , 2016, IJCRS.

[44]  Piotr Synak,et al.  Application of Temporal Descriptors to Musical Instrument Sound Recognition , 2003, Journal of Intelligent Information Systems.

[45]  Michal Kozielski,et al.  DISESOR - decision support system for mining industry , 2015, 2015 Federated Conference on Computer Science and Information Systems (FedCSIS).

[46]  Dominik Slezak,et al.  Computation of Approximate Reducts with Dynamically Adjusted Approximation Threshold , 2015, ISMIS.

[47]  Marek Grzegorowski,et al.  Window-Based Feature Engineering for Prediction of Methane Threats in Coal Mines , 2015, RSFDGrC.

[48]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[49]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[50]  Adam Zagorecki A versatile approach to classification of multivariate time series data , 2015, 2015 Federated Conference on Computer Science and Information Systems (FedCSIS).

[51]  Marc Boullé Tagging fireworkers activities from body sensors under distribution drift , 2015, 2015 Federated Conference on Computer Science and Information Systems (FedCSIS).

[52]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[53]  Marek Sikora,et al.  Improving prediction models applied in systems monitoring natural hazards and machinery , 2012, Int. J. Appl. Math. Comput. Sci..

[54]  Dominik Slezak,et al.  Rough Set Tools for Practical Data Exploration , 2015, RSKT.

[55]  A. T. Iannacchione,et al.  Rock Damage Characterisation From Microseismic Monitoring , 2001 .

[56]  Marek Grzegorowski,et al.  Scaling of Complex Calculations over Big Data-Sets , 2014, AMT.

[57]  Guanghui Zhao,et al.  CMEOC—An expert system in the coal mining industry , 1999 .

[58]  Dimitrios Gunopulos,et al.  Iterative Incremental Clustering of Time Series , 2004, EDBT.

[59]  Alan R. Hevner,et al.  Integrated decision support systems: A data warehousing perspective , 2007, Decis. Support Syst..

[60]  Jim Hunter,et al.  Modelling the Task of Summarising Time Series Data Using KA Techniques , 2002 .

[61]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[62]  Iftikhar U. Sikder,et al.  Application of rough set and decision tree for characterization of premonitory factors of low seismic activity , 2009, Expert Syst. Appl..

[63]  Pradipta Kishore Dash,et al.  Mining for similarities in time series data using wavelet-based feature vectors and neural networks , 2007, Eng. Appl. Artif. Intell..