Comparison of Evolving Granular Classifiers applied to Anomaly Detection for Predictive Maintenance in Computing Centers

Log-based predictive maintenance of computing centers is a main concern regarding the worldwide computing grid that supports the CERN (European Organization for Nu-clear Research) physics experiments. A log, as event-oriented ad-hoc information, is quite often given as unstructured big data. Log data processing is a time-consuming computational task. The goal is to grab essential information from a continuously changeable grid environment to construct a classification model. Evolving granular classifiers are suited to learn from time-varying log streams and, therefore, perform online classification of the severity of anomalies. We formulated a 4-class online anomaly classification problem, and employed time windows between landmarks and two granular computing methods, namely, Fuzzy-set-Based evolving Modeling (FBeM) and evolving Granular Neural Network (eGNN), to model and monitor logging activity rate. The results of classification are of utmost importance for predictive maintenance because priority can be given to specific time intervals in which the classifier indicates the existence of high or medium severity anomalies.

[1]  Gleb Beliakov,et al.  Aggregation Functions: A Guide for Practitioners , 2007, Studies in Fuzziness and Soft Computing.

[2]  Mahardhika Pratama,et al.  A novel online multi-label classifier for high-speed streaming data applications , 2016, Evolving Systems.

[3]  Daniel F. Leite,et al.  Evolving fuzzy granular modeling from nonstationary fuzzy data streams , 2012, Evol. Syst..

[4]  Cheong Hee Park,et al.  An Efficient Concept Drift Detection Method for Streaming Data under Limited Labeling , 2017, IEICE Trans. Inf. Syst..

[5]  Plamen Angelov,et al.  Fully online clustering of evolving data streams into arbitrarily shaped clusters , 2017, Inf. Sci..

[6]  Daniel Leite,et al.  Real-Time Anomaly Detection in Data Centers for Log-based Predictive Maintenance using an Evolving Fuzzy-Rule-Based Approach , 2020, 2020 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[7]  Daniel Leite,et al.  Unsupervised Fuzzy eIX: Evolving Internal-eXternal Fuzzy Clustering , 2020, 2020 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS).

[8]  Alexei Klimentov,et al.  Operational Intelligence for Distributed Computing Systems for Exascale Science , 2020 .

[9]  Fernando Gomide,et al.  Optimal Rule-Based Granular Systems From Data Streams , 2020, IEEE Transactions on Fuzzy Systems.

[10]  Witold Pedrycz,et al.  Online Tool Condition Monitoring Based on Parsimonious Ensemble+ , 2017, IEEE Transactions on Cybernetics.

[11]  Eduardo Freire Nakamura,et al.  Event detection framework for wireless sensor networks considering data anomaly , 2012, 2012 IEEE Symposium on Computers and Communications (ISCC).

[12]  Plamen P. Angelov,et al.  Self-organising fuzzy logic classifier , 2018, Inf. Sci..

[13]  Plamen P. Angelov,et al.  An evolving approach to data streams clustering based on typicality and eccentricity data analytics , 2020, Inf. Sci..

[14]  Igor Škrjanc,et al.  Incremental Missing-Data Imputation for Evolving Fuzzy Granular Prediction , 2020, IEEE Transactions on Fuzzy Systems.

[15]  Vladik Kreinovich,et al.  Handbook of Granular Computing , 2008 .

[16]  Edwin Lughofer,et al.  Self-adaptive and local strategies for a smooth treatment of drifts in data streams , 2014, Evol. Syst..

[17]  W. H. Deitenbeck Introduction to statistical process control. , 1995, Healthcare facilities management series.

[18]  W. Pedrycz,et al.  An introduction to fuzzy sets : analysis and design , 1998 .

[19]  Diego Michelotto,et al.  Collection and harmonization of system logs and prototypal Analytics services with the Elastic (ELK) suite at the INFN-CNAF computing centre , 2019, Proceedings of International Symposium on Grids & Clouds 2019 — PoS(ISGC2019).

[20]  Mahardhika Pratama,et al.  Scaffolding type-2 classifier for incremental learning under concept drifts , 2016, Neurocomputing.

[21]  Daniel F. Leite,et al.  Ensemble of evolving data clouds and fuzzy models for weather time series prediction , 2018, Appl. Soft Comput..

[22]  Daniel F. Leite,et al.  Evolving granular neural networks from fuzzy data streams , 2013, Neural Networks.

[23]  Igor Skrjanc,et al.  Evolving fuzzy and neuro-fuzzy approaches in clustering, regression, identification, and classification: A Survey , 2019, Inf. Sci..

[24]  Lucas Oliveira Batista,et al.  Evolving fuzzy neural networks to aid in the construction of systems specialists in cyber attacks , 2019, J. Intell. Fuzzy Syst..

[25]  Hossein Javedani Sadaei,et al.  Probabilistic Forecasting With Fuzzy Time Series , 2020, IEEE Transactions on Fuzzy Systems.

[26]  Daniel F. Leite,et al.  Granular Approach for Evolving System Modeling , 2010, IPMU.

[27]  Luiz Alberto Queiroz Cordovil,et al.  Uncertain Data Modeling Based on Evolving Ellipsoidal Fuzzy Information Granules , 2020, IEEE Transactions on Fuzzy Systems.

[28]  Nan Liu,et al.  Ensemble of subset online sequential extreme learning machine for class imbalance and concept drift , 2015, Neurocomputing.

[29]  Andrea Ceccanti,et al.  Towards Predictive Maintenance with Machine Learning at the INFN-CNAF computing centre , 2019 .