相关论文

Dynamic syslog mining for network failure monitoring

Abstract:Syslog monitoring technologies have recently received vast attentions in the areas of network management and network monitoring. They are used to address a wide range of important issues including network failure symptom detection and event correlation discovery. Syslogs are intrinsically dynamic in the sense that they form a time series and that their behavior may change over time. This paper proposes a new methodology of dynamic syslog mining in order to detect failure symptoms with higher confidence and to discover sequential alarm patterns among computer devices. The key ideas of dynamic syslog mining are 1) to represent syslog behavior using a mixture of Hidden Markov Models, 2) to adaptively learn the model using an on-line discounting learning algorithm in combination with dynamic selection of the optimal number of mixture components, and 3) to give anomaly scores using universal test statistics with a dynamically optimized threshold. Using real syslog data we demonstrate the validity of our methodology in the scenarios of failure symptom detection, emerging pattern identification, and correlation discovery.

参考文献

[1]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[2]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[3]  Abraham Lempel,et al.  Compression of individual sequences via variable-rate coding , 1978, IEEE Trans. Inf. Theory.

[4]  Raphail E. Krichevsky,et al.  The performance of universal encoding , 1981, IEEE Trans. Inf. Theory.

[5]  Jorma Rissanen,et al.  Universal coding, information, prediction, and estimation , 1984, IEEE Trans. Inf. Theory.

[6]  Jacob Ziv,et al.  On classification with empirically observed statistics and universal data compression , 1988, IEEE Trans. Inf. Theory.

[7]  G. Jakobson,et al.  Alarm correlation , 1993, IEEE Network.

[8]  Stephen E. Hansen,et al.  Automated System Monitoring and Notification with Swatch , 1993, LISA.

[9]  Padhraic Smyth,et al.  Markov monitoring with unknown states , 1994, IEEE J. Sel. Areas Commun..

[10]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[11]  D. Ohsie,et al.  High speed and robust event correlation , 1996, IEEE Commun. Mag..

[12]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[13]  Graham J. Williams,et al.  On-line unsupervised outlier detection using finite mixtures with discounting learning algorithms , 2000, KDD '00.

[14]  Malgorzata Steinder,et al.  The present and future of event correlation: A need for end-to-end service fault localization , 2001 .

[15]  Chris Lonvick,et al.  The BSD Syslog Protocol , 2001, RFC.

[16]  Kenji Yamanishi,et al.  A unifying framework for detecting outliers and change points from non-stationary time series data , 2002, KDD.

[17]  Risto Vaarandi,et al.  SEC - a lightweight event correlation tool , 2002, IEEE Workshop on IP Operations and Management.

[18]  Risto Vaarandi,et al.  A data clustering algorithm for mining patterns from event logs , 2003, Proceedings of the 3rd IEEE Workshop on IP Operations & Management (IPOM 2003) (IEEE Cat. No.03EX764).

[19]  Joseph L. Hellerstein,et al.  Data-driven validation, completion and construction of event relationship networks , 2003, KDD '03.

[20]  Kenji Yamanishi,et al.  Dynamic model selection with its applications to computer security , 2004, Information Theory Workshop.

[21]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[22]  Heikki Mannila,et al.  Rule Discovery in Telecommunication Alarm Data , 1999, Journal of Network and Systems Management.

[23]  Malgorzata Steinder,et al.  Probabilistic fault localization in communication systems using belief networks , 2004, IEEE/ACM Transactions on Networking.

[24]  Graham J. Williams,et al.  On-Line Unsupervised Outlier Detection Using Finite Mixtures with Discounting Learning Algorithms , 2000, KDD '00.

引用
A Failure Detection and Prediction Mechanism for Enhancing Dependability of Data Centers
2012
An end-to-end data transformation process for increasing the information yield of system traces
2013
Proactive failure detection learning generation patterns of large-scale network logs
2015 11th International Conference on Network and Service Management (CNSM)
2015
Performance Metric Selection for Autonomic Anomaly Detection on Cloud Computing Systems
2011 IEEE Global Telecommunications Conference - GLOBECOM 2011
2011
Modern MDL meets Data Mining Insights, Theory, and Practice
KDD
2019
Alert Detection in System Logs
2008 Eighth IEEE International Conference on Data Mining
2008
GAUL: Gestalt Analysis of Unstructured Logs for Diagnosing Recurring Problems in Large Enterprise Storage Systems
2010 29th IEEE Symposium on Reliable Distributed Systems
2010
Stage-aware anomaly detection through tracking log points
Middleware
2014
Online System Problem Detection by Mining Patterns of Console Logs
2009 Ninth IEEE International Conference on Data Mining
2009
Sequential network change detection with its applications to ad impact relation analysis
2012 IEEE 12th International Conference on Data Mining
2012
Sequential Network Change Detection with Its Applications to Ad Impact Relation Analysis
ICDM
2012
Model Change Detection With the MDL Principle
IEEE Transactions on Information Theory
2018
Detecting changes of clustering structures using normalized maximum likelihood coding
KDD
2012
Flexible Log File Parsing using Hidden Markov Models
ICNLP 2019
2019
With Semantics and Hidden Markov Models to an Adaptive Log File Parser
2019
Finding Surprisingly Frequent Patterns of Variable Lengths in Sequence Data
SDM
2016
Spatio-temporal factorization of log data for understanding network events
IEEE INFOCOM 2014 - IEEE Conference on Computer Communications
2014
Understanding university campus network reliability characteristics using a big data analytics tool
2015 11th International Conference on the Design of Reliable Communication Networks (DRCN)
2015
Understanding the health of an access network
2011 Third International Conference on Ubiquitous and Future Networks (ICUFN)
2011
Errors and Faults
2015