Alarm Prioritization and Diagnosis for Cellular Networks

Alarm events occurring in telecommunication networks can be an invaluable tool for network operators. However, given the size and complexity of today’s networks, handling of alarm events represents a challenge in itself, due to two key aspects: high volume and lack of descriptiveness. The latter derives from the fact that not all alarm events report the actual source of failure. A failure in a higher-level managed object could result in alarm events observed on its controlled objects. In addition, alarm events may not be indicative of network distress, as many devices have automatic fallback solutions that may permit normal network operation to continue. Indeed, given the amount of equipment in a network, there can be a “normal” amount of failure that occurs on a regular basis; if each alarm is treated with equal attention, the volume can quickly become untenable. To address these shortcomings, we propose a novel framework that prioritizes and diagnoses alarm events. We rely on a priori information about the managed network structure, relationships, and fault management practices, and use a probabilistic logic engine that allows evidence and rules to be encoded as sentences in first order logic. Our work, tested using real cellular network data, achieves a significant reduction in the amount of analyzed objects in the network by combining alarms into sub-graphs and prioritizing them, and offers the most probable diagnosis outcome.

[1]  Lundy Lewis,et al.  Event Correlation in Integrated Management: Lessons Learned and Outlook , 2007, Journal of Network and Systems Management.

[2]  Ulf Lindqvist,et al.  Anomaly Detection and Diagnosis for Automatic Radio Network Verification , 2014, MONAMI.

[3]  Georg Carle,et al.  Graph coloring based physical-cell-ID assignment for LTE networks , 2009, IWCMC.

[4]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[5]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[6]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[7]  M. Jacomy,et al.  ForceAtlas2, a Continuous Graph Layout Algorithm for Handy Network Visualization Designed for the Gephi Software , 2014, PloS one.

[8]  Peter C. Doerschuk,et al.  Cluster Expansions for the Deterministic Computation of Bayesian Estimators Based on Markov Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Kimmo Hätönen,et al.  Data mining for telecommunications network log analysis , 2009 .

[10]  Kimmo Hätönen,et al.  Domain Structures in Filtering Irrelevant Frequent Patterns , 2004, Database Support for Data Mining Applications.

[11]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1998, Learning in Graphical Models.

[12]  Mischa Schwartz,et al.  Schemes for fault identification in communication networks , 1995, TNET.

[13]  Dilmar Malheiros Meira A Model For Alarm Correlation in Telecommunications Networks , 1997 .

[14]  Anne Bouillard,et al.  Alarms correlation in telecommunication networks , 2013 .