Using domain knowledge for targeted alerting in PBA for IT service management

Process behavior analysis (PBA) has been applied to service quality control for decades. In IT service delivery management, PBA has been widely and successfully applied to monitor key performance indicators (KPIs) over time to identify process anomalies. However, we notice that PBA is often limited to a relatively small number of high-level indicators instead of slicing and dicing data for each KPI to find the key dimensions that highlight specific problems. This is due to the fact that without applying domain knowledge to identify relevant alerts, monitoring all possible dimensions leads to too many irrelevant alerts. In this paper, we demonstrate that, using domain-specific rules for each dimension as well as for dimension combinations, we generate targeted alerts for specific consumers. While the presented framework is generic, we demonstrate specifically how targeted alerting can be applied to the incident management process, in particular to monitoring incident volumes over time. We showcase the value of our targeted alerting mechanism using data from 18 outsourcing clients.

[1]  Fred Spiring,et al.  Introduction to Statistical Quality Control , 2007, Technometrics.

[2]  Karin Murthy,et al.  COACH: Cognitive analytics for change , 2017, 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM).

[3]  Liang Tang,et al.  Optimizing system monitoring configurations for non-actionable alerts , 2012, 2012 IEEE Network Operations and Management Symposium.

[4]  Joseph L. Hellerstein,et al.  An approach to predictive detection for service management , 1999, Integrated Network Management VI. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on Integrated Network Management. (Cat. No.99EX302).

[5]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[6]  Victor W. Marek,et al.  Collating time-series resource data for system-wide job profiling , 2016, NOMS 2016 - 2016 IEEE/IFIP Network Operations and Management Symposium.

[7]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988 .

[8]  Bikram Sengupta,et al.  Reducing defects in IT service delivery , 2013, Proceedings of 2013 IEEE International Conference on Service Operations and Logistics, and Informatics.

[9]  Anthony C. Davison,et al.  Bootstrap Methods and Their Application , 1998 .

[10]  Jasmina Bogojeska,et al.  Hierarchical Incident Ticket Classification with Minimal Supervision , 2014, 2014 IEEE International Conference on Data Mining.

[11]  Xifeng Yan,et al.  Workload characterization and prediction in the cloud: A multiple time series approach , 2012, 2012 IEEE Network Operations and Management Symposium.