Detecting Performance Degradation of Software-Intensive Systems in the Presence of Trends and Long-Range Dependence

As contemporary software-intensive systems reach increasingly large scale, it is imperative that failure detection schemes be developed to help prevent costly system downtimes. A promising direction towards the construction of such schemes is the exploitation of easily available measurements of system performance characteristics such as average number of processed requests and queue size per unit of time. In this work, we investigate a holistic methodology for detection of abrupt changes in time series data in the presence of quasi-seasonal trends and long-range dependence with a focus on failure detection in computer systems. We propose a trend estimation method enjoying optimality properties in the presence of long-range dependent noise to estimate what is considered "normal" system behaviour. To detect change-points and anomalies, we develop an approach based on the ensembles of "weak" detectors. We demonstrate the performance of the proposed change-point detection scheme using an artificial dataset, the publicly available Abilene dataset as well as the proprietary geoinformation system dataset.

[1]  Peter R. Winters,et al.  Forecasting Sales by Exponentially Weighted Moving Averages , 1960 .

[2]  Tamara Radivilova,et al.  COMPARATIVE ANALYSIS FOR ESTIMATING OF THE HURST EXPONET FOR STATIONARY AND NONSTATIONARY TIME SERIES , 2011 .

[3]  R. Vautard,et al.  Singular-spectrum analysis: a toolkit for short, noisy chaotic signals , 1992 .

[4]  Optimal estimation of a signal perturbed by a fractional Brownian noise , 2016, 1707.07329.

[5]  Vadim V. Nikulin,et al.  Detrended Fluctuation Analysis: A Scale-Free View on Neuronal Oscillations , 2012, Front. Physio..

[6]  B. Mandelbrot,et al.  Fractional Brownian Motions, Fractional Noises and Applications , 1968 .

[7]  Andreas Daffertshofer,et al.  Scale-free dynamics and critical phenomena in cortical activity , 2013, Front. Physiol..

[8]  David F. Findley,et al.  New Capabilities and Methods of the X-12-ARIMA Seasonal-Adjustment Program , 1998 .

[9]  Evgeny Burnaev,et al.  Ensembles of detectors for online detection of transient changes , 2015, International Conference on Machine Vision.

[10]  Walter Willinger,et al.  On the self-similar nature of Ethernet traffic , 1993, SIGCOMM '93.

[11]  A. R. Crathorne,et al.  Economic Control of Quality of Manufactured Product. , 1933 .

[12]  E. S. Page CONTINUOUS INSPECTION SCHEMES , 1954 .

[13]  Desmond P. Taylor,et al.  On the SelfSimilar Nature of Ethernet Traffic (Extended Version) , 2007 .

[14]  Evgeny Burnaev,et al.  Nonparametric decomposition of quasi-periodic time series for change-point detection , 2015, International Conference on Machine Vision.

[15]  Mark Crovella,et al.  Diagnosing network-wide traffic anomalies , 2004, SIGCOMM '04.

[16]  Svetha Venkatesh,et al.  Anomaly detection in large-scale data stream networks , 2012, Data Mining and Knowledge Discovery.

[17]  Pedro Casas,et al.  Optimal volume anomaly detection and isolation in large-scale IP networks using coarse-grained measurements , 2010, Comput. Networks.

[18]  Alexandru Iosup,et al.  Analysis and modeling of time-correlated failures in large-scale distributed systems , 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.

[19]  VenkateshSvetha,et al.  Anomaly detection in large-scale data stream networks , 2014 .

[20]  Walter Willinger,et al.  On the Self-Similar Nature of Ethernet Traffic ( extended version ) , 1995 .

[21]  E. Prescott,et al.  Postwar U.S. Business Cycles: An Empirical Investigation , 1997 .

[22]  Aleksey S. Polunchenko,et al.  State-of-the-Art in Sequential Change-Point Detection , 2011, 1109.2938.

[23]  Douglas C. Schmidt,et al.  Ultra-Large-Scale Systems: The Software Challenge of the Future , 2006 .