Learning Actions in Complex Software Systems

Administering service-oriented architecture (SOA) systems could require sophisticated rules to decide for instance whether to add or remove servers and when. Rule construction often necessitates experts to study patterns that contribute to changes or events. This is a time consuming and error-prone process for complex software systems. In this paper we test the feasibility of automating this process by mining historical data such as past service requests (in time series) and server change events that the administrator committed. We propose a new method to relate frequent patterns in a given time series to changes recorded in the event's history. We implemented and tested our method on a simulation system for SOA applications. First, we use Euclidean distance, DTW, and FastDTW to identify frequent patterns in a time series that represents performance metric of a SOA simulation system. Then, we calculate the confidence and support of frequent patterns that contribute to changes to identify a set of rules for automating changes. We tested rules that are generated using the proposed method in a training set on a testing set. The average accuracy of generated rules for the change event "remove" exceeded 80% in our experiments.

[1]  Eamonn J. Keogh,et al.  Scaling up dynamic time warping for datamining applications , 2000, KDD '00.

[2]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[3]  Eleni Stroulia,et al.  Toward a simulation-generated knowledge base of service performance , 2009, MWSOC '09.

[4]  David Sankoff,et al.  Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison , 1983 .

[5]  Wesley W. Chu,et al.  An index-based approach for similarity search supporting time warping in large sequence databases , 2001, Proceedings 17th International Conference on Data Engineering.

[6]  Catherine Garbay,et al.  Knowledge construction from time series data using a collaborative exploration system , 2007, J. Biomed. Informatics.

[7]  Ioannis P. Androulakis,et al.  Selecting maximally informative genes , 2005, Comput. Chem. Eng..

[8]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[9]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[10]  David J. Hand,et al.  Advances in intelligent data analysis , 2000 .

[11]  Joseph B. Kruskall,et al.  The Symmetric Time-Warping Problem : From Continuous to Discrete , 1983 .

[12]  Jiawei Han,et al.  Classification of software behaviors for failure detection: a discriminative pattern mining approach , 2009, KDD.

[13]  Philip Chan,et al.  Toward accurate dynamic time warping in linear time and space , 2007, Intell. Data Anal..

[14]  Konstantinos Kalpakis,et al.  Distance measures for effective clustering of ARIMA time-series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[15]  Magnus Lie Hetland,et al.  Evolutionary Rule Mining in Time Series Databases , 2005, Machine Learning.

[16]  Heikki Mannila,et al.  Knowledge discovery from telecommunication network alarm databases , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[17]  Frank Klawonn,et al.  Finding informative rules in interval sequences , 2001, Intell. Data Anal..

[18]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[19]  Fabian Mörchen,et al.  Time Series Knowledge Mining , 2006 .

[20]  Stan Salvador,et al.  FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space , 2004 .

[21]  Andrew R. Post,et al.  Temporal data mining. , 2008, Clinics in laboratory medicine.

[22]  Shamkant B. Navathe,et al.  An Efficient Algorithm for Mining Association Rules in Large Databases , 1995, VLDB.

[23]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[24]  Ulrich Güntzer,et al.  Algorithms for association rule mining — a general survey and comparison , 2000, SKDD.

[25]  谷口 倫一郎,et al.  Frequent Motion Pattern Extraction for Motion Recognition in Real-time Human Proxy , 2005 .

[26]  Dr. Alex A. Freitas Data Mining and Knowledge Discovery with Evolutionary Algorithms , 2002, Natural Computing Series.

[27]  Chao Liu,et al.  Efficient mining of iterative patterns for software specification discovery , 2007, KDD '07.

[28]  Giuseppe Psaila,et al.  Querying Shapes of Histories , 1995, VLDB.

[29]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[30]  Paul S. Bradley,et al.  Initialization of Iterative Refinement Clustering Algorithms , 1998, KDD.

[31]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[32]  Eamonn J. Keogh,et al.  Finding surprising patterns in a time series database in linear time and space , 2002, KDD.

[33]  Eamonn J. Keogh,et al.  Locally adaptive dimensionality reduction for indexing large time series databases , 2001, SIGMOD '01.

[34]  Magnus Lie Hetland,et al.  Temporal Rule Discovery using Genetic Programming and Specialized Hardware , 2004 .

[35]  Haym Hirsh,et al.  Learning to Predict Rare Events in Event Sequences , 1998, KDD.