Using Graph Aggregation for Service Interaction Message Correlation

Discovering the behavior of services and their interactions in an enterprise requires the ability to correlate service interaction messages into process instances. The service interaction logic (or process model) is then discovered from the set of process instances that are the result of a given way of correlating messages. However, sometimes, the Correlation Conditions (CC) allowing to identify correlations of messages from a service interaction log are not known. In such cases, and with a large number of message's correlator attributes, we are facing a large space of possible ways messages may be correlated which makes identifying process instances difficult. In this paper, we propose an approach based on message indexation and aggregation to generate a size-efficient Aggregated Correlation Graph (ACG) that exhibits all the ways messages correlate in a service interaction log not only for disparate pairs of messages but also for sequences of messages corresponding to process instances. Adapted filtering techniques based on user defined heuristics are then applied on such a graph to help the analysts efficiently identify the most frequently executed processes from their sequences of CCs. The approach has been implemented and experiments show its effectiveness to identify relevant sequences of CCs from large service interaction logs.

[1]  Marlon Dumas,et al.  Correlation Patterns in Service-Oriented Architectures , 2007, FASE.

[2]  Torsten Suel,et al.  Performance of compressed inverted list caching in search engines , 2008, WWW.

[3]  W. De Pauw,et al.  Web Services Navigator: Visualizing the execution of Web Services , 2005, IBM Syst. J..

[4]  Daniel Gillblad,et al.  Discovering Process Models from Unlabelled Event Logs , 2009, BPM.

[5]  Myra Spiliopoulou,et al.  A Framework for the Evaluation of Session Reconstruction Heuristics in Web-Usage Analysis , 2003, INFORMS J. Comput..

[6]  Wil M. P. van der Aalst,et al.  A Rule-Based Approach for Process Discovery: Dealing with Noise and Imbalance in Process Logs , 2005, Data Mining and Knowledge Discovery.

[7]  Fabio Casati,et al.  Deriving Protocol Models from Imperfect Service Conversation Logs , 2008, IEEE Transactions on Knowledge and Data Engineering.

[8]  Yi Huang,et al.  Discovering Conversations in Web Services Using Semantic Correlation Analysis , 2007, IEEE International Conference on Web Services (ICWS 2007).

[9]  Fabio Casati,et al.  Protocol Discovery from Imperfect Service Interaction Logs , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[10]  Fabio Casati,et al.  Event correlation for process discovery from web service interaction logs , 2011, The VLDB Journal.

[11]  Boudewijn F. van Dongen,et al.  Workflow mining: A survey of issues and approaches , 2003, Data Knowl. Eng..

[12]  Fabio Casati,et al.  Process spaceship: discovering and exploring process views from event logs in data spaces , 2008, Proc. VLDB Endow..