Automatic Discovery of Data-Centric and Artifact-Centric Processes

Process discovery is a technique that allows for automatically discovering a process model from recorded executions of a process as it happens in reality. This technique has successfully been applied for classical processes where one process execution is constituted by a single case with a unique case identifier. Data-centric and artifact-centric systems such as ERP systems violate this assumption. Here a process execution is driven by process data having various notions of interrelated identifiers that distinguish the various interrelated data objects of the process. Classical process mining techniques fail in this setting. This paper presents an automatic technique for discovering for each notion of data object in the process a separate process model that describes the evolution of this object, also known as artifact life-cycle model. Given a relational database that stores process execution information of a data-centric system, the technique extracts event information, case identifiers and their interrelations, discovers the central process data objects and their associated events, and decomposes the data source into multiple logs, each describing the cases of a separate data object. Then classical process discovery techniques can be applied to obtain a process model for each object. The technique is implemented and has been evaluated on the production ERP system of a large retailer.

[1]  Felix Naumann,et al.  Advancing the discovery of unique column combinations , 2011, CIKM '11.

[2]  Berthold Reinwald,et al.  Discovering topical structures of databases , 2008, SIGMOD Conference.

[3]  Santhosh Kumaran,et al.  A model-driven approach to industrializing discovery processes in pharmaceutical research , 2005, IBM Syst. J..

[4]  Jan Martijn E. M. van der Werf,et al.  Process Diagnostics: A Method Based on Process Mining , 2009, 2009 International Conference on Information, Process, and Knowledge Management.

[5]  Jon Espen Ingvaldsen,et al.  Preprocessing Support for Large Scale Process Mining of SAP Transactions , 2007, Business Process Management Workshops.

[6]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[7]  Divesh Srivastava,et al.  Type-based categorization of relational attributes , 2009, EDBT '09.

[8]  Ashutosh Tiwari,et al.  A review of business process mining: state-of-the-art and future trends , 2008, Bus. Process. Manag. J..

[9]  Guido Governatori,et al.  Compliance aware business process design , 2008 .

[10]  Josef Stoer,et al.  Numerische Mathematik 1 , 1989 .

[11]  Beng Chin Ooi,et al.  Automatic discovery of attributes in relational databases , 2011, SIGMOD '11.

[12]  Boudewijn F. van Dongen,et al.  XES, XESame, and ProM 6 , 2010, CAiSE Forum.

[13]  Divesh Srivastava,et al.  Summarizing Relational Databases , 2009, Proc. VLDB Endow..

[14]  Megha Ramesh Kumar Discovering Topical Structures of Databases Professor : , 2008 .

[15]  Richard Hull,et al.  Business Artifacts: A Data-centric Approach to Modeling Business Operations and Processes , 2009, IEEE Data Eng. Bull..

[16]  Beng Chin Ooi,et al.  On multi-column foreign key discovery , 2010, Proc. VLDB Endow..

[17]  Marlon Dumas On the Convergence of Data and Process Engineering , 2011, ADBIS.

[18]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[19]  P. Soffer,et al.  Information Systems Evolution - CAiSE Forum 2010, Hammamet, Tunisia, June 7-9, 2010, Selected Extended Papers , 2011, CAiSE Forum.

[20]  Felix Naumann,et al.  Efficiently Detecting Inclusion Dependencies , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[21]  Jean-Marc Petit,et al.  Unary and n-ary inclusion dependency discovery in relational databases , 2009, Journal of Intelligent Information Systems.

[22]  G. G. Meyer,et al.  Lecture notes in business information processing , 2009 .

[23]  Kamal Bhattacharya,et al.  Modeling Business Contexture and Behavior Using Business Artifacts , 2007, CAiSE.