Statistical Sequence Analysis For Business Process Mining And Organizational Routines

Analyzing discrete event sequences has become a popular field in recent years. In the area of business process mining, numerous techniques have been developed to discover the structure of business processes by means of traces they leave behind in information systems. In organizational routines literature, these traces have been identified as a valuable source of information to investigate the dynamics of routines and how they evolve over time. However, both areas have been discussed in separation only. But in both areas alike the fundamental problem is to acquire knowledge about regularities in sequences of events based on observations thereof, and thus, we argue that process mining has the potential to advance research on organizational routines. As with any data analysis problem, one has to deal with problems due to noisy data and small samples. Thus, we show in this paper how to apply simple statistical tools to pattern detection in sequences. Subsequently, we integrate this into the popular algorithm. This paves the way for statistically controlling the risk of falling for erroneous results. To the best of our knowledge, no process mining algorithm is capable of doing this. We are convinced that this will facilitate applicability in organizational routines studies.

[1]  Brian T. Pentland,et al.  Comparing Organizational Routines as Recurrent Patterns of Action , 2010 .

[2]  G. G. Meyer,et al.  Lecture notes in business information processing , 2009 .

[3]  Wil M.P. van der Aalst,et al.  Process mining with the HeuristicsMiner algorithm , 2006 .

[4]  Boudewijn F. van Dongen,et al.  Business process mining: An industrial application , 2007, Inf. Syst..

[5]  Brian T. Pentland,et al.  Sequential Variety in Work Processes , 2003, Organ. Sci..

[6]  C. Blyth,et al.  Binomial Confidence Intervals , 1983 .

[7]  Markus C. Becker Organizational routines: a review of the literature , 2004 .

[8]  Boudewijn F. van Dongen,et al.  Process mining: a two-step approach to balance between underfitting and overfitting , 2008, Software & Systems Modeling.

[9]  Martha S. Feldman,et al.  Organizational Routines as a Unit of Analysis , 2005 .

[10]  Albert Boonstra,et al.  Proceedings of the 21st European Conference on Information Systems , 2013 .

[11]  Brian T. Pentland,et al.  Using Workflow Data to Explore the Structure of an Organizational Routine , 2009 .

[12]  Alan R. Hevner,et al.  Design Science in Information Systems Research , 2004, MIS Q..

[13]  P. Laplace Théorie analytique des probabilités , 1995 .

[14]  M. Feldman,et al.  Reconceptualizing Organizational Routines as a Source of Flexibility and Change , 2003 .

[15]  S. Winter,et al.  An evolutionary theory of economic change , 1983 .

[16]  B. K. Ghosh,et al.  A Comparison of Some Approximate Confidence Intervals for the Binomial Parameter , 1979 .

[17]  Helly Grundbegriffe der Wahrscheinlichkeitsrechnung , 1936 .

[18]  Feller William,et al.  An Introduction To Probability Theory And Its Applications , 1950 .

[19]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[20]  Markus C. Becker,et al.  Applying Organizational Routines in Analyzing the Behavior of Organizations , 2008 .

[21]  Y. Hochberg A sharper Bonferroni procedure for multiple tests of significance , 1988 .

[22]  Carlo Salvato,et al.  Capabilities Unveiled: The Role of Ordinary Activities in the Evolution of Product Development Processes , 2009, Organ. Sci..

[23]  Ricardo Seguel,et al.  Process Mining Manifesto , 2011, Business Process Management Workshops.

[24]  B. Pentland,et al.  Organizational Routines as Grammars of Action , 1994 .

[25]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[26]  Martha S. Feldman,et al.  Dynamics of Organizational Routines: A Generative Model , 2012 .

[27]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[28]  E. S. Pearson,et al.  THE USE OF CONFIDENCE OR FIDUCIAL LIMITS ILLUSTRATED IN THE CASE OF THE BINOMIAL , 1934 .

[29]  Claus Rerup,et al.  Beyond Collective Entities: Multilevel Research on Organizational Routines and Capabilities , 2011 .

[30]  Larry Wasserman,et al.  All of Statistics: A Concise Course in Statistical Inference , 2004 .

[31]  C. Salvato The Contribution of Event-sequence Analysis to the Study of Organizational Routines , 2009 .

[32]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[33]  H. Bateman Book Review: Ergebnisse der Mathematik und ihrer Grenzgebiete , 1933 .

[34]  Nicholas Walliman,et al.  Social research methods , 2006 .

[35]  S. Winter Economic "Natural Selection" and the Theory of the Firm , 1964 .

[36]  J. Shaffer Multiple Hypothesis Testing , 1995 .

[37]  S. Winter,et al.  An Evolutionary Theory of Economic Change.by Richard R. Nelson; Sidney G. Winter , 1987 .

[38]  Silke Aisenbrey,et al.  New Life for Old Ideas: The "Second Wave" of Sequence Analysis Bringing the "Course" Back Into the Life Course , 2010 .