Sequential Patterns Postprocessing for Structural Relation Patterns Mining

Sequential patterns mining is an important data-mining technique used to identify frequently observed sequential occurrence of items across ordered transactions over time. It has been extensively studied in the literature, and there exists a diversity of algorithms. However, more complex structural patterns are often hidden behind sequences. This article begins with the introduction of a model for the representation of sequential patterns—Sequential Patterns Graph—which motivates the search for new structural relation patterns. An integrative framework for the discovery of these patterns–Postsequential Patterns Mining–is then described which underpins the postprocessing of sequential patterns. A corresponding data-mining method based on sequential patterns postprocessing is proposed and shown to be effective in the search for concurrent patterns. From experiments conducted on three component algorithms, it is demonstrated that sequential patterns-based concurrent patterns mining provides an efficient method for structural knowledge discovery.

[1]  Lawrence B. Holder,et al.  Graph-Based Data Mining , 2000, IEEE Intell. Syst..

[2]  Steen Brahe Enterprise Specific BPM Languages and Tools , 2011 .

[3]  Peter Rittgen,et al.  Handbook of Ontologies for Business Interaction , 2007 .

[4]  M. Gordon Hunter,et al.  Information Systems and Small Business , 2005, Encyclopedia of Information Science and Technology.

[5]  Mikhail J. Atallah,et al.  Detection of significant sets of episodes in event sequences , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[6]  Gemma C. Garriga,et al.  Summarizing Sequential Data with Closed Partial Orders , 2005, SDM.

[7]  Gemma C. Garriga,et al.  Coproduct Transformations on Lattices of Closed Partial Orders , 2004, ICGT.

[8]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[9]  Jing Lu,et al.  Sequential patterns graph and its construction algorithm , 2004 .

[10]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[11]  Suh-Yin Lee,et al.  Fast Discovery of Sequential Patterns through Memory Indexing and Database Partitioning , 2005, J. Inf. Sci. Eng..

[12]  Marianne Afifi Process Mapping for Electronic Resources: A Lesson from Business Models , 2008 .

[13]  Dimitris Askounis,et al.  Ontology-Based Registries: An E-Business Transactions’ Registry , 2011 .

[14]  Nils Urbach,et al.  Measuring Organizational Information Systems Success: New Technologies and Practices , 2012 .

[15]  Heikki Mannila,et al.  Global partial orders from sequential data , 2000, KDD '00.

[16]  Anna Marie Balling Høstgaard End-user participation in Health IT development: the EUPHIT method , 2012 .

[17]  Jian Pei,et al.  CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[18]  Mohammed J. Zaki Efficiently mining frequent trees in a forest , 2002, KDD.

[19]  Jérôme Gensel,et al.  Spatial OLAP and Map Generalization: Model and Algebra , 2012, Int. J. Data Warehous. Min..

[20]  Eshaa M. Alkhalifa E-Strategies for Resource Management Systems: Planning and Implementation , 2010 .

[21]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[22]  Jian Pei,et al.  From sequential pattern mining to structured pattern mining: A pattern-growth approach , 2004, Journal of Computer Science and Technology.

[23]  Renata Iváncsy,et al.  A Survey of Discovering Frequent Patterns in Graph Data , 2005, Databases and Applications.

[24]  Marisa Analía Sánchez,et al.  Mining Tuberculosis Data , 2009 .

[25]  Luigi Pontieri,et al.  An Information-Theoretic Framework for Process Structure and Data Mining , 2006, Int. J. Data Warehous. Min..

[26]  Jing Lu From sequential patterns to concurrent branch patterns : a new post sequential patterns mining approach , 2006 .

[27]  Patricia Cerrito Text Mining Techniques for Healthcare Provider Quality Determination: Methods for Rank Comparisons , 2009 .

[28]  Nikos Pelekis,et al.  Visual Mobility Analysis using T-Warehouse , 2011, Int. J. Data Warehous. Min..

[30]  Yasin Ozcelik,et al.  IT-Enabled Reengineering: Productivity Impacts , 2010 .

[31]  Jan Rauch,et al.  Data Mining and Medical Knowledge Management: Cases and Applications , 2009 .

[32]  Jiong Yang,et al.  SPIN: mining maximal frequent subgraphs from graph databases , 2004, KDD.

[33]  R. Gershon Intelligent Networking and Business Process Innovation: A Case Study Analysis of Home Box Office and Dell Computers , 2009 .

[34]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[35]  Laura Giurca Vasilescu,et al.  Data Mining Used for Analyzing the Bankruptcy Risk of the Romanian SMEs , 2011 .