Constraint-Based Sequence Mining Using Constraint Programming

The goal of constraint-based sequence mining is to find sequences of symbols that are included in a large number of input sequences and that satisfy constraints specified by the user. Many constraints have been proposed in the literature, but a general framework is still missing. We investigate the use of constraint programming as a general framework for this task.
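To make the task statement concrete, the following is a minimal sketch of the mining problem itself, not of the constraint programming approach investigated here. It assumes that "included in" means subsequence inclusion with gaps allowed, and it enumerates candidates by brute force. The names `is_subsequence`, `support`, and `mine`, the `max_length` bound, and the toy database are all illustrative assumptions.

```python
from itertools import product


def is_subsequence(pattern, sequence):
    """Return True if the symbols of pattern occur in sequence in order (gaps allowed)."""
    it = iter(sequence)
    # `symbol in it` advances the iterator up to the first match,
    # so later symbols must be found after earlier ones.
    return all(symbol in it for symbol in pattern)


def support(pattern, database):
    """Count how many input sequences include the pattern."""
    return sum(is_subsequence(pattern, seq) for seq in database)


def mine(database, min_support, max_length, constraint=lambda p: True):
    """Brute-force enumeration of frequent patterns that satisfy a user constraint.

    Assumption: patterns are bounded by max_length so the search is finite.
    """
    alphabet = sorted({symbol for seq in database for symbol in seq})
    results = {}
    for length in range(1, max_length + 1):
        for pattern in product(alphabet, repeat=length):
            if not constraint(pattern):
                continue
            freq = support(pattern, database)
            if freq >= min_support:
                results[pattern] = freq
    return results


# Toy database (hypothetical): each string is one input sequence of symbols.
database = ["abcb", "acb", "bca", "abb"]

# User constraint (hypothetical): the pattern must start with the symbol "a".
patterns = mine(database, min_support=3, max_length=2,
                constraint=lambda p: p[0] == "a")
print(patterns)
```

Exhaustive enumeration grows exponentially with the pattern length, which is one reason a declarative framework that accepts arbitrary user constraints and prunes the search is attractive.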
