The discovery of frequent patterns with logic and constraint programming

The basic goal of data mining is to discover patterns occurring in the databases, such as associations, classification models, sequential patterns, and so on. In this paper we focus on the problem of frequent pattern discovery, which is the process of searching for patterns such as sets of features or items that appear in data frequently. Such frequent patterns can reveal associations, correlations, and many other interesting relationships hidden in a database. Most of frequent pattern mining systems in the market are too generic and become inefficient when set of patterns is large and the frequent patterns are very long. A new trend in data mining is a scalable method that uses constraints to guide the system in its search for interesting patterns. Our main research objective is the development of constraint-based mining methodology and this paper presents the preliminary results of our study and prototype development. We present the implementation of frequent pattern mining system based on declarative programming paradigm using logic programming and constraint logic programming. The comparative performance studies on speed and memory usage of logic versus constraint programming are also reported in the paper.

[1]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[2]  Stefano Bistarelli,et al.  Soft constraint based pattern mining , 2007, Data Knowl. Eng..

[3]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[4]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[5]  Gregory Piatetsky-Shapiro,et al.  Knowledge Discovery in Databases: An Overview , 1992, AI Mag..

[6]  Ramakrishnan Srikant,et al.  Mining Association Rules with Item Constraints , 1997, KDD.

[7]  Jian Pei,et al.  Can we push more constraints into frequent pattern mining? , 2000, KDD '00.

[8]  Ivan Bratko,et al.  Prolog Programming for Artificial Intelligence , 1986 .

[9]  Laks V. S. Lakshmanan,et al.  Pushing Convertible Constraints in Frequent Itemset Mining , 2004, Data Mining and Knowledge Discovery.

[10]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[11]  Hannu Toivonen,et al.  Sampling Large Databases for Association Rules , 1996, VLDB.

[12]  Luc De Raedt,et al.  Constraint programming for itemset mining , 2008, KDD.

[13]  Ivan Bratko,et al.  Prolog (3rd ed.): programming for artificial intelligence , 2000 .

[14]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[15]  Wei Li,et al.  New parallel algorithms for fast discovery of associ-ation rules , 1997 .

[16]  Krzysztof R. Apt,et al.  Constraint logic programming using Eclipse , 2007 .