Specifying Mining Algorithms with Iterative User-Defined Aggregates: A Case Study

We present a way of exploiting domain knowledge in the design and implementation of data mining algorithms, with special attention to frequent patterns discovery, within a deductive framework. In our framework domain knowledge is represented by deductive rules, and data mining algorithms are constructed by means of iterative user-defined aggregates. Iterative user-defined aggregates have a fixed scheme that allows the modularization of data mining algorithms, thus providing a way to exploit domain knowledge in the right point. As a case study, the paper presents user-defined aggregates for specifying a version of the apriori algorithm. Some performance analyses and comparisons are discussed in order to show the effectiveness of the approach.

[1]  Fosca Giannotti,et al.  Making Knowledge Extraction and Reasoning Closer , 2000, PAKDD.

[2]  Fosca Giannotti,et al.  Declarative Knowledge Extraction with Interactive User-Defined Aggregates , 2000, FQAS.

[3]  Tomasz Imielinski,et al.  MSQL: A Query Language for Database Mining , 1999, Data Mining and Knowledge Discovery.

[4]  Giuseppe Psaila,et al.  A New SQL-like Operator for Mining Association Rules , 1996, VLDB.

[5]  Fosca Giannotti,et al.  Querying Inductive Databases via Logic-Based User-Defined Aggregates , 1999, PKDD.

[6]  Surajit Chaudhuri,et al.  Optimization of queries with user-defined predicates , 1996, TODS.

[7]  Dino Pedreschi,et al.  Nondeterministic, Nonmonotonic Logic Databases , 2001, IEEE Trans. Knowl. Data Eng..

[8]  Heikki Mannila,et al.  Inductive Databases and Condensed Representations for Data Mining , 1997, ILPS.

[9]  Laks V. S. Lakshmanan,et al.  Exploratory mining and pruning optimizations of constrained associations rules , 1998, SIGMOD '98.

[10]  Ravi Krishnamurthy,et al.  Towards on Open Architecture for LDL , 1989, VLDB.

[11]  Carlo Zaniolo,et al.  Logic-Based User-Defined Aggregates for the Next Generation of Database Systems , 1999, The Logic Programming Paradigm.

[12]  Sunita Sarawagi,et al.  Integrating association rule mining with relational database systems: alternatives and implications , 1998, SIGMOD '98.

[13]  Carlo Zaniolo,et al.  Negation and Aggregates in Recursive Rules: the LDL++ Approach , 1993, DOOD.

[14]  Franco Turini,et al.  An XML Based Environment in Support of the Overall KDD Process , 2000, FQAS.

[15]  C. V. Ramamoorthy,et al.  Knowledge and Data Engineering , 1989, IEEE Trans. Knowl. Data Eng..

[16]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules and sequential patterns , 1996 .