论文信息 - A proposal for a model for dealing with value-based data dependencies to improve the rule discovery process

A proposal for a model for dealing with value-based data dependencies to improve the rule discovery process

The discovery of conjunctive "if-then" classification rules may be intractable when enumerating all possible conjunctions of terms. Various algorithms, notably C4.5 and CART, adopt a univariate strategy which reduces the process to a one-at-a-time best variable type of approach. While computationally feasible, such an approach may lead to unexplored portions of the database which may contain valuable nuggets. On the other hand, an exhaustive evaluation of all possible conjunctions may be intractable even for relatively small datasets. We propose a general approach to reduce the size of the search space of conjunctive "if-then" rule discovery algorithms by exploiting value-based data dependencies existing among the independent variables.

Vincenzo Cutello | G. Giuffrida | V. Cutello | G. Giuffrida

[1] Heikki Mannila,et al. Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[2] Jeffrey D. Ullman,et al. Principles of Database and Knowledge-Base Systems, Volume II , 1988, Principles of computer science series.

[3] Patrick Bosc,et al. Functional dependencies and the design of relational databases extended to imprecise data , 1998 .

[4] Catherine Blake,et al. UCI Repository of machine learning databases , 1998 .

[5] Arun K. Majumdar,et al. Fuzzy Functional Dependencies and Lossless Join Decomposition of Fuzzy Relational Database Systems , 1988, ACM Trans. Database Syst..

[6] Vincenzo Cutello,et al. Recursive connective rules , 1999, Int. J. Intell. Syst..

[7] Juan C. Cubero,et al. A new definition of fuzzy functional dependency in fuzzy relational databases , 1994, Int. J. Intell. Syst..

[8] Guoqing Chen,et al. Fuzzy Functional Dependency and a Series of Design Issues of Fuzzy Relational Databases , 1995 .

[9] Jeffrey D. Uuman. Principles of database and knowledge- base systems , 1989 .

[10] Vincenzo Cutello,et al. Hierarchies of aggregation operators , 1994, Int. J. Intell. Syst..

[11] Roberto J. Bayardo. Brute-Force Mining of High-Confidence Classification Rules , 1997, KDD.