论文信息 - Extracting optimal association rules over numeric attributes

Extracting optimal association rules over numeric attributes

Data Mining is the process of finding novel, useful and understandable patterns in massive data. Association rules are a commonly used method to discover these patterns. Until recently, the known techniques for generating association rules required that the data fields (called attributes) be binary. In a series of papers, F’ukuda, Yoda, and collaborators have developed an approach to obtain association rules on numeric attributes. They have found optimal association rules for 2-dimensional numeric antecedents, where the shape of the region obtained is a convex region; their algorithm has time complexity O(n’.‘) on a grid of n pixels. We obtain efficient algorithms to find optimal association rules for 2-dimensional numeric antecedents, where the shape of the region obtained is what we call an anchored convex region or an anchored triagular region. These algorithms have time complexity O(n). However, unless P = NP, no polynomial time algorithm finds. an anchored convex region or anchored triangular region which is optimal with regard to support or confidence. These two classes of region can find application in situations where it is known from the outset that the data of greatest interest lies close to one edge of the grid, and where the improvement in execution speed from O(n’.‘) to O(n) is critical.

Alan P. Sprague | A. Sprague

[1] Yasuhiko Morimoto,et al. Mining Optimized Association Rules for Numeric Attributes , 1999, J. Comput. Syst. Sci..

[2] Padhraic Smyth,et al. From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[3] David S. Johnson,et al. Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[4] Yasuhiko Morimoto,et al. Computing Optimized Rectilinear Regions for Association Rules , 1997, KDD.

[5] Yasuhiko Morimoto,et al. Mining optimized association rules for numeric attributes , 1996, J. Comput. Syst. Sci..

[6] Ramakrishnan Srikant,et al. Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[7] Renée J. Miller,et al. Association rules over interval data , 1997, SIGMOD '97.

[8] Yasuhiko Morimoto,et al. Data mining using two-dimensional optimized association rules: scheme, algorithms, and visualization , 1996, SIGMOD '96.