Paper information - Pattern-based clustering for database attribute values

We present a method for automatically clustering similar attribute values in a database system spanning multiple domains. The method constructs an attribute abstraction hierarchy for each attribute using rules that are derived from the database instance. Each rule has a confidence and a popularity, which combine to express the "usefulness" of the rule. Attribute values are clustered if they are used as the premise for rules with the same consequence. By iteratively applying the algorithm, a hierarchy of clusters can be found. The algorithm can be improved by allowing domain expert supervision during the clustering process. An example and experimental results from a large transportation database are included.
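The clustering step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names `derive_rules` and `cluster_values`, the toy `rows` data, the 0.8 confidence threshold, and the exact definitions of confidence (fraction of premise rows that also satisfy the consequence) and popularity (fraction of all rows that satisfy the premise) are assumptions made here for illustration; the abstract does not spell them out.

```python
from collections import Counter, defaultdict


def derive_rules(rows, premise_attr, consequence_attr):
    """Derive rules 'premise_attr = a -> consequence_attr = b' from the rows.

    Returns (a, b, confidence, popularity) tuples. Both measures are
    assumed definitions, chosen for illustration only.
    """
    n = len(rows)
    premise_counts = Counter(r[premise_attr] for r in rows)
    pair_counts = Counter((r[premise_attr], r[consequence_attr]) for r in rows)
    rules = []
    for (a, b), both in pair_counts.items():
        confidence = both / premise_counts[a]  # how often the rule holds
        popularity = premise_counts[a] / n     # how often the rule applies
        rules.append((a, b, confidence, popularity))
    return rules


def cluster_values(rows, premise_attr, consequence_attr, min_confidence=0.8):
    """Group premise values whose confident rules share one consequence."""
    clusters = defaultdict(set)
    for a, b, confidence, _popularity in derive_rules(
        rows, premise_attr, consequence_attr
    ):
        if confidence >= min_confidence:
            clusters[b].add(a)
    # Keep only consequences that actually group two or more values.
    return {b: values for b, values in clusters.items() if len(values) > 1}


# Made-up toy relation; not data from the paper.
rows = [
    {"aircraft": "C-5", "runway": "long"},
    {"aircraft": "C-5", "runway": "long"},
    {"aircraft": "C-141", "runway": "long"},
    {"aircraft": "C-130", "runway": "short"},
    {"aircraft": "C-12", "runway": "short"},
    {"aircraft": "C-12", "runway": "short"},
]
clusters = cluster_values(rows, "aircraft", "runway")
```

On this toy relation, `C-5` and `C-141` fall into one cluster because both are premises of rules concluding `runway = long`, and `C-130` and `C-12` fall into another. Replacing each value with a label for its cluster and running the same step again would be one way to build the higher levels of the hierarchy that the abstract mentions.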

Wesley W. Chu | Matthew Merzbacher

[1] Qiming Chen, et al. Cooperative Query Answering via Type Abstraction Hierarchy, 1991.

[2] Frédéric Cuppens, et al. Cooperative Answering: A Methodology to Provide Intelligent Access to Databases, 1988, Expert Database Conf.

[3] Gerard Salton, et al. Comment on "An evaluation of query expansion by the addition of clustered terms for a document retrieval system", 1972, Inf. Storage Retr.

[4] Ryszard S. Michalski, et al. Conceptual Clustering: Inventing Goal-Oriented Classifications of Structured Objects, 1986.

[5] Jiawei Han, et al. Attribute-Oriented Induction in Relational Databases, 1991, Knowledge Discovery in Databases.

[6] Qiming Chen, et al. Pattern-Based Knowledge Induction from Databases, 1993.

[7] Bruce G. Buchanan, et al. Cooperating knowledge-based systems, 1988.