Knowledge Discovery in Databases: An Attribute-Oriented Approach

Knowledge discovery in databases, or data mining, is an important issue in the development of data- and knowledge-base systems. An attribute-oriented induction method has been developed for knowledge discovery in databases. The method integrates a machine learning paradigm, especially learning-from-examples techniques, with set-oriented database operations and extracts generalized data from actual data in databases. An attribute-oriented concept tree ascension technique is applied in generalization, which substantially reduces the computational complex@ of database learning processes. Different kinas of knowledge rules, including characteristic rules, discrimination rules, quantitative rules, and data evolution regularities can be discovered efficiently using the attribute-oriented approach. In addition to learning in relational databases, the approach can be applied to knowledge discovery in nested relational and deductive databases. Learning can also be performed with databases containing noisy data and exceptional cases using database statistics. Furthermore, the rules discovered can be used to query database knowledge, answer cooperative queries and facilitate semantic query optimization. Based upon these principles, a prototyped database learning system, DBLEARN, has been constructed for experimentation.

[1]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[2]  Thomas G. Dietterich,et al.  A Comparative Review of Selected Methods for Learning from Examples , 1983 .

[3]  Ryszard S. Michalski,et al.  A Theory and Methodology of Inductive Learning , 1983, Artificial Intelligence.

[4]  Jack Minker,et al.  Logic and Databases: A Deductive Approach , 1984, CSUR.

[5]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[6]  Joan Feigenbaum,et al.  Factorization in Experiment Generation , 1986, AAAI.

[7]  Michael R. Genesereth,et al.  Logical foundations of artificial intelligence , 1987 .

[8]  David Haussler Bias, Version Spaces and Valiant's Learning Framework , 1987 .

[9]  Douglas H. Fisher,et al.  Improving Inference through Conceptual Clustering , 1987, AAAI.

[10]  Michel Manago,et al.  Noise and Knowledge Acquisition , 1987, IJCAI.

[11]  Richard R. Muntz,et al.  Implicit Representation for Extensional Answers , 1988, Expert Database Conf..

[12]  Abraham Silberschatz,et al.  Extended algebra and calculus for nested relational databases , 1988, TODS.

[13]  Jeffrey D. Ullman,et al.  Principles of Database and Knowledge-Base Systems, Volume II , 1988, Principles of computer science series.

[14]  Frédéric Cuppens,et al.  Cooperative Answering: A Methodology to Provide Intelligent Access to databases , 1988, Expert Database Conf..

[15]  Larry Kerschberg,et al.  Mining for Knowledge in Databases: Goals and General Description of the INLEN System , 1989, Knowledge Discovery in Databases.

[16]  Jeffrey D. Uuman Principles of database and knowledge- base systems , 1989 .

[17]  John Grant,et al.  Logic-based approach to semantic query optimization , 1990, TODS.

[18]  Amihai Motro,et al.  Querying database knowledge , 1990, SIGMOD '90.

[19]  Ryszard S. Michalski,et al.  Machine learning: an artificial intelligence approach volume III , 1990 .

[20]  Jiawei Han,et al.  Learning in relational databases: an attribute‐oriented approach , 1991, Comput. Intell..

[21]  Andrew K. C. Wong,et al.  Statistical Technique for Extracting Classificatory Knowledge from Databases , 1991, Knowledge Discovery in Databases.

[22]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[23]  Michael Stonebraker,et al.  Database systems: achievements and opportunities , 1991, CACM.

[24]  Jiawei Han,et al.  Attribute-Oriented Induction in Relational Databases , 1991, Knowledge Discovery in Databases.

[25]  Jan M. Zytkow,et al.  Interactive Mining of Regularities in Databases , 1991, Knowledge Discovery in Databases.

[26]  Jiawei Han,et al.  Data-Driven Discovery of Quantitative Rules in Relational Databases , 1993, IEEE Trans. Knowl. Data Eng..