Attribute-Oriented Induction in Relational Databases

It is beneficial as well as challenging to learn knowledge rules from relational databases because of the vast amount of knowledge implied in databases and the large amount of data stored in databases. In this thesis, we develop an attributeoriented induction method to extract characteristic rules and classification rules from relational databases. The method adopts the artificial intelligence "learning from examples" paradigm and applies an attribute-oriented concept tree ascending technique in the learning process which integrates database operations with the learning process and provides a simple, efficient way of learning from large databases. Conjunctive rules as well as restricted forms of disjunctive rules are learned using this method. Moreover, by incorporating statistical techniques, qualitative rules with quantitative information can be learned and noisy data and exceptional cases are elegantly handled. Our analysis of the algorithms indicates that attribute-oriented induction substantially reduces the complexity of database learning processes. A prototype database learning system, DBLEARN, has been designed and implemented; early experiments with the prototype system illustrate the promise of attribute-oriented learning in relational databases.

[1]  Nada Lavrac,et al.  The Multi-Purpose Incremental Learning System AQ15 and Its Testing Application to Three Medical Domains , 1986, AAAI.

[2]  Joan Feigenbaum,et al.  Factorization in Experiment Generation , 1986, AAAI.

[3]  Jan M. Zytkow,et al.  Combining many searches in the FAHRENHEIT discovery system , 1987 .

[4]  Abraham Silberschatz,et al.  Database System Concepts , 1980 .

[5]  H. Simon,et al.  Rediscovering Chemistry with the Bacon System , 1983 .

[6]  Ronald L. Iman,et al.  A Modern Approach To Statistics , 1983 .

[7]  Ryszard S. Michalski,et al.  Automated Construction of Classifications: Conceptual Clustering Versus Numerical Taxonomy , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Jack Minker,et al.  Logic and Databases: A Deductive Approach , 1984, CSUR.

[9]  Michael R. Genesereth,et al.  Logical foundations of artificial intelligence , 1987 .

[10]  Douglas H. Fisher,et al.  A Computational Account of Basic Level and Typicality Effects , 1988, AAAI.

[11]  Ryszard S. Michalski,et al.  A theory and methodology of inductive learning , 1993 .

[12]  Frederick Hayes-Roth,et al.  An interference matching technique for inducing abstractions , 1978, CACM.

[13]  David Haussler Learning Conjunctive Concepts in Structural Domains , 1989, Mach. Learn..

[14]  David Haussler Bias, Version Spaces and Valiant's Learning Framework , 1987 .

[15]  Tom M. Mitchell,et al.  An Analysis of Generalization as a Search Problem , 1979, IJCAI.

[16]  Frederick Hayes-Roth,et al.  Knowledge acquisition from structural descriptions , 1977, IJCAI 1977.

[17]  Avron Barr,et al.  The Handbook of Artificial Intelligence , 1982 .

[18]  R. Michalski How to Learn Imprecise Concepts: A Method for Employing a Two-Tiered Knowledge Representation in Learning , 1987 .

[19]  D. Kleinbaum,et al.  Applied Regression Analysis and Other Multivariate Methods , 1978 .

[20]  David Haussler Quantifying the Inductive Bias in Concept Learning (Extended Abstract) , 1986, AAAI.

[21]  John Grant,et al.  Foundations of Semantic Query Optimization for Deductive Databases , 1988, Foundations of Deductive Databases and Logic Programming..

[22]  J. Ross Quinlan,et al.  Learning Efficient Classification Procedures and Their Application to Chess End Games , 1983 .

[23]  Z. Pawlak Learning from examples , 1986 .

[24]  Douglas H. Fisher,et al.  Improving Inference through Conceptual Clustering , 1987, AAAI.

[25]  Barr and Feigenbaum Edward A. Avron,et al.  The Handbook of Artificial Intelligence , 1981 .

[26]  李幼升,et al.  Ph , 1989 .

[27]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[28]  Herbert A. Simon,et al.  The Processes of Scientific Discovery: The Strategy of Experimentation , 1988, Cogn. Sci..

[29]  Robert E. Stepp,et al.  Concepts in Conceptual Clustering , 1987, IJCAI.

[30]  Tom M. Mitchell,et al.  Version Spaces: A Candidate Elimination Approach to Rule Learning , 1977, IJCAI.

[31]  C. Roads,et al.  The Handbook of Artificial Intelligence, Volume 1 , 1982 .

[32]  Larry Kerschberg,et al.  Mining for Knowledge in Databases: Goals and General Description of the INLEN System , 1989, Knowledge Discovery in Databases.

[33]  Sholom M. Weiss,et al.  Optimizing the Predictive Value of Diagnostic Decision Rules , 1987, AAAI.

[34]  Mieczyslaw M. Kokar COPER: a methodology for learning invariant functional descriptions , 1986 .

[35]  Thomas G. Dietterich,et al.  A Comparative Review of Selected Methods for Learning from Examples , 1983 .

[36]  Stuart J. Russell Tree-Structured Bias , 1988, AAAI.

[37]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[38]  Patrick Henry Winston,et al.  Learning structural descriptions from examples , 1970 .

[39]  S. Vera,et al.  Induction of Concepts in the Predicate Calculus , 1975, IJCAI.

[40]  King-Sun Fu,et al.  Conceptual Clustering in Knowledge Organization , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Thomas G. Dietterich,et al.  Inductive Learning of Structural Descriptions: Evaluation Criteria and Comparative Review of Selected Methods , 1981, Artif. Intell..

[42]  Tom M. Mitchell,et al.  MODEL-DIRECTED LEARNING OF PRODUCTION RULES1 , 1978 .

[43]  Raymond Reiter,et al.  Towards a Logical Reconstruction of Relational Database Theory , 1982, On Conceptual Modelling.

[44]  Michel Manago,et al.  Noise and Knowledge Acquisition , 1987, IJCAI.

[45]  Jiawei Han,et al.  An attribute-oriented approach for learning classification rules from relational databases , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.

[46]  Jeffrey D. Uuman Principles of database and knowledge- base systems , 1989 .