Towards Efficient Induction Mechanisms in Database Systems

Abstract With the wide availability of huge amounts of data in database systems, the extraction of knowledge in databases by efficient and powerful induction or knowledge discovery mechanisms has become an important issue in the construction of new generation database and knowledge-base systems. In this article, an attribute-oriented induction method for knowledge discovery in databases is investigated, which provides an efficient, set-oriented induction mechanism for extraction of different kinds of knowledge rules, such as characteristic rules, discriminant rules, data evolution regularities and high level dependency rules in large relational databases. Our study shows that the method is robust in the existence of noise and database updates, is extensible to knowledge discovery in advanced and/or special purpose databases, such as object-oriented databases, active databases, spatial databases, etc., and has wide applications.

[1]  Ryszard S. Michalski,et al.  A theory and methodology of inductive learning , 1993 .

[2]  Michael Stonebraker,et al.  Database systems: achievements and opportunities , 1991, CACM.

[3]  Won Kim,et al.  Introduction to Object-Oriented Databases , 1991, Computer systems.

[4]  Wojciech Ziarko,et al.  The Discovery, Analysis, and Representation of Data Dependencies in Databases , 1991, Knowledge Discovery in Databases.

[5]  J. D. Uiiman,et al.  Principles of Database Systems , 2004, PODS 2004.

[6]  Gregory Piatetsky-Shapiro,et al.  Knowledge Discovery in Databases: An Overview , 1992, AI Mag..

[7]  Jiawei Han,et al.  Attribute-Oriented Induction in Relational Databases , 1991, Knowledge Discovery in Databases.

[8]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[9]  Jiawei Han,et al.  Data-Driven Discovery of Quantitative Rules in Relational Databases , 1993, IEEE Trans. Knowl. Data Eng..

[10]  Hamid Pirahesh,et al.  Extensions to Starburst: objects, types, functions, and rules , 1991, CACM.

[11]  Umeshwar Dayal,et al.  A uniform model for temporal object-oriented databases , 1992, [1992] Eighth International Conference on Data Engineering.

[12]  Michael R. Genesereth,et al.  Logical foundations of artificial intelligence , 1987 .

[13]  Thomas G. Dietterich,et al.  A Comparative Review of Selected Methods for Learning from Examples , 1983 .

[14]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[15]  Jiawei Han,et al.  Knowledge Discovery in Databases: An Attribute-Oriented Approach , 1992, VLDB.

[16]  Douglas H. Fisher,et al.  Improving Inference through Conceptual Clustering , 1987, AAAI.

[17]  Earl B. Hunt,et al.  Machine learning: An artificial intelligence approach (vol. 2): R. S. Michalski, J. G. Carbonell, and T. M. Mitchell (Eds.). Los Alton, CA: Morgan Kaufmann, 1986. Pp. x + 738. $39.95 , 1987 .

[18]  Jeffrey D. Ullman,et al.  Principles Of Database And Knowledge-Base Systems , 1979 .

[19]  Jan M. Zytkow,et al.  Interactive Mining of Regularities in Databases , 1991, Knowledge Discovery in Databases.

[20]  Ryszard S. Michalski,et al.  Machine learning: an artificial intelligence approach volume III , 1990 .

[21]  Joan Feigenbaum,et al.  Factorization in Experiment Generation , 1986, AAAI.

[22]  Jiawei Han,et al.  Efficient Deduction and Induction: Key to the Success of Data-Intensive Knowledge-Base Systems , 1992, Formal Methods in Databases and Software Engineering.

[23]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[24]  Umeshwar Dayal,et al.  The architecture of an active database management system , 1989, SIGMOD '89.

[25]  Raymond Reiter,et al.  Towards a Logical Reconstruction of Relational Database Theory , 1982, On Conceptual Modelling.

[26]  Catriel Beeri,et al.  A Model for Active Object Oriented Databases , 1991, VLDB.

[27]  Jiawei Han Knowledge Discovery in Object-Oriented and Active Databases , 1993 .

[28]  François Bancilhon,et al.  Building an Object-Oriented Database System, The Story of O2 , 1992 .