Using domain knowledge in knowledge discovery

With the explosive growth of the size of databases, many knowledge discovery applications deal with large quantities of data. There is an urgent need to develop methodologies which will allow the applications to focus search to a potentially interesting and relevant portion of the data, which can reduce the computational complexity of the knowledge discovery process and improve the interestingness of discovered knowledge. Previous work on semantic query optimization, which is an approach to take advantage of domain knowledge for query optimization, has demonstrated that significant cost reduction can be achieved by reformulating a query into a less expensive yet equivalent query which produces the same answer as the original one. In this paper, we introduce a method to utilize three types of domain knowledge in reducing the cost of finding a potentially interesting and relevant portion of the data while improving the quality of discovered knowledge. In addition, we propose a method to select relevant domain knowledge without an exhaustive search of all domain knowledge. The contribution of this paper is that we lay out a general framework for using domain knowledge in the knowledge discovery process effectively by providing guidelines.

[1]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[2]  Michael Stonebraker,et al.  Database systems: achievements and opportunities , 1991, CACM.

[3]  Jeffrey D. Ullman,et al.  Principles Of Database And Knowledge-Base Systems , 1979 .

[4]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[5]  John Grant,et al.  Logic-based approach to semantic query optimization , 1990, TODS.

[6]  Won Kim,et al.  Introduction to Object-Oriented Databases , 1991, Computer systems.

[7]  E. K. Park,et al.  Intelligent query answering in deductive and object-oriented databases , 1994, CIKM '94.

[8]  Vasant Dhar,et al.  Abstract-Driven Pattern Discovery in Databases , 1992, IEEE Trans. Knowl. Data Eng..

[9]  John Wylie Lloyd,et al.  Foundations of Logic Programming , 1987, Symbolic Computation.

[10]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[11]  Larry Kerschberg,et al.  Mining for Knowledge in Databases: Goals and General Description of the INLEN System , 1989, Knowledge Discovery in Databases.

[12]  Jack Minker,et al.  Semantic Query Optimization in Expert Systems and Database Systems , 1984, Expert Database Workshop.

[13]  Philip S. Yu,et al.  An effective hash-based algorithm for mining association rules , 1995, SIGMOD '95.

[14]  François Bancilhon,et al.  Building an Object-Oriented Database System, The Story of O2 , 1992 .

[15]  Jack Minker,et al.  Logic and Databases: A Deductive Approach , 1984, CSUR.

[16]  Jeffrey D. Uuman Principles of database and knowledge- base systems , 1989 .

[17]  Elke A. Rundensteiner A classification algorithm for supporting object-oriented views , 1994, CIKM '94.

[18]  Jiawei Han,et al.  Data-Driven Discovery of Quantitative Rules in Relational Databases , 1993, IEEE Trans. Knowl. Data Eng..

[19]  Michael Siegel,et al.  Automatic Rule Derivation For Semantic Query Optimization , 1989, Expert Database Conf..

[20]  Carlo Zaniolo,et al.  Metaqueries for Data Mining , 1996, Advances in Knowledge Discovery and Data Mining.

[21]  E. K. Park,et al.  Semantic query processing in object-oriented databases using deductive approach , 1995, CIKM '95.

[22]  Suk-Chung Yoon,et al.  Semantic Query Processing in Deductive Object-Oriented Databases , 1995, SEKE.

[23]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[24]  Jörg Rech,et al.  Knowledge Discovery in Databases , 2001, Künstliche Intell..

[25]  John Lloyd,et al.  Foundation of logic programming , 1983 .

[26]  David Maier,et al.  Readings in Object-Oriented Database Systems , 1989 .

[27]  Jeffrey D. Ullman,et al.  Principles of Database and Knowledge-Base Systems, Volume II , 1988, Principles of computer science series.

[28]  Heikki Mannila,et al.  Finding interesting rules from large sets of discovered association rules , 1994, CIKM '94.

[29]  Usama M. Fayyad,et al.  Knowledge Discovery in Databases: An Overview , 1997, ILP.

[30]  Gottfried Vossen,et al.  Query Processing for Advanced Database Systems , 1993 .