Systems for Knowledge Discovery in Databases

Knowledge-discovery systems face challenging problems from real-world databases, which tend to be dynamic, incomplete, redundant, noisy, sparse, and very large. These problems are addressed and some techniques for handling them are described. A model of an idealized knowledge-discovery system is presented as a reference for studying and designing new systems. This model is used in the comparison of three systems: CoverStory, EXPLORA, and the Knowledge Discovery Workbench. The deficiencies of existing systems relative to the model reveal several open problems for future research. >

[1]  Richard Scheines,et al.  Finding latent variable models in large databases , 1992, Int. J. Intell. Syst..

[2]  John K. Ousterhout,et al.  Tcl: An Embeddable Command Language , 1989, USENIX Winter.

[3]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[4]  Shashi Shekhar,et al.  Learning Transformation Rules for Semantic Query Optimization: A Data-Driven Approach , 1993, IEEE Trans. Knowl. Data Eng..

[5]  Lawrence B. Holder,et al.  Discovery of Inexact Concepts from Structural Data , 1993, IEEE Trans. Knowl. Data Eng..

[6]  M. Pazzani,et al.  Concept formation knowledge and experience in unsupervised learning , 1991 .

[7]  Edward Rolf Tufte,et al.  The visual display of quantitative information , 1985 .

[8]  Willi Klösgen,et al.  A Support System for Interpreting Statistical Data , 1991, Knowledge Discovery in Databases.

[9]  Edward R. Tufte,et al.  The Visual Display of Quantitative Information , 1986 .

[10]  Larry Kerschberg,et al.  Mining for Knowledge in Databases: Goals and General Description of the INLEN System , 1989, Knowledge Discovery in Databases.

[11]  Jan M. Zytkow,et al.  Interactive Mining of Regularities in Databases , 1991, Knowledge Discovery in Databases.

[12]  G. Dunn,et al.  An Introduction to Mathematical Taxonomy , 1983 .

[13]  Douglas W. Nychka,et al.  Discovering Causal Structure , 1989 .

[14]  Ryszard S. Michalski,et al.  Machine learning: an artificial intelligence approach volume III , 1990 .

[15]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[16]  Heikki Mannila,et al.  Dependency Inference , 1987, VLDB.

[17]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[18]  Michael Stonebraker,et al.  Triggers and inference in data base systems , 1985, ACM '85.

[19]  Tomasz Imielinski,et al.  Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..

[20]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[21]  J. Ross Quinlan,et al.  Unknown Attribute Values in Induction , 1989, ML.

[22]  John D. C. Little,et al.  Coverstory: Automated News Finding in Marketing , 1990 .

[23]  Thomas G. Dietterich,et al.  Learning with Many Irrelevant Features , 1991, AAAI.

[24]  John H. Holland,et al.  Induction: Processes of Inference, Learning, and Discovery , 1987, IEEE Expert.

[25]  Vasant Dhar,et al.  Abstract-Driven Pattern Discovery in Databases , 1992, IEEE Trans. Knowl. Data Eng..

[26]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[27]  J. D. Uiiman Principles of database systems , 1982 .

[28]  Thomas G. Dietterich,et al.  Learning Boolean Concepts in the Presence of Many Irrelevant Features , 1994, Artif. Intell..

[29]  Paul Thagard,et al.  Induction: Processes Of Inference , 1989 .

[30]  R. Daniel Bergeron,et al.  Stereophonic and surface sound generation for exploratory data analysis , 1990, CHI '90.

[31]  Judea Pearl,et al.  A Theory of Inferred Causation , 1991, KR.

[32]  Y. Chien,et al.  Pattern classification and scene analysis , 1974 .

[33]  Heikki Mannila,et al.  On the Complexity of Inferring Functional Dependencies , 1992, Discret. Appl. Math..

[34]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[35]  Kevin T. Kelly,et al.  Discovering Causal Structure. , 1989 .

[36]  Steven F. Roth,et al.  Automating the presentation of information , 1991, [1991] Proceedings. The Seventh IEEE Conference on Artificial Intelligence Application.

[37]  Saso Dzeroski,et al.  Inductive Learning in Deductive Databases , 1993, IEEE Trans. Knowl. Data Eng..

[38]  Pat Langley,et al.  A general theory of discrimination learning , 1987 .

[39]  T. Anand,et al.  SPOTLIGHT: a data explanation system , 1992, Proceedings Eighth Conference on Artificial Intelligence for Applications.