Mining for knowledge in databases: The INLEN architecture, initial implementation and first results

The architecture of an intelligent multistrategy assistant for knowledge discovery from facts, INLEN, is described and illustrated by an exploratory application. INLEN integrates a database, a knowledge base, and machine learning methods within a uniform user-oriented framework. A variety of machine learning programs are incorporated into the system to serve as high-level knowledge generation operators (KGOs). These operators can generate diverse kinds of knowledge about the properties and regularities existing in the data. For example, they can hypothesize general rules from facts, optimize the rules according to problem-dependent criteria, determine differences and similarities among groups of facts, propose new variables, create conceptual classifications, determine equations governing numeric variables and the conditions under which the equations apply, and derive statistical properties and use them for qualitative evaluations. The initial implementation of the system, INLEN 1b, is described, and its performance is illustrated by applying it to a database of scientific publications.
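To make the idea of a knowledge generation operator concrete, the following is a minimal sketch, not INLEN's actual design or code. It assumes facts are flat attribute-value records, treats a KGO as a plain function from a set of facts to a piece of knowledge, and uses a toy `characterize` operator that returns the conditions shared by all selected facts. The names `Assistant`, `register`, `apply`, and `characterize`, and the sample publication records, are all hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Fact = Dict[str, str]
# Hypothetical operator type: a set of facts in, a piece of knowledge out.
KGO = Callable[[List[Fact]], Dict[str, str]]


def characterize(facts: List[Fact]) -> Dict[str, str]:
    """Toy operator: return the attribute=value conditions true of every fact."""
    if not facts:
        return {}
    common = dict(facts[0])
    for fact in facts[1:]:
        common = {k: v for k, v in common.items() if fact.get(k) == v}
    return common


class Assistant:
    """Ties together a fact store, a knowledge store, and named operators."""

    def __init__(self, facts: List[Fact]) -> None:
        self.facts = facts  # stands in for the database
        # Stands in for the knowledge base: (operator, selection, result).
        self.knowledge: List[Tuple[str, Dict[str, str], Dict[str, str]]] = []
        self.operators: Dict[str, KGO] = {}

    def register(self, name: str, operator: KGO) -> None:
        self.operators[name] = operator

    def apply(self, name: str, **selection: str) -> Dict[str, str]:
        """Run an operator on the facts matching `selection`; store the result."""
        chosen = [f for f in self.facts
                  if all(f.get(k) == v for k, v in selection.items())]
        result = self.operators[name](chosen)
        self.knowledge.append((name, dict(selection), result))
        return result


# Made-up sample records about publications, for illustration only.
publications = [
    {"area": "learning", "type": "report", "decade": "1980s"},
    {"area": "learning", "type": "journal", "decade": "1980s"},
    {"area": "databases", "type": "journal", "decade": "1990s"},
]

assistant = Assistant(publications)
assistant.register("characterize", characterize)
rule = assistant.apply("characterize", area="learning")
print(rule)
```

In this sketch, each result is stored alongside the operator and the selection that produced it, so later operators could in principle draw on earlier results. That storage choice is an assumption of the example, not something the abstract specifies.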
