Self-organizing systems for knowledge discovery in large databases

We present a framework in which self-organizing systems are used to perform change of representation on knowledge discovery problems and to learn from very large databases. Clustering with self-organizing maps produces multiple intermediate training targets, which define a new supervised learning and mixture estimation problem. The input attributes are partitioned by a state space search over subdivisions of the attribute set, and a self-organizing map is applied to the input data restricted to each subset of attributes. This approach yields the variance-reducing benefits of techniques such as stacked generalization, but uses self-organizing systems to discover factorial (modular) structure among abstract learning targets. This research demonstrates the feasibility of applying such structure in very large databases to build a mixture of artificial neural networks (ANNs) for data mining and knowledge discovery in databases (KDD).
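The clustering step described above can be illustrated with a minimal sketch. It assumes a small one-dimensional self-organizing map written in plain Python and takes the attribute partition as given, so the state space search over partitions is not shown. The names `train_som`, `best_matching_unit`, and `intermediate_targets`, the toy data, and all parameter values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import random


def best_matching_unit(units, x):
    """Return the index of the unit whose weights are closest to x."""
    return min(
        range(len(units)),
        key=lambda j: sum((w - xi) ** 2 for w, xi in zip(units[j], x)),
    )


def train_som(rows, n_units=4, epochs=20, lr0=0.5, seed=0):
    """Train a 1-D self-organizing map and return its unit weight vectors."""
    rng = random.Random(seed)
    # Initialize each unit from a randomly chosen data row.
    units = [list(rng.choice(rows)) for _ in range(n_units)]
    total_steps = epochs * len(rows)
    step = 0
    for _ in range(epochs):
        for x in rng.sample(rows, len(rows)):  # shuffled pass over the data
            frac = 1.0 - step / total_steps
            lr = lr0 * frac                     # decaying learning rate
            radius = int(n_units / 2 * frac)    # shrinking neighborhood
            bmu = best_matching_unit(units, x)
            for j in range(n_units):
                if abs(j - bmu) <= radius:
                    # Move the unit toward the input.
                    units[j] = [w + lr * (xi - w) for w, xi in zip(units[j], x)]
            step += 1
    return units


def intermediate_targets(data, attribute_partition, n_units=4):
    """Cluster the data restricted to each attribute subset.

    Returns one list of cluster labels per subset; each list could serve
    as an intermediate training target for a supervised learner.
    """
    targets = []
    for subset in attribute_partition:
        restricted = [[row[i] for i in subset] for row in data]
        units = train_som(restricted, n_units=n_units)
        targets.append([best_matching_unit(units, r) for r in restricted])
    return targets


# Toy data: four rows, four attributes, split into two attribute subsets.
data = [
    [0.0, 0.1, 5.0, 5.1],
    [0.1, 0.0, 9.0, 9.2],
    [1.0, 0.9, 5.1, 5.0],
    [0.9, 1.0, 9.1, 9.0],
]
partition = [(0, 1), (2, 3)]
targets = intermediate_targets(data, partition, n_units=2)
print(targets)
```

Each inner list in `targets` holds one cluster label per input row for one attribute subset. In a fuller pipeline, one supervised model per subset could be trained to predict its label list, and the resulting models could then be combined in a mixture.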
