Paper information - Self-organizing systems for knowledge discovery in large databases

We present a framework in which self-organizing systems can be used to perform change of representation on knowledge discovery problems and to learn from very large databases. Clustering with self-organizing maps produces multiple intermediate training targets, which are used to define a new supervised learning and mixture estimation problem. The input data is partitioned using a state space search over subdivisions of the attributes; a self-organizing map is then applied to the input data restricted to each resulting subset of input attributes. This approach yields the variance-reducing benefits of techniques such as stacked generalization, but uses self-organizing systems to discover factorial (modular) structure among abstract learning targets. This research demonstrates the feasibility of applying such structure in very large databases to build a mixture of artificial neural networks (ANNs) for data mining and knowledge discovery in databases (KDD).
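
The clustering step described in the abstract might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names (`train_som`, `intermediate_targets`), the one-dimensional map topology, the linear decay schedules, and the synthetic data are all assumptions, and a fixed, hand-chosen attribute partition stands in for the state space search. Each attribute subset gets its own small self-organizing map, and the index of the best-matching unit serves as an intermediate training target.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def best_unit(weights, x):
    """Index of the map unit whose weight vector is closest to x."""
    return min(range(len(weights)), key=lambda j: sq_dist(weights[j], x))


def train_som(data, n_units=4, n_epochs=10, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small 1-D self-organizing map and return its unit weights."""
    rng = random.Random(seed)
    # Initialise each unit from a distinct, randomly chosen data point.
    weights = [list(row) for row in rng.sample(data, n_units)]
    n_steps = n_epochs * len(data)
    step = 0
    for _ in range(n_epochs):
        order = list(range(len(data)))
        rng.shuffle(order)
        for idx in order:
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)               # learning rate decays linearly
            sigma = sigma0 * (1.0 - frac) + 1e-3  # neighbourhood width shrinks too
            x = data[idx]
            bmu = best_unit(weights, x)
            for j, w in enumerate(weights):
                # Gaussian neighbourhood around the best-matching unit.
                h = math.exp(-((j - bmu) ** 2) / (2.0 * sigma ** 2))
                for d in range(len(w)):
                    w[d] += lr * h * (x[d] - w[d])
            step += 1
    return weights


def intermediate_targets(rows, attribute_subsets, n_units=4):
    """Return one SOM cluster label per attribute subset for every row."""
    columns = []
    for k, cols in enumerate(attribute_subsets):
        # Restrict the input data to this subset of attributes.
        restricted = [[row[c] for c in cols] for row in rows]
        weights = train_som(restricted, n_units=n_units, seed=k)
        columns.append([best_unit(weights, x) for x in restricted])
    # Transpose so each row holds one label per attribute subset.
    return [list(labels) for labels in zip(*columns)]


# Synthetic data and a hypothetical fixed partition of six attributes.
rng = random.Random(42)
rows = [[rng.gauss(0.0, 1.0) for _ in range(6)] for _ in range(200)]
partition = [[0, 1, 2], [3, 4, 5]]
targets = intermediate_targets(rows, partition, n_units=4)
print(len(targets), len(targets[0]))
```

In a fuller pipeline, each column of `targets` would presumably become the training target for a separate supervised learner, with the learners' outputs combined by a mixture model; that stage, and the search over candidate partitions, are left out of this sketch.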
