Agent Based Distributed Data Mining

This paper presents an agent-based distributed data mining approach dealing with heterogeneous databases located at different sites. It introduces a modified decision tree algorithm on an agent based framework, which produces an accurate global model without transferring data between agents. The novel approach is evaluated over a test bed of texture feature data of 184 aerial photograph images. The experimental results show that the distributed version with more agents outperforms the version with fewer agents when the rule generation from the large database is not complicated.

[1]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[2]  Anil K. Jain,et al.  A multi-channel filtering approach to texture segmentation , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  D. Madigan,et al.  Bayesian Model Averaging for Linear Regression Models , 1997 .

[4]  Yike Guo,et al.  Probing Knowledge in Distributed Data Mining , 1999, PAKDD.

[5]  Salvatore J. Stolfo,et al.  JAM: Java Agents for Meta-Learning over Distributed Databases , 1997, KDD.