Distributed Data Mining Using An Agent Based Architecture

Algorithm scalability and the distributed nature of both data and computation deserve serious attention in the context of data mining. This paper presents PADMA (PArallel Data Mining Agents), a parallel agent based system, that makes an effort to address these issues. PADMA contains modules for (1) parallel data accessing operations, (2) parallel hierarchical clustering, and (3) web-based data visualization. This paper describes the general architecture of PADMA and experimental results.

[1]  M Damashek,et al.  Gauging Similarity with n-Grams: Language-Independent Categorization of Text , 1995, Science.

[2]  Pattie Maes,et al.  Agents that reduce work and information overload , 1994, CACM.

[3]  Peter Willett,et al.  Recent trends in hierarchic document clustering: A critical review , 1988, Inf. Process. Manag..

[4]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[5]  Pattie Maes,et al.  A learning interface agent for scheduling meetings , 1993, IUI '93.

[6]  David J. DeWitt,et al.  Parallel database systems: the future of high performance database systems , 1992, CACM.

[7]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[8]  Richard R. Muntz,et al.  On heterogeneous distributed geoscientific query processing , 1996, Proceedings RIDE '96. Sixth International Workshop on Research Issues in Data Engineering.

[9]  Pattie Maes,et al.  Collaborative Interface Agents , 1994, AAAI.

[10]  W. B. Cavnar,et al.  N-Gram-Based Text Filtering For TREC-2 , 1993, TREC.

[11]  Srinivasan Parthasarathy,et al.  Parallel Data Mining for Association Rules on Shared-Memory Multi-Processors , 1996, Proceedings of the 1996 ACM/IEEE Conference on Supercomputing.

[12]  Johnz Willett Similarity and Clustering in Chemical Information Systems , 1987 .

[13]  Arno Siebes,et al.  Data surveyor: the nuggets in parallel , 1996, KDD 1996.

[14]  N. J. Radcliffe,et al.  GA-MINER: Parallel Data Mining with Hierarchical Genetic Algorithms Final Report , 1995 .

[15]  G Salton,et al.  Automatic Analysis, Theme Generation, and Summarization of Machine-Readable Texts , 1994, Science.

[16]  Larry R. Harris,et al.  Understanding natural language using a variable grammar , 1975 .

[17]  Andrew A. Chien,et al.  PPFS: a high performance portable parallel file system , 1995, ICS '95.

[18]  L. Foner What''s an Agent, Anyway? A Sociological Case Study. MIT Media Lab , 1997 .