A Web-Based Database Management System Supporting Parallel Data Mining Service on PC Clusters

In this study, we develop a database management system that provides users with on-line Web services for building and managing their databases and for performing parallel data mining over the Internet. We use two different programming toolkits, MPI (Message Passing Interface) and DSM (distributed shared memory), to parallelize association rule mining on PC clusters in order to minimize response time. In addition, we evaluate the performance of these two toolkits on the parallelization of association rule mining. Our experimental results show that both MPI and DSM are effective for parallel association rule mining, with MPI providing a 30 to 45% performance improvement over DSM.
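To make the idea of parallelizing association rule mining concrete, the following is a minimal sketch of one common scheme, count distribution, in which each node counts itemsets in its own share of the transactions and the partial counts are then summed. The abstract does not state which parallel algorithm the system uses, so this is an illustration under that assumption rather than the authors' method. The function names `local_counts` and `frequent_itemsets` and the round-robin partitioning are hypothetical, and the workers are simulated sequentially in one process instead of running on a cluster.

```python
from collections import Counter
from itertools import combinations


def local_counts(partition, k):
    """Count every k-itemset occurring in one worker's share of the transactions."""
    counts = Counter()
    for transaction in partition:
        # Deduplicate and sort so each itemset has one canonical tuple form.
        for itemset in combinations(sorted(set(transaction)), k):
            counts[itemset] += 1
    return counts


def frequent_itemsets(transactions, k, min_support, n_workers):
    """Simulate count distribution: partition, count locally, then sum the counts."""
    # Assumption: round-robin partitioning stands in for spreading data over nodes.
    partitions = [transactions[i::n_workers] for i in range(n_workers)]
    # Each call below would run on its own node in a real cluster.
    partials = [local_counts(p, k) for p in partitions]
    # Summing the partial counters plays the role of a global reduction
    # (an all-reduce under message passing, or a shared count table under DSM).
    total = sum(partials, Counter())
    return {itemset: c for itemset, c in total.items() if c >= min_support}


if __name__ == "__main__":
    baskets = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a", "b", "c"]]
    print(frequent_itemsets(baskets, k=2, min_support=3, n_workers=2))
```

Because the global counts are plain sums of the local counts, the set of frequent itemsets does not depend on how many workers are used; only the amount of data each worker scans changes.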
