Dynamic Remote Memory Acquiring for Parallel Data Mining on PC Cluster: Preliminary Performance Results

Data-intensive applications such as data mining and data warehousing have recently attracted attention as some of the most important applications for high performance computing. As a platform, the PC/WS cluster is a promising candidate for future high performance computers because of its good scalability and cost-performance ratio. We have developed a large-scale ATM-connected PC cluster and implemented several database applications on it, including parallel data mining, to evaluate their performance and the feasibility of running such applications on PC clusters.
