Design and Implementation of an Agent Home Scheme Strategy for Prefetch-Based DSM Systems

In recent years, cluster computing has been widely investigated and there is no doubt that it can provide a cost-effective computing infrastructure by aggregating computational power, communication, and storage resources. Moreover, it is also considered to be a very attractive platform for low-cost supercomputing. Distributed shared memory (DSM) systems utilize the physical memory of each computing node interconnected in a private network to form a global virtual shared memory. Since this global shared memory is distributed among the computing nodes, accessing the data located in remote computing nodes is an absolute necessity. However, this action will result in significant remote memory access latencies which are major sources of overhead in DSM systems. For these reasons, in order to increase overall system performance and decrease this overhead, a number of strategies have been devised. Prefetching is one such approach which can reduce latencies, although it always increases the workload in the home nodes. In this paper, we propose a scheme named Agent Home Scheme. Its most noticeable feature, when compared to other schemes, is that the agent home distributes the workloads of each computing nodes when sending data. By doing this, we can reduce not only the workload of the home nodes by balancing the workload for each node, but also the waiting time. Experimental results show that the proposed method can obtain about 20% higher performance than the original JIAJIA, about 18% more than History Prefetching Strategy (HPS), and about 10% higher than Effective Prefetch Strategy (EPS).

[1]  H. Cheong,et al.  A cache coherence scheme with fast selective invalidation , 1988, [1988] The 15th Annual International Symposium on Computer Architecture. Conference Proceedings.

[2]  Hu Weiwu,et al.  Dynamic data prefetching in home-based software DSMs , 2001 .

[3]  Andrew S. Tanenbaum,et al.  Distributed operating systems , 2009, CSUR.

[4]  C. M. Mobarry,et al.  HRUNTING: a distributed shared memory system for the BEOWULF parallel workstation , 1998, 1998 IEEE Aerospace Conference Proceedings (Cat. No.98TH8339).

[5]  Kuan-Ching Li,et al.  On the Design and Implementation of an Effective Prefetch Strategy for DSM Systems , 2006, The Journal of Supercomputing.

[6]  Weisong Shi,et al.  Running real applications on software DSMs , 2000, Proceedings Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region.

[7]  Weisong Shi,et al.  Adaptive write detection in home-based software DSMs , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[8]  Joonwon Lee,et al.  An efficient lock protocol for home-based lazy release consistency , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[9]  Peter Membrey,et al.  The Linux Kernel , 2009 .

[10]  James R. Goodman,et al.  Cache Consistency and Sequential Consistency , 1991 .

[11]  Anoop Gupta,et al.  SPLASH: Stanford parallel applications for shared-memory , 1992, CARN.

[12]  Liviu Iftode,et al.  Performance evaluation of two home-based lazy release consistency protocols for shared virtual memory systems , 1996, OSDI '96.

[13]  Kuan-Ching Li,et al.  On Design of a Prefetching Strategy for DSM System , 2004, International Conference on Parallel and Distributed Processing Techniques and Applications.

[14]  Cho-Li Wang,et al.  Migrating-Home Protocol for Software Distributed Shared Memory , 2002, J. Inf. Sci. Eng..

[15]  Weiwu Hu,et al.  Dynamic data prefetching in home-based software DSMs , 2008, Journal of Computer Science and Technology.

[16]  Kuan-Ching Li,et al.  On design of agent home scheme for prefetching strategy in DSM systems , 2005, 19th International Conference on Advanced Information Networking and Applications (AINA'05) Volume 1 (AINA papers).

[17]  Assaf Schuster,et al.  A comparison of sequential consistency with home-based lazy release consistency for software distributed shared memory , 2004, ICS '04.

[18]  Cho-Li Wang,et al.  A Migrating-Home Protocol for Implementing Scope Consistency Model on a Cluster of Workstations , 1999, PDPTA.

[19]  Weisong Shi,et al.  Write Detection in Home-Based Software DSMs , 1999, Euro-Par.

[20]  Alan L. Cox,et al.  Lazy release consistency for software distributed shared memory , 1992, ISCA '92.

[21]  Anoop Gupta,et al.  The SPLASH-2 programs: characterization and methodological considerations , 1995, ISCA.

[22]  Alan L. Cox,et al.  Message Passing Versus Distributed Shared Memory on Networks of Workstations , 1995, Proceedings of the IEEE/ACM SC95 Conference.

[23]  Martin K. Purvis,et al.  Homeless and Home-based Lazy Release Consistency Protocols on Distributed Shared Memory , 2004, ACSC.

[24]  Weiwu Hu,et al.  A comparison of two strategies of dynamic data prefetching in software DSM , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[25]  Ming Li,et al.  A lock-based cache coherence protocol for scope consistency , 1998, Journal of Computer Science and Technology.

[26]  Daniel Pierre Bovet,et al.  Understanding the Linux Kernel , 2000 .

[27]  Kyu Ho Park,et al.  Moving home-based lazy release consistency for shared virtual memory systems , 1999, Proceedings of the 1999 International Conference on Parallel Processing.

[28]  Weisong Shi,et al.  Optimizing Home-Based Software DSM Protocols , 2004, Cluster Computing.

[29]  Weisong Shi,et al.  Home Migration in Home-Based Software DSMs , 1999 .

[30]  S. Okamoto,et al.  A moving home-based software DSM system , 2003, 2003 IEEE Pacific Rim Conference on Communications Computers and Signal Processing (PACRIM 2003) (Cat. No.03CH37490).

[31]  Weisong Shi,et al.  Evaluation of the JIAJIA software DSM system on high performance computer architectures , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[32]  Jih-Fu Tu,et al.  A dynamic data prefetching method of improving the memory latency , 2000, Proceedings Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region.

[33]  Weisong Shi,et al.  Reducing system overheads in home-based software DSMs , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[34]  Weisong Shi,et al.  JIAJIA: A Software DSM System Based on a New Cache Coherence Protocol , 1999, HPCN Europe.

[35]  Hai Jiang,et al.  MigThread: thread migration in DSM systems , 2002, Proceedings. International Conference on Parallel Processing Workshop.

[36]  A. Tanenbaum,et al.  DISTRIBUTED SYSTEMS PRINCIPLES AND PARADIGMS PROBLEM SOLUTIONS , 2001 .

[37]  Liviu Iftode,et al.  Scope Consistency: A Bridge between Release Consistency and Entry Consistency , 1996, SPAA '96.

[38]  David H. Bailey,et al.  The Nas Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl..

[39]  William Evan Speight,et al.  Utilizing home node prediction to improve the performance of software distributed shared memory , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..