论文信息 - Hardware Implementation and Analysis of Gen-Z Protocol for Memory-Centric Architecture

Hardware Implementation and Analysis of Gen-Z Protocol for Memory-Centric Architecture

With the increase in memory-intensive applications, a memory-centric architecture has been proposed in which the central processing units (CPUs) access a pool of fabric-attached memory. This architecture eliminates the dependency of system components and provides benefits for achieving an independent upgrade cycle and fine-grained resource control. However, developing a memory-centric architecture requires new hardware and software for achieving the low-latency and high-bandwidth communication between the memory and the CPU. This paper presents a hardware prototype of a memory-centric architecture using Gen-Z, which is a new universal system interconnect optimized for ultralow latency and ultra-high bandwidth. The Gen-Z hardware prototype was designed according to the core specification 1.0a and implemented in two types of host interfaces. In this study, we measured the performance of the Gen-Z hardware prototype, i.e., the latency and throughput, and compared it with of the solid-state drive (SSD) and local memory. The experimental results indicated that the performance of remote memory access for a specific write request that utilizes the Gen-Z protocol was better than that of the SSD and local memory. Further, we discussed methods for improving the performance of the Gen-Z prototype.

[1] Scott Shenker,et al. Network Requirements for Resource Disaggregation , 2016, OSDI.

[2] George Porter,et al. Is memory disaggregation feasible? A case study with Spark SQL , 2016, 2016 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS).

[3] Marcos K. Aguilera,et al. Can far memory improve job throughput? , 2020, EuroSys.

[4] Thomas F. Wenisch,et al. Thermostat: Application-transparent Page Management for Two-tiered Main Memory , 2017, ASPLOS.

[5] Jing Guo,et al. Who Limits the Resource Efficiency of My Datacenter: An Analysis of Alibaba Datacenter Traces , 2019, 2019 IEEE/ACM 27th International Symposium on Quality of Service (IWQoS).

[6] Thomas F. Wenisch,et al. System-level implications of disaggregated memory , 2012, IEEE International Symposium on High-Performance Comp Architecture.

[7] Kostas Katrinis,et al. A software-defined architecture and prototype for disaggregated memory rack scale systems , 2017, 2017 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS).

[8] Kostas Katrinis,et al. Rack-scale disaggregated cloud data centers: The dReDBox project vision , 2016, 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[9] Scott Shenker,et al. Network support for resource disaggregation in next-generation datacenters , 2013, HotNets.

[10] Kang G. Shin,et al. Efficient Memory Disaggregation with Infiniswap , 2017, NSDI.

[11] David Valentine,et al. For the machine , 2017, Interpretation of Commercial Contracts in European Private Law.

[12] Jiajia Chen,et al. Disaggregated Data Centers: Challenges and Trade-offs , 2020, IEEE Communications Magazine.

[13] Thomas F. Wenisch,et al. Disaggregated memory for expansion and sharing in blade servers , 2009, ISCA '09.

[14] Gerald Q. Maguire,et al. Software-Defined “Hardware” Infrastructures: A Survey on Enabling Technologies and Open Research Directions , 2018, IEEE Communications Surveys & Tutorials.