Demystifying CXL Memory with Genuine CXL-Ready Systems and Devices

The high demand for memory capacity in modern datacenters has led to multiple lines of innovation in memory expansion and disaggregation. One such effort is Compute eXpress Link (CXL)-based memory expansion, which has gained significant attention. To better leverage CXL memory, researchers have built several emulation and experimental platforms to study its behavior and characteristics. However, due to the lack of commercial hardware supporting CXL memory, the full picture of its capabilities may still be unclear to the community. In this work, we explore CXL memory's performance characterization on a state-of-the-art experimental platform. First, we study the basic performance characteristics of CXL memory using our proposed microbenchmark. Based on our observations and comparisons to standard DRAM connected to local and remote NUMA nodes, we also study the impact of CXL memory on end-to-end applications with different offloading and interleaving policies. Finally, we provide several guidelines for future programmers to realized the full potential of CXL memory

[1]  Myoungsoo Jung,et al.  Memory Pooling With CXL , 2023, IEEE Micro.

[2]  Daniel S. Berger,et al.  Design Tradeoffs in CXL-Based Memory Pools for Public Cloud Platforms , 2023, IEEE Micro.

[3]  Debendra Das Sharma Compute Express Link (CXL): Enabling Heterogeneous Data-Centric Computing With Heterogeneous Memory Hierarchy , 2023, IEEE Micro.

[4]  Shobhit O. Kanaujia,et al.  TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory , 2022, ASPLOS.

[5]  J. Rao,et al.  Characterizing the performance of intel optane persistent memory: a close look at its on-DIMM buffering , 2022, EuroSys.

[6]  M. Erez,et al.  HeMem: Scalable Tiered Memory Management for Big Data Applications and Real NVM , 2021, SOSP.

[7]  Tae Jun Ham,et al.  MERCI: efficient embedding reduction on commodity hardware via sub-query memoization , 2021, ASPLOS.

[8]  Anuj Kalia,et al.  Challenges and solutions for fast remote persistent memory access , 2020, SoCC.

[9]  Steven Swanson,et al.  An Empirical Guide to the Behavior and Use of Scalable Persistent Memory , 2019, FAST.

[10]  Yuan He,et al.  An Open-Source Benchmark Suite for Microservices and Their Hardware-Software Implications for Cloud & Edge Systems , 2019, ASPLOS.

[11]  Jian Huang,et al.  FlatFlash: Exploiting the Byte-Accessibility of SSDs within a Unified Memory-Storage Hierarchy , 2019, ASPLOS.

[12]  Joo Young Hwang,et al.  2B-SSD: The Case for Dual, Byte- and Block-Addressable Solid-State Drives , 2018, 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA).

[13]  Kang G. Shin,et al.  Efficient Memory Disaggregation with Infiniswap , 2017, NSDI.

[14]  Miguel Castro,et al.  FaRM: Fast Remote Memory , 2014, NSDI.

[15]  Adam Silberstein,et al.  Benchmarking cloud serving systems with YCSB , 2010, SoCC '10.

[16]  Gavin Yamey,et al.  The Corporation , 2004, BMJ : British Medical Journal.