Paper Title

Evaluating Emerging CXL-enabled Memory Pooling for HPC Systems

Authors

Jacob Wahlgren, Maya Gokhale, Ivy B. Peng

Abstract

Current HPC systems provide memory resources that are statically configured and tightly coupled with compute nodes. However, workloads on HPC systems are evolving. Diverse workloads lead to a need for configurable memory resources to achieve high performance and utilization. In this study, we evaluate a memory subsystem design leveraging CXL-enabled memory pooling. Two promising use cases of composable memory subsystems are studied -- fine-grained capacity provisioning and scalable bandwidth provisioning. We developed an emulator to explore the performance impact of various memory compositions. We also provide a profiler to identify the memory usage patterns in applications and their optimization opportunities. Seven scientific and six graph applications are evaluated on various emulated memory configurations. Three out of seven scientific applications had less than 10% performance impact when the pooled memory backed 75% of their memory footprint. The results also show that a dynamically configured high-bandwidth system can effectively support bandwidth-intensive unstructured mesh-based applications like OpenFOAM. Finally, we identify interference through shared memory pools as a practical challenge for adoption on HPC systems.
