论文标题

块和任务矩阵库2.0

The Chunks and Tasks Matrix Library 2.0

论文作者

Rubensson, Emanuel H., Rudberg, Elias, Kruchinina, Anastasia, Artemov, Anton G.

论文摘要

我们使用块和任务编程模型基于矩阵的稀疏Quadtree表示,介绍了C ++仅平行稀疏矩阵库。该库实现了许多用于分布式存储器并行化的稀疏矩阵算法,这些算法能够动态利用数据局部性以避免数据移动。使用CHT-MPI 2.0运行时库的实现块和任务模型,将其应用于具有不同非零结构的三个矩阵矩阵矩阵序列的块Sparse矩阵乘法的示例。无论稀疏结构如何,运行时库都成功地加载了平衡计算。

We present a C++ header-only parallel sparse matrix library, based on sparse quadtree representation of matrices using the Chunks and Tasks programming model. The library implements a number of sparse matrix algorithms for distributed memory parallelization that are able to dynamically exploit data locality to avoid movement of data. This is demonstrated for the example of block-sparse matrix-matrix multiplication applied to three sequences of matrices with different nonzero structure, using the CHT-MPI 2.0 runtime library implementation of the Chunks and Tasks model. The runtime library succeeds to dynamically load balance the calculation regardless of the sparsity structure.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源