The torus-wrap mapping for dense matrix calculations on massively parallel computers.

Abstract

Scalable parallel algorithms are considered for linear algebra applications. A bottleneck in these algorithms is the mapping of matrix elements to processors. Wrapping a block mapping in both rows and columns of the matrix is called the torus-wrap mapping. Its generalization is the block-torus-wrap, which assigns each block to a single processor in such a way that the distribution of block mirrors is the distribution of elements in a torus-wrap mapping. It is proved that this assignment scheme leads to dense matrix algorithms that achieve the lower bound on interprocessor communication under reasonable conditions. Theoretical and experimental results are compared with those obtained from more traditional mapping.

BibTeX key: 808.65148
entry type: article
year: 1994
journal: SIAM J. Sci. Comput.
number: 5
pages: 1201-1226
volume: 15
reviewer: L.S.Ioffe (Haifa)
language: English
classmath: *65Y05 Parallel computation (numerical methods) 65F05 Direct methods for linear systems 68P10 Searching and sorting

BibSonomy

The torus-wrap mapping for dense matrix calculations on massively parallel computers.

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on