Author of the publication

Anatomy of high-performance matrix multiplication.

, and . ACM Trans. Math. Softw., 34 (3): 12:1-12:25 (2008)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Efficient Communication Primitives on Mesh Architectures with Hardware Routing., , , and . PPSC, page 943-948. SIAM, (1993)Strassen's Algorithm for Tensor Contraction., , and . CoRR, (2017)Exploiting Symmetry in Tensors for High Performance, , , and . CoRR, (2013)Scalable parallelization of FLAME code via the workqueuing model., , , and . ACM Trans. Math. Softw., 34 (2): 10:1-10:29 (2008)A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting., , , , and . IEEE Access, (2019)Using desktop computers to solve large-scale dense linear algebra problems., , , and . J. Supercomput., 58 (2): 145-150 (2011)SUMMA: scalable universal matrix multiplication algorithm., and . Concurr. Pract. Exp., 9 (4): 255-274 (1997)Goal-Oriented and Modular Stability Analysis., and . SIAM J. Matrix Anal. Appl., 32 (1): 286-308 (2011)Parallelizing the QR Algorithm for the Unsymmetric Algebraic Eigenvalue Problem: Myths and Reality., and . SIAM J. Sci. Comput., 17 (4): 870-883 (1996)Exploiting Symmetry in Tensors for High Performance: Multiplication with Symmetric Tensors., , , and . SIAM J. Sci. Comput., (2014)