Author of the publication

A QDWH-based SVD Software Framework on Distributed-memory Manycore Systems.

, , , and . ACM Trans. Math. Softw., 45 (2): 18:1-18:21 (2019)


Other publications of authors with the same name

Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators., , , and . VECPAR, volume 7851 of Lecture Notes in Computer Science, page 72-79. Springer, (2012)
A QDWH-based SVD Software Framework on Distributed-memory Manycore Systems., , , and . ACM Trans. Math. Softw., 45 (2): 18:1-18:21 (2019)
Batched Triangular Dense Linear Algebra Kernels for Very Small Matrix Sizes on GPUs., , and . ACM Trans. Math. Softw., 45 (2): 15:1-15:28 (2019)
Exploiting Data Sparsity for Large-Scale Matrix Computations., , , , , and . Euro-Par, volume 11014 of Lecture Notes in Computer Science, page 721-734. Springer, (2018)
Redesigning Triangular Dense Matrix Computations on GPUs., , and . Euro-Par, volume 9833 of Lecture Notes in Computer Science, page 477-489. Springer, (2016)
Data-driven execution of fast multipole methods., and . Concurr. Comput. Pract. Exp., 26 (11): 1935-1946 (2014)
Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency., , and . Comput. Sci. Res. Dev., 27 (4): 277-287 (2012)
Maximizing I/O Bandwidth for Reverse Time Migration on Heterogeneous Large-Scale Systems., , and . Euro-Par, volume 12247 of Lecture Notes in Computer Science, page 263-278. Springer, (2020)
Toward a High Performance Tile Divide and Conquer Algorithm for the Dense Symmetric Eigenvalue Problem., , and . SIAM J. Sci. Comput., (2012)
Multidimensional Intratile Parallelization for Memory-Starved Stencil Computations., , , and . ACM Trans. Parallel Comput., 4 (3): 12:1-12:32 (2018)