Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators., , and . HPEC, page 1-6. IEEE, (2019)Achieving numerical accuracy and high performance using recursive tile LU factorization with partial pivoting., , , and . Concurr. Comput. Pract. Exp., 26 (7): 1408-1431 (2014)Tools and techniques for performance - Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems)., , , , , and . SC, page 113. ACM Press, (2006)A Framework for Batched and GPU-Resident Factorization Algorithms Applied to Block Householder Transformations., , , , and . ISC, volume 9137 of Lecture Notes in Computer Science, page 31-47. Springer, (2015)Optimizing Krylov Subspace Solvers on Graphics Processing Units., , , , , and . IPDPS Workshops, page 941-949. IEEE Computer Society, (2014)Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures., , and . IPDPS Workshops, page 653-662. IEEE Computer Society, (2016)Virtual Systolic Array for QR Decomposition., , , , and . IPDPS, page 251-260. IEEE Computer Society, (2013)Towards Portable Runtime Support for Irregular and Out-of-Core Computations., and . PVM/MPI, volume 1697 of Lecture Notes in Computer Science, page 59-66. Springer, (1999)Replacing Pivoting in Distributed Gaussian Elimination with Randomized Techniques., , and . ScalA@SC, page 35-43. IEEE, (2020)Weighted dynamic scheduling with many parallelism grains for offloading of numerical workloads to multiple varied accelerators., , , , , and . ScalA@SC, page 5:1-5:8. ACM, (2015)