Author of the publication

Mixed precision LU factorization on GPU tensor cores: reducing data movement and memory footprint.

, and . Int. J. High Perform. Comput. Appl., 37 (2): 165-179 (March 2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Communication Avoiding Block Low-Rank Parallel Multifrontal Triangular Solve with Many Right-Hand Sides., , , , , , and . SIAM J. Matrix Anal. Appl., 45 (1): 148-166 (March 2024)Combining Sparse Approximate Factorizations with Mixed-precision Iterative Refinement., , , , , and . ACM Trans. Math. Softw., 49 (1): 4:1-4:29 (March 2023)Bridging the Gap Between Flat and Hierarchical Low-Rank Matrix Formats: The Multilevel Block Low-Rank Format., , , and . SIAM J. Sci. Comput., 41 (3): A1414-A1442 (2019)Sharper Probabilistic Backward Error Analysis for Basic Linear Algebra Kernels with Random Data., and . SIAM J. Sci. Comput., 42 (5): A3427-A3446 (2020)Robust and Accurate Stopping Criteria for Adaptive Randomized Sampling in Matrix-Free Hierarchically Semiseparable Construction., , , , , and . SIAM J. Sci. Comput., 41 (5): S61-S85 (2019)Access-averse framework for computing low-rank matrix approximations., , , , and . IEEE BigData, page 70-77. IEEE Computer Society, (2014)Stochastic Rounding and Its Probabilistic Backward Error Analysis., , and . SIAM J. Sci. Comput., 43 (1): A566-A585 (2021)Matrix Multiplication in Multiword Arithmetic: Error Analysis and Application to GPU Tensor Cores., , , , and . SIAM J. Sci. Comput., 45 (1): 1- (February 2023)Performance of random sampling for computing low-rank approximations of a dense matrix on GPUs., , , , , and . SC, page 60:1-60:11. ACM, (2015)Mixed Precision Block Fused Multiply-Add: Error Analysis and Application to GPU Tensor Cores., , , , and . SIAM J. Sci. Comput., 42 (3): C124-C141 (2020)