Author of the publication

Optimizing GPU Kernels for Irregular Batch Workloads: A Case Study for Cholesky Factorization.

, , , and . HPEC, page 1-7. IEEE, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Accelerating Scientific Computations with Mixed Precision Algorithms, , , , , , , and . CoRR, (2008)Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product., , and . SpringSim (HPS), page 75-82. SCS/ACM, (2015)Performance analysis and design of a hessenberg reduction using stabilized blocked elementary transformations for new architectures., , , and . SpringSim (HPS), page 135-142. SCS/ACM, (2015)Performance Analysis and Optimisation of Two-sided Factorization Algorithms for Heterogeneous Platform., , , and . ICCS, volume 51 of Procedia Computer Science, page 180-190. Elsevier, (2015)GPU-Based Homotopy Continuation for Minimal Problems in Computer Vision., , , , , and . CoRR, (2021)A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic., , , , , , , , , and 15 other author(s). CoRR, (2020)Batched one-sided factorizations of tiny matrices using GPUs: Challenges and countermeasures., , , and . J. Comput. Sci., (2018)Mixed-Precision Cholesky QR Factorization and Its Case Studies on Multicore CPU with Multiple GPUs., , and . SIAM J. Sci. Comput., (2015)A survey of numerical linear algebra methods utilizing mixed-precision arithmetic., , , , , , , , , and 11 other author(s). Int. J. High Perform. Comput. Appl., (2021)Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices., , , , , , and . Parallel Comput., (2019)