Author of the publication

Performance analysis and acceleration of explicit integration for large kinetic networks using batched GPU computations.

, , , , , , and . HPEC, page 1-7. IEEE, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Towards Achieving Performance Portability Using Directives for Accelerators., , , , , , and . WACCPD@SC, page 13-24. IEEE Computer Society, (2016)Optimizing GPU Kernels for Irregular Batch Workloads: A Case Study for Cholesky Factorization., , , and . HPEC, page 1-7. IEEE, (2018)Scalability Issues in FFT Computation., , , and . PaCT, volume 12942 of Lecture Notes in Computer Science, page 279-287. Springer, (2021)Efficient implementation of quantum materials simulations on distributed CPU-GPU systems., , , , , and . SC, page 10:1-10:12. ACM, (2015)Performance, Design, and Autotuning of Batched GEMM for GPUs., , , and . ISC, volume 9697 of Lecture Notes in Computer Science, page 21-38. Springer, (2016)Towards Half-Precision Computation for Complex Matrices: A Case Study for Mixed Precision Solvers on GPUs., , and . ScalA@SC, page 17-24. IEEE, (2019)Tridiagonalization of a Symmetric Dense Matrix on a GPU Cluster., , , and . IPDPS Workshops, page 1070-1079. IEEE, (2013)The Impact of Multicore on Math Software., , , , , and . PARA, volume 4699 of Lecture Notes in Computer Science, page 1-10. Springer, (2006)Autotuning GEMM Kernels for the Fermi GPU., , and . IEEE Trans. Parallel Distributed Syst., 23 (11): 2045-2057 (2012)Stability and Performance of Various Singular Value QR Implementations on Multicore CPU with a GPU., , and . ACM Trans. Math. Softw., 43 (2): 10:1-10:18 (2016)