Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Abstract: A Novel Hybrid CPU-GPU Generalized Eigensolver for Electronic Structure Calculations Based on Fine Grained Memory Aware Tasks. SC Companion, page 1338-1339. IEEE Computer Society, (2012)
Model-Driven One-Sided Factorizations on Multicore Accelerated Systems. Supercomput. Front. Innov., 1 (1): 85-115 (2014)
Harnessing GPU tensor cores for fast FP16 arithmetic to speed up mixed-precision iterative refinement solvers. SC, page 47:1-47:11. IEEE / ACM, (2018)
Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels. SC, page 8:1-8:11. ACM, (2011)
Accelerating Numerical Dense Linear Algebra Calculations with GPUs. Numerical Computations with GPUs, Springer, (2014)
Leading Edge Hybrid Multi-GPU Algorithms for Generalized Eigenproblems in Electronic Structure Calculations. ISC, volume 7905 of Lecture Notes in Computer Science, page 67-80. Springer, (2013)
Heterogeneous Streaming. IPDPS Workshops, page 611-620. IEEE Computer Society, (2016)
Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs. ICS, page 5:1-5:10. ACM, (2017)
Heterogenous Acceleration for Linear Algebra in Multi-coprocessor Environments. VECPAR, volume 8969 of Lecture Notes in Computer Science, page 31-42. Springer, (2014)
Performance Analysis of Parallel FFT on Large Multi-GPU Systems. IPDPS Workshops, page 372-381. IEEE, (2022)