Author of the publication

Optimizing the SVD Bidiagonalization Process for a Batch of Small Matrices.

, , , and . ICCS, volume 108 of Procedia Computer Science, page 1008-1018. Elsevier, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Model-Driven One-Sided Factorizations on Multicore Accelerated Systems., , , , , and . Supercomput. Front. Innov., 1 (1): 85-115 (2014)Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels., , and . SC, page 8:1-8:11. ACM, (2011)Abstract: A Novel Hybrid CPU-GPU Generalized Eigensolver for Electronic Structure Calculations Based on Fine Grained Memory Aware Tasks., , , , and . SC Companion, page 1338-1339. IEEE Computer Society, (2012)Harnessing GPU tensor cores for fast FP16 arithmetic to speed up mixed-precision iterative refinement solvers., , , and . SC, page 47:1-47:11. IEEE / ACM, (2018)Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs., , , and . ICS, page 5:1-5:10. ACM, (2017)Leading Edge Hybrid Multi-GPU Algorithms for Generalized Eigenproblems in Electronic Structure Calculations., , , , , and . ISC, volume 7905 of Lecture Notes in Computer Science, page 67-80. Springer, (2013)Accelerating Numerical Dense Linear Algebra Calculations with GPUs., , , , , , and . Numerical Computations with GPUs, Springer, (2014)Heterogeneous Streaming., , , , , , , , , and 8 other author(s). IPDPS Workshops, page 611-620. IEEE Computer Society, (2016)Heterogenous Acceleration for Linear Algebra in Multi-coprocessor Environments., , , and . VECPAR, volume 8969 of Lecture Notes in Computer Science, page 31-42. Springer, (2014)Performance Analysis of Parallel FFT on Large Multi-GPU Systems., , , , and . IPDPS Workshops, page 372-381. IEEE, (2022)