Author of the publication

Portable HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi.

, , , , , , and . PPAM (1), volume 8384 of Lecture Notes in Computer Science, page 571-581. Springer, (2013)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A hybrid Hermitian general eigenvalue solver, , , , , and . CoRR, (2012)Abstract: A Novel Hybrid CPU-GPU Generalized Eigensolver for Electronic Structure Calculations Based on Fine Grained Memory Aware Tasks., , , , and . SC Companion, page 1338-1339. IEEE Computer Society, (2012)Harnessing GPU tensor cores for fast FP16 arithmetic to speed up mixed-precision iterative refinement solvers., , , and . SC, page 47:1-47:11. IEEE / ACM, (2018)Fast Cholesky factorization on GPUs for batch and native modes in MAGMA., , , and . J. Comput. Sci., (2017)Batched matrix computations on hardware accelerators based on GPUs., , , , and . Int. J. High Perform. Comput. Appl., 29 (2): 193-208 (2015)Model-Driven One-Sided Factorizations on Multicore Accelerated Systems., , , , , and . Supercomput. Front. Innov., 1 (1): 85-115 (2014)Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems., , , , , , , , , and . Supercomput. Front. Innov., 2 (4): 67-86 (2015)Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels., , and . SC, page 8:1-8:11. ACM, (2011)Leading Edge Hybrid Multi-GPU Algorithms for Generalized Eigenproblems in Electronic Structure Calculations., , , , , and . ISC, volume 7905 of Lecture Notes in Computer Science, page 67-80. Springer, (2013)Heterogeneous Streaming., , , , , , , , , and 8 other author(s). IPDPS Workshops, page 611-620. IEEE Computer Society, (2016)