Author of the publication

A Comprehensive Study of Task Coalescing for Selecting Parallelism Granularity in a Two-Stage Bidiagonal Reduction.

, , , and . IPDPS, page 25-35. IEEE Computer Society, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

With Extreme Computing, the Rules Have Changed., , , , , , , , and . Comput. Sci. Eng., 19 (3): 52-62 (2017)Achieving numerical accuracy and high performance using recursive tile LU factorization with partial pivoting., , , and . Concurr. Comput. Pract. Exp., 26 (7): 1408-1431 (2014)Recursive approach in sparse matrix LU factorization., , and . Sci. Program., 9 (1): 51-60 (2001)Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators., , and . HPEC, page 1-6. IEEE, (2019)PLASMA: Parallel Linear Algebra Software for Multicore Using OpenMP., , , , , , , , , and 5 other author(s). ACM Trans. Math. Softw., 45 (2): 16:1-16:35 (2019)Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy., , , , and . ACM Trans. Math. Softw., 34 (4): 17:1-17:22 (2008)Parallel reduction to hessenberg form with algorithm-based fault tolerance., , , and . SC, page 88:1-88:11. ACM, (2013)Anatomy of a globally recursive embedded LINPACK benchmark., and . HPEC, page 1-6. IEEE, (2012)Exploiting Mixed Precision Floating Point Hardware in Scientific Computations., , , , , , and . High Performance Computing Workshop, volume 16 of Advances in Parallel Computing, page 19-36. IOS Press, (2006)Programming the LU Factorization for a Multicore System with Accelerators., , , and . VECPAR, volume 7851 of Lecture Notes in Computer Science, page 28-35. Springer, (2012)