Author of the publication

Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing.

, , and . Parallel Comput., 36 (12): 645-654 (2010)

Other publications of authors with the same name

Bi-objective scheduling algorithms for optimizing makespan and reliability on heterogeneous systems., , , and . SPAA, page 280-288. ACM, (2007)
GPU-Aware Non-contiguous Data Movement In Open MPI., , , , and . HPDC, page 231-242. ACM, (2016)
Optimized Batched Linear Algebra for Modern Architectures., , , , and . Euro-Par, volume 10417 of Lecture Notes in Computer Science, page 511-522. Springer, (2017)
Selected Results from the ParkBench Benchmark., , and . Euro-Par, Vol. II, volume 1124 of Lecture Notes in Computer Science, page 251-254. Springer, (1996)
A Scalable Non-blocking Multicast Scheme for Distributed DAG Scheduling., , and . ICCS (1), volume 5544 of Lecture Notes in Computer Science, page 195-204. Springer, (2009)
CPU-GPU hybrid bidiagonal reduction with soft error resilience., , , and . ScalA@SC, page 2:1-2:5. ACM, (2013)
Harnessing GPU tensor cores for fast FP16 arithmetic to speed up mixed-precision iterative refinement solvers., , , and . SC, page 47:1-47:11. IEEE / ACM, (2018)
ADAPT: an event-based adaptive collective communication framework., , , , , and . HPDC, page 118-130. ACM, (2018)
Scheduling for Numerical Linear Algebra Library at Scale., , , and . High Performance Computing Workshop, volume 18 of Advances in Parallel Computing, page 3-26. IOS Press, (2008)
Trends in High Performance Computing.. Comput. J., 47 (4): 399-403 (2004)