Author of the publication

Generalized GEMM Kernels on GPGPUs: Experiments and Applications.

, , and . PARCO, volume 19 of Advances in Parallel Computing, page 307-314. IOS Press, (2009)


Other publications of authors with the same name

Design Patterns for Scientific Computations on Sparse Matrices., , , and . Euro-Par Workshops (1), volume 7155 of Lecture Notes in Computer Science, page 367-376. Springer, (2011)

Approximate Inverse Preconditioners for Krylov Methods on Heterogeneous Parallel Computers., and . PARCO, volume 25 of Advances in Parallel Computing, page 183-192. IOS Press, (2013)

Generalized GEMM Kernels on GPGPUs: Experiments and Applications., , and . PARCO, volume 19 of Advances in Parallel Computing, page 307-314. IOS Press, (2009)

Object-Oriented Techniques for Sparse Matrix Computations in Fortran 2003., and . ACM Trans. Math. Softw., 38 (4): 23:1-23:20 (2012)

SIMPL: A Pattern Language for Writing Efficient Kernels on GPGPU., , and . SE4HPCS@ICSE, page 38-45. IEEE Computer Society, (2015)

Extracting UML class diagrams from object-oriented Fortran: ForUML., , and . SE-HPCCSE@SC, page 9-16. ACM, (2013)

Some Preliminary Experiences with Sparse BLAS in Parallel Iterative Solvers., and . PARA, volume 1041 of Lecture Notes in Computer Science, page 207-213. Springer, (1995)

OpenCoarrays: Open-source Transport Layers Supporting Coarray Fortran Compilers., , , , , and . PGAS, page 4:1-4:11. ACM, (2014)

Coarray-based load balancing on heterogeneous and many-core architectures., , and . Parallel Comput., (2017)

Efficient Algebraic Multigrid Preconditioners on Clusters of GPUs., , , , and . Parallel Process. Lett., 29 (1): 1950001:1-1950001:15 (2019)