Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

From CUDA to OpenCL: Towards a performance-portable solution for multi-platform GPU programming.

P. Du, R. Weber, P. Luszczek, S. Tomov, G. Peterson, and J. Dongarra. Parallel Comput., 38 (8): 391-407 (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Sven Piotrowiak

Katja Piotrowski

Heinz Piotrowski

Zana Piotrowski

Jens Piotraschke

Other publications of authors with the same name

Exploiting Mixed Precision Floating Point Hardware in Scientific Computations.A. Buttari, J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov. High Performance Computing Workshop, volume 16 of Advances in Parallel Computing, page 19-36. IOS Press, (2006)Anatomy of a globally recursive embedded LINPACK benchmark.J. Dongarra, and P. Luszczek. HPEC, page 1-6. IEEE, (2012)Programming the LU Factorization for a Multicore System with Accelerators.J. Kurzak, P. Luszczek, M. Faverge, and J. Dongarra. VECPAR, volume 7851 of Lecture Notes in Computer Science, page 28-35. Springer, (2012)ScaLAPACK.J. Dongarra, and P. Luszczek. Encyclopedia of Parallel Computing, Springer, (2011)Parallel reduction to hessenberg form with algorithm-based fault tolerance.Y. Jia, G. Bosilca, P. Luszczek, and J. Dongarra. SC, page 88:1-88:11. ACM, (2013)Towards Portable Runtime Support for Irregular and Out-of-Core Computations.M. Bubak, and P. Luszczek. PVM/MPI, volume 1697 of Lecture Notes in Computer Science, page 59-66. Springer, (1999)Replacing Pivoting in Distributed Gaussian Elimination with Randomized Techniques.N. Lindquist, P. Luszczek, and J. Dongarra. ScalA@SC, page 35-43. IEEE, (2020)Weighted dynamic scheduling with many parallelism grains for offloading of numerical workloads to multiple varied accelerators.A. Haidar, Y. Jia, P. Luszczek, S. Tomov, A. YarKhan, and J. Dongarra. ScalA@SC, page 5:1-5:8. ACM, (2015)Towards batched linear solvers on accelerated hardware platforms.A. Haidar, T. Dong, P. Luszczek, S. Tomov, and J. Dongarra. PPoPP, page 261-262. ACM, (2015)LU Factorization of Small Matrices: Accelerating Batched DGETRF on the GPU.T. Dong, A. Haidar, P. Luszczek, J. Harris, S. Tomov, and J. Dongarra. HPCC/CSS/ICESS, page 157-160. IEEE, (2014)

BibSonomy

Disambiguation of "Luszczek, Piotr"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

From CUDA to OpenCL: Towards a performance-portable solution for multi-platform GPU programming.

Please choose a person to relate this publication to

Sven Piotrowiak

Katja Piotrowski

Heinz Piotrowski

Zana Piotrowski

Jens Piotraschke

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Luszczek, Piotr"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML From CUDA to OpenCL: Towards a performance-portable solution for multi-platform GPU programming.

Please choose a person to relate this publication to

Sven Piotrowiak

Katja Piotrowski

Heinz Piotrowski

Zana Piotrowski

Jens Piotraschke

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

From CUDA to OpenCL: Towards a performance-portable solution for multi-platform GPU programming.