Accelerating lattice QCD multigrid on GPUs using fine-grained parallelization.

SC, page 795-806. IEEE Computer Society, (2016)

Other publications of authors with the same name

Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors. SC, page 69-80. IEEE Computer Society, (2014)

Continuing Progress on a Lattice QCD Software Infrastructure. CoRR, (2008)

Performance Portability of a Wilson Dslash Stencil Operator Mini-App Using Kokkos and SYCL. P3HPC@SC, page 14-25. IEEE, (2019)

High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach. SC, page 69:1-69:11. ACM, (2011)

Optimizing Wilson-Dirac Operator and Linear Solvers for Intel® KNL. ISC Workshops, volume 9945 of Lecture Notes in Computer Science, page 415-427. (2016)

Simulating the weak death of the neutron in a femtoscale universe with near-Exascale computing. CoRR, (2018)

Application Experiences on a GPU-Accelerated Arm-based HPC Testbed. HPC Asia Workshops, page 35-49. ACM, (2023)

A Framework for Lattice QCD Calculations on GPUs. IPDPS, page 1073-1082. IEEE Computer Society, (2014)

Optimizing a Multiple Right-Hand Side Dslash Kernel for Intel Knights Corner. ISC Workshops, volume 9945 of Lecture Notes in Computer Science, page 390-401. (2016)

Improving concurrency and asynchrony in multithreaded MPI applications using software offloading. SC, page 30:1-30:12. ACM, (2015)