Author of the publication

High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach.

, , , , , , and . SC, page 69:1-69:11. ACM, (2011)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Overview of the QCDSP and QCDOC computers., , , , , , , , , and 8 other author(s). IBM J. Res. Dev., 49 (2-3): 351-366 (2005)Early Application Experiences on a Modern GPU-Accelerated Arm-based HPC Platform., , , , , , , , , and 24 other author(s). CoRR, (2022)A per-cent-level determination of the nucleon axial coupling from quantum chromodynamics., , , , , , , , , and 5 other author(s). Nat., 558 (7708): 91-94 (2018)Solving DWF dirac equation using multi-splitting preconditioned conjugate gradient with tensor cores on NVIDIA GPUs., , , and . PASC, page 9:1-9:11. ACM, (2021)QCDOC: A 10 Teraflops Computer for Tightly-Coupled Calculations., , , , , , , , , and 8 other author(s). SC, page 40. IEEE Computer Society, (2004)An evaluation of the CORAL interconnects., , , , , , , , , and 2 other author(s). SC, page 39:1-39:18. ACM, (2019)Performance Portability of a Wilson Dslash Stencil Operator Mini-App Using Kokkos and SYCL., , , , , , , and . P3HPC@SC, page 14-25. IEEE, (2019)High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach., , , , , , and . SC, page 69:1-69:11. ACM, (2011)Simulating the weak death of the neutron in a femtoscale universe with near-Exascale computing., , , , , , , , , and 2 other author(s). CoRR, (2018)Pushing memory bandwidth limitations through efficient implementations of Block-Krylov space solvers on GPUs., , , , and . Comput. Phys. Commun., (2018)