Author of the publication

Case Study of Using Kokkos and SYCL as Performance-Portable Frameworks for Milc-Dslash Benchmark on NVIDIA, AMD and Intel GPUs.

P3HPC@SC, page 57-67. IEEE, (2021)

Other publications of authors with the same name

Rapid Exploration of Optimization Strategies on Advanced Architectures using TestSNAP and LAMMPS. CoRR, (2020)

The Kokkos OpenMPTarget Backend: Implementation and Lessons Learned. IWOMP, volume 14114 of Lecture Notes in Computer Science, page 99-113. Springer, (2023)

A Novel Multi-level Integrated Roofline Model Approach for Performance Characterization. ISC, volume 10876 of Lecture Notes in Computer Science, page 226-245. Springer, (2018)

Evaluating Performance Portability of OpenMP for SNAP on NVIDIA, Intel, and AMD GPUs Using the Roofline Methodology. WACCPD@SC, volume 12655 of Lecture Notes in Computer Science, page 3-24. Springer, (2020)

Non-recurring engineering (NRE) best practices: a case study with the NERSC/NVIDIA OpenMP contract. SC, page 31. ACM, (2021)

Transactional Access to Shared Memory in StarSs, a Task Based Programming Model. Euro-Par, volume 7484 of Lecture Notes in Computer Science, page 514-525. Springer, (2012)

Timemory: Modular Performance Analysis for HPC. ISC, volume 12151 of Lecture Notes in Computer Science, page 434-452. Springer, (2020)

A Case Study for Performance Portability Using OpenMP 4.5. WACCPD@SC, volume 11381 of Lecture Notes in Computer Science, page 75-95. Springer, (2018)

Comparing Managed Memory and ATS with and without Prefetching on NVIDIA Volta GPUs. PMBS@SC, page 41-46. IEEE, (2019)

A Methodology for Evaluating Tightly-integrated and Disaggregated Accelerated Architectures. PMBS@SC, page 71-81. IEEE, (2022)