Author of the publication

Roofline Scaling Trajectories: A Method for Parallel Application and Architectural Performance Analysis.

HPCS, page 350-358. IEEE, (2018)


Other publications of authors with the same name

Fine-grained parallelization of lattice QCD kernel routine on GPUs. J. Parallel Distributed Comput., 68 (10): 1350-1359 (2008)

Modern gyrokinetic particle-in-cell simulation of fusion plasmas on top supercomputers. Int. J. High Perform. Comput. Appl., (2019)

Optimized pre-copy live migration for memory intensive applications. SC, page 40:1-40:11. ACM, (2011)

Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality. CoRR, (2024)

Efficient SIMDization and data management of the Lattice QCD computation on the Cell Broadband Engine. Sci. Program., 17 (1-2): 153-172 (2009)

Exploiting communication concurrency on high performance computing systems. PMAM@PPoPP, page 132-143. ACM, (2015)

CSPACER: A Reduced API Set Runtime for the Space Consistency Model. HPC Asia, page 58-68. ACM, (2021)

Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches. PMBS@SC, page 126-137. IEEE, (2020)

Architectural Requirements for Deep Learning Workloads in HPC Environments. PMBS, page 7-17. IEEE, (2021)

Slipstream Execution Mode for CMP-Based Multiprocessors. HPCA, page 179-190. IEEE Computer Society, (2003)