
Performance Analysis of GPU Programming Models Using the Roofline Scaling Trajectories.

, , and . Bench, volume 12093 of Lecture Notes in Computer Science, pp. 3-19. Springer, (2019)


Other publications by persons with the same name

Optimized pre-copy live migration for memory intensive applications., , , and . SC, pp. 40:1-40:11. ACM, (2011)

Modern gyrokinetic particle-in-cell simulation of fusion plasmas on top supercomputers., , , , , , and . Int. J. High Perform. Comput. Appl., (2019)

Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches., , , , , , , and . PMBS@SC, pp. 126-137. IEEE, (2020)

CSPACER: A Reduced API Set Runtime for the Space Consistency Model.. HPC Asia, pp. 58-68. ACM, (2021)

Fine-grained parallelization of lattice QCD kernel routine on GPUs., , and . J. Parallel Distributed Comput., 68 (10): 1350-1359 (2008)

Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality., , , , , and . CoRR, (2024)

Efficient SIMDization and data management of the Lattice QCD computation on the Cell Broadband Engine., and . Sci. Program., 17 (1-2): 153-172 (2009)

Exploiting communication concurrency on high performance computing systems., , , and . PMAM@PPoPP, pp. 132-143. ACM, (2015)

Slipstream Execution Mode for CMP-Based Multiprocessors., , and . HPCA, pp. 179-190. IEEE Computer Society, (2003)

Correlation between Detailed and Simplified Simulations in Studying Multiprocessor Architecture.. ICCD, pp. 387-392. IEEE Computer Society, (2005)