Author of the publication

MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime.

, , , , and . SC Workshops, page 1081-1092. ACM, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Survey of CPU-GPU Heterogeneous Computing Techniques., and . ACM Comput. Surv., 47 (4): 69:1-69:35 (2015)Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training., , , and . CoRR, (2018)Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training., , , and . IPDPS, page 188-199. IEEE, (2019)EqualWrites: Reducing Intra-Set Write Variations for Enhancing Lifetime of Non-Volatile Caches., and . IEEE Trans. Very Large Scale Integr. Syst., 24 (1): 103-114 (2016)Accelerating S3D: A GPGPU Case Study., , , , , and . Euro-Par Workshops, volume 6043 of Lecture Notes in Computer Science, page 122-131. Springer, (2009)A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications., , , , and . WOMPAT, volume 2104 of Lecture Notes in Computer Science, page 53-67. Springer, (2001)Contemporary High Performance Computing - From Petascale toward Exascale.. Chapman and Hall / CRC computational science series CRC Press, (2013)Kernel-level single system image for petascale computing., , , , , and . ACM SIGOPS Oper. Syst. Rev., 40 (2): 50-54 (2006)DRAGON: breaking GPU memory capacity limits with direct NVM access., , , , and . SC, page 32:1-32:13. IEEE / ACM, (2018)GA-GPU: extending a library-based global address spaceprogramming model for scalable heterogeneouscomputing systems., and . Conf. Computing Frontiers, page 53-64. ACM, (2012)