Author of the publication

Toward Performance Portable Programming for Heterogeneous Systems on a Chip: A Case Study with Qualcomm Snapdragon SoC.

HPEC, page 1-7. IEEE, (2021)


Other publications of authors with the same name

GA-GPU: extending a library-based global address space programming model for scalable heterogeneous computing systems. Conf. Computing Frontiers, page 53-64. ACM, (2012)

DRAGON: breaking GPU memory capacity limits with direct NVM access. SC, page 32:1-32:13. IEEE / ACM, (2018)

Accelerating S3D: A GPGPU Case Study. Euro-Par Workshops, volume 6043 of Lecture Notes in Computer Science, page 122-131. Springer, (2009)

A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications. WOMPAT, volume 2104 of Lecture Notes in Computer Science, page 53-67. Springer, (2001)

Runtime Techniques to Enable a Highly-Scalable Global Address Space Model for Petascale Computing. Int. J. Parallel Program., 40 (6): 633-655 (2012)

Using FPGA Devices to Accelerate Biomolecular Simulations. Computer, 40 (3): 66-73 (2007)

Aspen-based performance and energy modeling frameworks. J. Parallel Distributed Comput., (2018)

Performance portability study for massively parallel computational fluid dynamics application on scalable heterogeneous architectures. J. Parallel Distributed Comput., (2019)

Performance characteristics of biomolecular simulations on high-end systems with multi-core processors. Parallel Comput., 34 (11): 640-651 (2008)

Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training. CoRR, (2018)