Author of the publication

hiCUDA: High-Level GPGPU Programming.

, and . IEEE Trans. Parallel Distributed Syst., 22 (1): 78-90 (2011)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Computation and Data Partitioning on Scalable Shared Memory Multiprocessors., and . PDPTA, page 41-50. CSREA Press, (1995)Locality Enhancement for Large-Scale Shared-Memory Multiprocessors., , , and . LCR, volume 1511 of Lecture Notes in Computer Science, page 335-342. Springer, (1998)A Compiler Infrastructure for High-Performance Java., and . HPCN Europe, volume 2110 of Lecture Notes in Computer Science, page 675-684. Springer, (2001)Pipelined Training with Stale Weights of Deep Convolutional Neural Networks., and . CoRR, (2019)Reducing divergence in GPGPU programs with loop merging., and . GPGPU@ASPLOS, page 12-23. ACM, (2013)A Characterization of Traces in Java Programs., and . PLC, page 87-93. CSREA Press, (2005)Optimization of Compiler-Generated OpenCL CNN Kernels and Runtime for FPGAs., and . IPDPS Workshops, page 100-103. IEEE, (2022)Locality management using multiple SPMs on the Multi-Level Computing Architecture., and . ESTIMedia, page 67-72. IEEE Computer Society, (2006)Architectural support for synchronization-free deterministic parallel programming., and . HPCA, page 337-348. IEEE Computer Society, (2012)Genesis: a language for generating synthetic training programs for machine learning., , and . Conf. Computing Frontiers, page 8:1-8:8. ACM, (2015)