Author of the publication

Optimized two-level parallelization for GPU accelerators using the polyhedral model.

, , and . CC, page 22-33. ACM, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Chapel-on-X: Exploring Tasking Runtimes for PGAS Languages., , , , and . ESPM2@SC, page 5:1-5:8. ACM, (2017)Phasers: a unified deadlock-free construct for collective and point-to-point synchronization., , , and . ICS, page 277-288. ACM, (2008)A Unified Approach to Variable Renaming for Enhanced Vectorization., , , and . LCPC, volume 11882 of Lecture Notes in Computer Science, page 1-20. Springer, (2018)The design and implementation of the habanero-java parallel programming language., , , , , , and . OOPSLA Companion, page 185-186. ACM, (2011)Exploring Compiler Optimization Opportunities for the OpenMP 4.× Accelerator Model on a POWER8+GPU Platform., , , , and . WACCPD@SC, page 68-78. IEEE Computer Society, (2016)Formalization of Phase Ordering., , and . PLACES, volume 211 of EPTCS, page 13-24. (2016)A Parallelizing Compiler Cooperative Heterogeneous Multicore Processor Architecture., , , , , , , and . Trans. High Perform. Embed. Archit. Compil., (2011)Oil and Water Can Mix: An Integration of Polyhedral and AST-Based Transformations., , and . SC, page 287-298. IEEE Computer Society, (2014)Chunking parallel loops in the presence of synchronization., , , and . ICS, page 181-192. ACM, (2009)Hardware and Software Tradeoffs for Task Synchronization on Manycore Architectures., , , , , , , , and . Euro-Par (2), volume 6853 of Lecture Notes in Computer Science, page 112-123. Springer, (2011)