Author of the publication

A compiler for throughput optimization of graph algorithms on GPUs.

OOPSLA, page 1-19. ACM, (2016)

Other publications of authors with the same name

Parallel and Vector Programming Languages. Wiley Encyclopedia of Computer Science and Engineering, John Wiley & Sons, Inc., (2008)

An experimental evaluation of tiling and shackling for memory hierarchy management. International Conference on Supercomputing, page 482-491. ACM, (1999)

Think globally, search locally. ICS, page 141-150. ACM, (2005)

Adaptive heterogeneous scheduling for integrated GPUs. PACT, page 151-162. ACM, (2014)

Compiling for Locality. ICPP (2), page 142-146. Pennsylvania State University Press, (1990). ISBN 0-271-00728-1.

Parallel program = operator + schedule + parallel data structure. SAMOS, page iii. IEEE, (2015)

C3: A System for Automating Application-Level Checkpointing of MPI Programs. LCPC, volume 2958 of Lecture Notes in Computer Science, page 357-373. Springer, (2003)

Left-Looking to Right-Looking and Vice Versa: An Application of Fractal Symbolic Analysis to Linear Algebra Code Restructuring. Euro-Par, volume 1900 of Lecture Notes in Computer Science, page 379-388. Springer, (2000)

Cyclone: A Static Timing and Power Engine for Asynchronous Circuits. ASYNC, page 11-19. IEEE, (2020)

A Study of Graph Analytics for Massive Datasets on Distributed Multi-GPUs. IPDPS, page 84-94. IEEE, (2020)