Author of the publication

Case Studies in Automatic GPGPU Code Generation with llc.

Euro-Par Workshops, volume 6586 of Lecture Notes in Computer Science, pages 13-22. Springer, (2010)

Other publications of authors with the same name

Towards Heterogeneous and Distributed Computing in C++. IWOCL, pages 18:1-18:5. ACM, (2019)

Towards Cross-Platform Performance Portability of DNN Models using SYCL. P3HPC@SC, pages 25-35. IEEE, (2020)

Automatic Hybrid MPI+OpenMP Code Generation with llc. PVM/MPI, volume 5759 of Lecture Notes in Computer Science, pages 185-195. Springer, (2009)

Automatic code generation for GPUs in llc. J. Supercomput., 58 (3): 349-356 (2011)

Exploring large macromolecular functional motions on clusters of multicore processors. J. Comput. Phys., (2013)

What's New in SYCL 1.2.1 and How to Explore the Features. IWOCL, page 11:1. ACM, (2018)

Optimize or Wait? Using llc Fast-Prototyping Tool to Evaluate CUDA Optimizations. PDP, pages 257-261. IEEE Computer Society, (2011)

accULL: An OpenACC Implementation with CUDA and OpenCL Support. Euro-Par, volume 7484 of Lecture Notes in Computer Science, pages 871-882. Springer, (2012)

Optimization strategies in different CUDA architectures using llCoMP. Microprocess. Microsystems, 36 (2): 78-87 (2012)

Leveraging task-parallelism in message-passing dense matrix factorizations using SMPSs. Parallel Comput., 40 (5-6): 113-128 (2014)