Author of the publication

Improving performance of SYCL applications on CPU architectures using LLVM-directed compilation flow.

, , , and . Concurr. Comput. Pract. Exp., (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Accelerating Neural Networks Using Open Standard Software on RISC-V., and . ISC Workshops, volume 13999 of Lecture Notes in Computer Science, page 552-564. Springer, (2023)Towards Cross-Platform Performance Portability of DNN Models using SYCL., , , , , , , and . P3HPC@SC, page 25-35. IEEE, (2020)A practical tile size selection model for affine loop nests., , , and . ICS, page 27-39. ACM, (2021)Optimizing geometric multigrid method computation using a DSL approach., , , and . SC, page 15. ACM, (2017)A Performance Analysis of Leading Many-Core Technologies for Cellular Automata Execution., , , , , , and . Euro-Par Workshops (1), volume 14351 of Lecture Notes in Computer Science, page 270-281. Springer, (2023)Improving performance of SYCL applications on CPU architectures using LLVM-directed compilation flow., , , and . Concurr. Comput. Pract. Exp., (2023)Improving performance of SYCL applications on CPU architectures using LLVM-directed compilation flow., , , and . PMAM@PPoPP, page 1-10. ACM, (2022)User-driven Online Kernel Fusion for SYCL., , , , and . ACM Trans. Archit. Code Optim., 20 (2): 21:1-21:25 (June 2023)Towards performance portability of AI graphs using SYCL., , , , , and . P3HPC@SC, page 111-122. IEEE, (2022)Technical Talk: A SYCL Extension for User-Driven Online Kernel Fusion., , , , and . IWOCL, page 19:1-19:2. ACM, (2023)