Author of the publication

Delite: A Compiler Architecture for Performance-Oriented Embedded Domain-Specific Languages.

, , , , , , and . ACM Trans. Embed. Comput. Syst., 13 (4s): 134:1-134:25 (2014)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Automatic support for multi-module parallelism from computational patterns., , , , , , and . FPL, page 1-8. IEEE, (2015)Exploring the limits of Concurrency in ML Training on Google TPUs., , , , , , , , , and 9 other author(s). CoRR, (2020)GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding., , , , , , , , and . ICLR, OpenReview.net, (2021)Generating Configurable Hardware from Parallel Patterns., , , , , , and . ASPLOS, page 651-665. ACM, (2016)A Generic Design for Encoding and Decoding Variable Length Codes in Multi-codec Video Processing Engines., , , and . ISVLSI, page 197-202. IEEE Computer Society, (2008)GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism., , , , , , and . CoRR, (2018)Mesh-tensorflow: Deep learning for supercomputers, , , , , , , , , and 1 other author(s). Advances in Neural Information Processing Systems, page 10435--10444. (2018)GSPMD: General and Scalable Parallelization for ML Computation Graphs., , , , , , , , , and 6 other author(s). CoRR, (2021)Project Lancet: Surgical Precision JIT Compilers, , , , , , and . (2013)Surgical precision JIT compilers., , , , , and . PLDI, page 41-52. ACM, (2014)