Author of the publication

Towards neural architecture-aware exploration of compiler optimizations in a deep learning graph compiler.

CF, page 244-250. ACM, (2022)


Other publications of authors with the same name

Stream-AI-MD: streaming AI-driven adaptive molecular simulations for heterogeneous computing platforms. PASC, page 6:1-6:13. ACM, (2021)

FAIR for AI: An interdisciplinary, international, inclusive, and diverse community building perspective. CoRR, (2022)

A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators. CoRR, (2023)

A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning Workloads. PMBS@SC, page 13-25. IEEE, (2022)

Throughput-oriented and Accuracy-aware DNN Training with BFloat16 on GPU. IPDPS Workshops, page 1084-1087. IEEE, (2022)

Efficient Design Space Exploration for Sparse Mixed Precision Neural Architectures. HPDC, page 265-276. ACM, (2022)

Toward an In-Depth Analysis of Multifidelity High Performance Computing Systems. CCGRID, page 716-725. IEEE, (2022)

Data Race Detection Using Large Language Models. SC Workshops, page 215-223. ACM, (2023)

Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines. ECSA (Tracks and Workshops), volume 13928 of Lecture Notes in Computer Science, page 402-417. Springer, (2022)

Is Data Placement Optimization Still Relevant on Newer GPUs? PMBS@SC, page 83-96. IEEE, (2018)