Author of the publication

PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections.

OSDI, pages 37-54. USENIX Association, (2021)

Other publications of authors with the same name

Vapro: performance variance detection and diagnosis for production-run parallel applications. PPoPP, pages 150-162. ACM, (2022)

Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst., 32 (11): 2631-2634 (2021)

WiseGraph: Optimizing GNN with Joint Workload Partition of Graph and Operations. EuroSys, pages 1-17. ACM, (2024)

FreeTensor: a free-form DSL with holistic optimizations for irregular tensor programs. PLDI, pages 872-887. ACM, (2022)

EINNET: Optimizing Tensor Programs with Derivation-Based Transformations. OSDI, pages 739-755. USENIX Association, (2023)

Optimal Kernel Orchestration for Tensor Programs with Korch. ASPLOS (3), pages 755-769. ACM, (2024)

OLLIE: Derivation-based Tensor Program Optimizer. CoRR, (2022)

BaGuaLu: targeting brain scale pretrained models with over 37 million cores. PPoPP, pages 192-204. ACM, (2022)

Optimal Kernel Orchestration for Tensor Programs with Korch. CoRR, (2024)