Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference. CoRR, (2024)

TensorIR: An Abstraction for Automatic Tensorized Program Optimization. CoRR, (2022)

TensorIR: An Abstraction for Automatic Tensorized Program Optimization. ASPLOS (2), page 804-817. ACM, (2023)

TVM: An Automated End-to-End Optimizing Compiler for Deep Learning. OSDI, page 578-594. USENIX Association, (2018)

A Hardware-Software Blueprint for Flexible Deep Learning Specialization. IEEE Micro, 39 (5): 8-16 (2019)

Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning. CoRR, (2022)

NumS: Scalable Array Programming for the Cloud. CoRR, (2022)

A Unified Optimization Approach for CNN Model Inference on Integrated GPUs. ICPP, page 99:1-99:10. ACM, (2019)

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training. ICML, volume 139 of Proceedings of Machine Learning Research, page 1803-1813. PMLR, (2021)

Efficient Memory Management for Large Language Model Serving with PagedAttention. SOSP, page 611-626. ACM, (2023)