Author of the publication

Grape: Practical and Efficient Graphed Execution for Dynamic Deep Neural Networks on GPUs.

B. Zheng, C. Yu, J. Wang, Y. Ding, Y. Liu, Y. Wang, and G. Pekhimenko. MICRO, page 1364-1380. ACM, (2023)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

Hao Yu

Yu Hao

Hao Yu

Peter Hao Yu

Chia-Hao Yu

Other publications of authors with the same name

AutoAccel: Automated Accelerator Generation and Optimization with Composable, Parallel and Pipeline Architecture. J. Cong, P. Wei, C. Yu, and P. Zhang. CoRR, (2018)

TensorIR: An Abstraction for Automatic Tensorized Program Optimization. S. Feng, B. Hou, H. Jin, W. Lin, J. Shao, R. Lai, Z. Ye, L. Zheng, C. Yu, Y. Yu and 1 other author(s). CoRR, (2022)

Decoupled Model Schedule for Deep Learning Training. H. Chen, C. Yu, S. Zheng, Z. Zhang, Z. Zhang, and Y. Wang. CoRR, (2023)

From JVM to FPGA: Bridging Abstraction Hierarchy via Optimized Deep Pipelining. J. Cong, P. Wei, and C. Yu. HotCloud, USENIX Association, (2018)

MOCHA: Multinode Cost Optimization in Heterogeneous Clouds with Accelerators. P. Zhou, J. Sheng, C. Yu, P. Wei, J. Wang, D. Wu, and J. Cong. FPGA, page 273-279. ACM, (2021)

DietCode: Automatic Optimization for Dynamic Tensor Programs. B. Zheng, Z. Jiang, C. Yu, H. Shen, J. Fromm, Y. Liu, Y. Wang, L. Ceze, T. Chen, and G. Pekhimenko. MLSys, mlsys.org, (2022)

TensorIR: An Abstraction for Automatic Tensorized Program Optimization. S. Feng, B. Hou, H. Jin, W. Lin, J. Shao, R. Lai, Z. Ye, L. Zheng, C. Yu, Y. Yu and 1 other author(s). ASPLOS (2), page 804-817. ACM, (2023)

Overcoming Data Transfer Bottlenecks in DNN Accelerators via Layer-Conscious Memory Managment. X. Wei, Y. Liang, P. Zhang, C. Yu, and J. Cong. FPGA, page 120. ACM, (2019)

AutoDSE: Enabling Software Programmers to Design Efficient FPGA Accelerators. A. Sohrabizadeh, C. Yu, M. Gao, and J. Cong. ACM Trans. Design Autom. Electr. Syst., 27 (4): 32:1-32:27 (2022)

Efficiently Programming Large Language Models using SGLang. L. Zheng, L. Yin, Z. Xie, J. Huang, C. Sun, C. Yu, S. Cao, C. Kozyrakis, I. Stoica, J. Gonzalez and 2 other author(s). CoRR, (2023)

Disambiguation of "Yu, Cody Hao"