Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Proteus: Simulating the Performance of Distributed DNN Training.

J. Duan, X. Li, P. Xu, X. Zhang, S. Yan, Y. Liang, and D. Lin. CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Yan Yan

Shanshan Yan

Americanization in China: an analysis of General Motors and its strategies in ChinaS. Yan. Uni Mainz, (2009)

Yan Xu

Untersuchungen zur Verleimung von Holz und Holzspanplatten mit UF-Leimharzen und PMDIY. Xu. TU Braunschweig, (2009)

Yan Zhang

Yan Xu

Other publications of authors with the same name

面向GPU计算平台的归约算法的性能优化研究 (Study on Performance Optimization of Reduction Algorithm Targeting GPU Computing Platform).Y. Zhang, L. Chen, X. An, and S. Yan. 计算机科学, 46 (2): 306-314 (2019)GPURoofline: A Model for Guiding Performance Optimizations on GPUs.H. Jia, Y. Zhang, G. Long, J. Xu, S. Yan, and Y. Li. Euro-Par, volume 7484 of Lecture Notes in Computer Science, page 920-932. Springer, (2012)An Insightful Program Performance Tuning Chain for GPU Computing.H. Jia, Y. Zhang, G. Long, and S. Yan. ICA3PP (1), volume 7439 of Lecture Notes in Computer Science, page 502-516. Springer, (2012)Proteus: Simulating the Performance of Distributed DNN Training.J. Duan, X. Li, P. Xu, X. Zhang, S. Yan, Y. Liang, and D. Lin. CoRR, (2023)EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers.L. Jiang, P. Xu, Q. Zhu, X. Li, S. Yan, X. Zhang, D. Lin, W. Ma, Z. Li, J. Liu and 3 other author(s). ICPP, page 54:1-54:11. ACM, (2022)DIESEL+: Accelerating Distributed Deep Learning Tasks on Image Datasets.L. Wang, Q. Luo, and S. Yan. IEEE Trans. Parallel Distributed Syst., 33 (5): 1173-1184 (2022)LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K.T. Yuan, X. Ning, D. Zhou, Z. Yang, S. Li, M. Zhuang, Z. Tan, Z. Yao, D. Lin, B. Li and 3 other author(s). CoRR, (2024)GradientFlow: Optimizing Network Performance for Large-Scale Distributed DNN Training.P. Sun, Y. Wen, R. Han, W. Feng, and S. Yan. IEEE Trans. Big Data, 8 (2): 495-507 (2022)Characterization and prediction of deep learning workloads in large-scale GPU datacenters.Q. Hu, P. Sun, S. Yan, Y. Wen, and T. Zhang. SC, page 104. ACM, (2021)AMOS: enabling automatic mapping for tensor computations on spatial accelerators with hardware abstraction.S. Zheng, R. Chen, A. Wei, Y. Jin, Q. Han, L. Lu, B. Wu, X. Li, S. Yan, and Y. Liang. ISCA, page 874-887. ACM, (2022)

BibSonomy

Disambiguation of "Yan, Shengen"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Proteus: Simulating the Performance of Distributed DNN Training.

Please choose a person to relate this publication to

Yan Yan

Shanshan Yan

Yan Xu

Yan Zhang

Yan Xu

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Yan, Shengen"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Proteus: Simulating the Performance of Distributed DNN Training.

Please choose a person to relate this publication to

Yan Yan

Shanshan Yan

Yan Xu

Yan Zhang

Yan Xu

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Proteus: Simulating the Performance of Distributed DNN Training.