Author of the publication

PLink: Discovering and Exploiting Locality for Accelerated Distributed Training on the public Cloud.

, , , , and . MLSys, mlsys.org, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Approximate Storage in Solid-State Memories., , , and . ACM Trans. Comput. Syst., 32 (3): 9:1-9:23 (2014)WORDA: A Winograd Offline-Runtime Decomposition Algorithm for Faster CNN Inference., , and . MWSCAS, page 46-49. IEEE, (2021)Cloud Collectives: Towards Cloud-aware Collectives forML Workloads with Rank Reordering., , , and . CoRR, (2021)Securing RDMA for High-Performance Datacenter Storage Systems., , , and . HotCloud, USENIX Association, (2020)Understanding RDMA Behavior in NUMA Systems., and . CGO, page 273-274. IEEE, (2019)Performance Evaluation of the Impact of NUMA on One-sided RDMA Interactions., and . SRDS, page 288-298. IEEE, (2020)TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches., , , , , , , and . NSDI, page 593-612. USENIX Association, (2023)Scaling Distributed Machine Learning with In-Network Aggregation., , , , , , , , , and . NSDI, page 785-808. USENIX Association, (2021)Latency-Tolerant Software Distributed Shared Memory., , , , , , and . USENIX Annual Technical Conference, page 291-305. USENIX Association, (2015)PLink: Discovering and Exploiting Locality for Accelerated Distributed Training on the public Cloud., , , , and . MLSys, mlsys.org, (2020)