Author of the publication

Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning.

, , , , , , and . ICPP, page 161-170. IEEE Computer Society, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

JAMILA: A Usable Batch Job Management System to Coordinate Heterogeneous Clusters and Diverse Applications over Grid or Cloud Infrastructure., , , and . NPC, volume 6289 of Lecture Notes in Computer Science, page 412-422. Springer, (2010)A Study of Database Performance Sensitivity to Experiment Settings., , , , , , , , and . Proc. VLDB Endow., 15 (7): 1439-1452 (2022)Understanding the Idiosyncrasies of Real Persistent Memory., , and . Proc. VLDB Endow., 14 (4): 626-639 (2020)Performance analysis of deep learning workloads using roofline trajectories., , and . CCF Trans. High Perform. Comput., 1 (3-4): 224-239 (2019)DLoBD: A Comprehensive Study of Deep Learning over Big Data Stacks on HPC Clusters., , , , and . IEEE Trans. Multi Scale Comput. Syst., 4 (4): 635-648 (2018)Accelerating Iterative Big Data Computing Through MPI., and . J. Comput. Sci. Technol., 30 (2): 283-294 (2015)NVMe-CR: A Scalable Ephemeral Storage Runtime for Checkpoint/Restart with NVMe-over-Fabrics., , and . IPDPS, page 172-181. IEEE, (2021)High performance MPI library over SR-IOV enabled infiniband clusters., , , , , and . HiPC, page 1-10. IEEE Computer Society, (2014)Performance Characterization of Large Language Models on High-Speed Interconnects., , , , and . HOTI, page 53-60. IEEE, (2023)Characterizing Lossy and Lossless Compression on Emerging BlueField DPU Architectures., , , and . HOTI, page 33-40. IEEE, (2023)