Author of the publication

Scaling TensorFlow, PyTorch, and MXNet using MVAPICH2 for High-Performance Deep Learning on Frontera.

, , , and . DLS@SC, page 76-83. IEEE, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Networking and communication challenges for post-exascale systems., , and . Frontiers Inf. Technol. Electron. Eng., 19 (10): 1230-1235 (2018)Performance Characterization of using Quantization for DNN Inference on Edge Devices: Extended Version., , , , , , and . CoRR, (2023)Cross-layer Visualization and Profiling of Network and I/O Communication for HPC Clusters., , , and . CoRR, (2021)Efficient MPI-based Communication for GPU-Accelerated Dask Applications., , , and . CCGRID, page 277-286. IEEE, (2021)AccDP: Accelerated Data-Parallel Distributed DNN Training for Modern GPU-Based HPC Clusters., , , , and . HIPC, page 32-41. IEEE, (2022)Towards Architecture-aware Hierarchical Communication Trees on Modern HPC Systems., , , , , , , and . HiPC, page 272-281. IEEE, (2021)Lightning Talks of EduHPC 2022., , , , , , , , , and 16 other author(s). EduHPC@SC, page 42-49. IEEE, (2022)MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators., , , , , , and . SC Workshops, page 847-854. ACM, (2023)MPI4Spark Meets YARN: Enhancing MPI4Spark through YARN support for HPC., , , and . IEEE Big Data, page 2265-2274. IEEE, (2023)Designing High-Performance MPI Libraries with On-the-fly Compression for Modern GPU Clusters*., , , , , , and . IPDPS, page 444-453. IEEE, (2021)