Author of the publication

Optimized Non-contiguous MPI Datatype Communication for GPU Clusters: Design, Implementation and Evaluation with MVAPICH2.

CLUSTER, pages 308-316. IEEE Computer Society, (2011)

Other publications of authors with the same name

A Trip-Based Multicasting Model in Wormhole-Routed Networks with Virtual Channels. IEEE Trans. Parallel Distributed Syst., 7 (2): 138-150 (1996)
High Performance RDMA-Based MPI Implementation over InfiniBand. Int. J. Parallel Program., 32 (3): 167-198 (2004)
Scalable Distributed DNN Training using TensorFlow and CUDA-Aware MPI: Characterization, Designs, and Performance Evaluation. CoRR, (2018)
Demotion-based exclusive caching through demote buffering: design and evaluations over different networks. SNAPI@PACT, pages 73-80. ACM, (2003)
Communication-Aware Hardware-Assisted MPI Overlap Engine. ISC, volume 12151 of Lecture Notes in Computer Science, pages 517-535. Springer, (2020)
HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training with TensorFlow. ISC, volume 12151 of Lecture Notes in Computer Science, pages 83-103. Springer, (2020)
"Hey CAI" - Conversational AI Enabled User Interface for HPC Tools. ISC, volume 13289 of Lecture Notes in Computer Science, pages 87-108. Springer, (2022)
BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs. ISC, volume 12728 of Lecture Notes in Computer Science, pages 18-37. Springer, (2021)
Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters. ISC, volume 13289 of Lecture Notes in Computer Science, pages 109-130. Springer, (2022)
Simulation of Modern Parallel Systems: A CSIM-based Approach. WSC, pages 1013-1020. ACM, (1997)