Author of the publication

Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters.

, , , , , , and . ISC, volume 13289 of Lecture Notes in Computer Science, page 3-25. Springer, (2022)

Other publications of authors with the same name

Application-bypass reduction for large-scale clusters., , , and . Int. J. High Perform. Comput. Netw., 2 (2/3/4): 99-109 (2004)

Demotion-based exclusive caching through demote buffering: design and evaluations over different networks., , and . SNAPI@PACT, page 73-80. ACM, (2003)

Architectural Design of Orthogonal Multiprocessor for Multidimensional Information Processing., and . J. Inf. Sci. Eng., 7 (4): 459-485 (1991)

Accelerating communication with multi-HCA aware collectives in MPI., , , , , , and . Concurr. Comput. Pract. Exp., (2024)

A Parallel-Serial Binary Arbitration Scheme for Collision-Free Multi-Access Techniques., and . Comput. Networks, (1988)

Performance characterization of hadoop workloads on SR-IOV-enabled virtualized InfiniBand clusters., , and . BDCAT, page 36-45. ACM, (2016)

DistMILE: A Distributed Multi-Level Framework for Scalable Graph Embedding., , , , , and . HiPC, page 282-291. IEEE, (2021)

Large-Message Nonblocking MPI_Iallgather and MPI Ibcast Offload via BlueField-2 DPU., , , , , and . HiPC, page 388-393. IEEE, (2021)

Communication-Aware Hardware-Assisted MPI Overlap Engine., , , , , , , and . ISC, volume 12151 of Lecture Notes in Computer Science, page 517-535. Springer, (2020)

HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training with TensorFlow., , , , and . ISC, volume 12151 of Lecture Notes in Computer Science, page 83-103. Springer, (2020)