Author of the publication

OC-DNN: Exploiting Advanced Unified Memory Capabilities in CUDA 9 and Volta GPUs for Out-of-Core DNN Training.

, , , , and . HiPC, page 143-152. IEEE, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Networking and communication challenges for post-exascale systems., , and . Frontiers Inf. Technol. Electron. Eng., 19 (10): 1230-1235 (2018)Performance Characterization of using Quantization for DNN Inference on Edge Devices: Extended Version., , , , , , and . CoRR, (2023)Cross-layer Visualization and Profiling of Network and I/O Communication for HPC Clusters., , , and . CoRR, (2021)Highly Efficient Alltoall and Alltoallv Communication Algorithms for GPU Systems., , , , , and . IPDPS Workshops, page 24-33. IEEE, (2022)Designing High-Performance MPI Libraries with On-the-fly Compression for Modern GPU Clusters*., , , , , , and . IPDPS, page 444-453. IEEE, (2021)AccDP: Accelerated Data-Parallel Distributed DNN Training for Modern GPU-Based HPC Clusters., , , , and . HIPC, page 32-41. IEEE, (2022)Towards Architecture-aware Hierarchical Communication Trees on Modern HPC Systems., , , , , , , and . HiPC, page 272-281. IEEE, (2021)Lightning Talks of EduHPC 2022., , , , , , , , , and 16 other author(s). EduHPC@SC, page 42-49. IEEE, (2022)Efficient MPI-based Communication for GPU-Accelerated Dask Applications., , , and . CCGRID, page 277-286. IEEE, (2021)MPI4Spark Meets YARN: Enhancing MPI4Spark through YARN support for HPC., , , and . IEEE Big Data, page 2265-2274. IEEE, (2023)