Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Densifying Assumed-sparse Tensors: Improving Memory Efficiency and MPI Collective Performance during Tensor Accumulation for Parallelized Training of Neural Machine Translation Models., , , , , , , , , and 4 other author(s). CoRR, (2019)NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model., , , , and . LCPC, volume 8967 of Lecture Notes in Computer Science, page 67-81. Springer, (2014)Exploring Programming Multi-GPUs Using OpenMP and OpenACC-Based Hybrid Model., , and . IPDPS Workshops, page 1169-1176. IEEE, (2013)Implementing the OpenACC Data Model., , , , , , and . IPDPS Workshops, page 662-672. IEEE Computer Society, (2017)Compiling a High-Level Directive-Based Programming Model for GPGPUs., , , , , and . LCPC, volume 8664 of Lecture Notes in Computer Science, page 105-120. Springer, (2013)SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance., , , , , , , , , and 14 other author(s). PMBS@SC, volume 8966 of Lecture Notes in Computer Science, page 46-67. Springer, (2014)Densifying Assumed-Sparse Tensors - Improving Memory Efficiency and MPI Collective Performance During Tensor Accumulation for Parallelized Training of Neural Machine Translation Models., , , , , , , , , and 4 other author(s). ISC, volume 11501 of Lecture Notes in Computer Science, page 23-39. Springer, (2019)An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling., , , and . ISC, volume 9697 of Lecture Notes in Computer Science, page 3-20. Springer, (2016)Deep Learning at Scale on NVIDIA V100 Accelerators., , and . PMBS@SC, page 23-32. IEEE, (2018)Filesystem Aware Scalable I/O Framework for Data-Intensive Parallel Applications., , and . IPDPS Workshops, page 2007-2014. IEEE, (2013)