Author of the publication

Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor.

, , , , and . ICPP, page 432-441. IEEE Computer Society, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Mixed-Precision AMG method for Many Core Accelerators., , , and . EuroMPI/ASIA, page 127. ACM, (2014)Poster reception - Scalable software infrastructure project., , , and . SC, page 140. ACM Press, (2006)Fast Conjugate Gradients with Multiple GPUs., , and . ICCS (1), volume 5544 of Lecture Notes in Computer Science, page 893-903. Springer, (2009)Accelerating data transfer between host and device using idle GPU., and . GPGPU@PPoPP, page 2:1-2:6. ACM, (2022)High Performance 3D Convolution for Protein Docking on IBM Blue Gene., , , and . ISPA, volume 4742 of Lecture Notes in Computer Science, page 958-969. Springer, (2007)Evaluating the SW26010 many-core processor with a micro-benchmark suite for performance optimizations., , , , and . Parallel Comput., (2018)High performance conjugate gradient solver on multi-GPU clusters using hypergraph partitioning., , and . Comput. Sci. Res. Dev., 25 (1-2): 83-91 (2010)Efficient Execution of Multiple CUDA Applications Using Transparent Suspend, Resume and Migration., , and . Euro-Par, volume 9233 of Lecture Notes in Computer Science, page 687-699. Springer, (2015)Statistical power modeling of GPU kernels using performance counters., , , , and . Green Computing Conference, page 115-122. IEEE Computer Society, (2010)Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor., , , , and . ICPP, page 432-441. IEEE Computer Society, (2017)