Author of the publication

Multi-level Optimization of Matrix Multiplication for GPU-equipped Systems.

, , , , and . ICCS, volume 4 of Procedia Computer Science, page 342-351. Elsevier, (2011)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Improving Strong-Scaling on GPU Cluster Based on Tightly Coupled Accelerators Architecture., , , , , , and . CLUSTER, page 88-91. IEEE Computer Society, (2015)Blocked All-Pairs Shortest Paths Algorithm for Hybrid CPU-GPU System., , and . HPCC, page 145-152. IEEE, (2011)Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU., , and . MCSoC, page 198-204. IEEE Computer Society, (2012)Implementation and Evaluation of NAS Parallel CG Benchmark on GPU Cluster with Proprietary Interconnect TCA., , , and . VECPAR, volume 10150 of Lecture Notes in Computer Science, page 135-145. Springer, (2016)Implementation of CG Method on GPU Cluster with Proprietary Interconnect TCA for GPU Direct Communication., , , , and . IPDPS Workshops, page 647-655. IEEE Computer Society, (2015)Blocked United Algorithm for the All-Pairs Shortest Paths Problem on Hybrid CPU-GPU Systems., , and . IEICE Trans. Inf. Syst., 95-D (12): 2759-2768 (2012)Performance Tuning of Matrix Multiplication in OpenCL on Different GPUs and CPUs., , and . SC Companion, page 396-405. IEEE Computer Society, (2012)Effectiveness of performance tuning techniques for general matrix multiplication on the PEZY-SC2., , and . HEART, page 8:1-8:6. ACM, (2019)Application of a communication-avoiding generalized minimal residual method to a gyrokinetic five dimensional eulerian code on many core platforms., , , , , , and . ScalA@SC, page 7:1-7:8. ACM, (2017)Brain-inspired Co-design of Algorithm/Architecture for CNN Accelerators., , and . IIAI-AAI, page 556-560. IEEE, (2019)