Author of the publication

Implementation of CG Method on GPU Cluster with Proprietary Interconnect TCA for GPU Direct Communication.

, , , , and . IPDPS Workshops, page 647-655. IEEE Computer Society, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Blocked All-Pairs Shortest Paths Algorithm for Hybrid CPU-GPU System., , and . HPCC, page 145-152. IEEE, (2011)Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU., , and . MCSoC, page 198-204. IEEE Computer Society, (2012)Improving Strong-Scaling on GPU Cluster Based on Tightly Coupled Accelerators Architecture., , , , , , and . CLUSTER, page 88-91. IEEE Computer Society, (2015)A Solution of the All-Pairs Shortest Paths Problem on the Cell Broadband Engine Processor., and . IEICE Trans. Inf. Syst., 92-D (6): 1225-1231 (2009)Incremental Principal Component Analysis Based on Adaptive Accumulation Ratio., , , and . ICONIP (1), volume 5506 of Lecture Notes in Computer Science, page 1196-1203. Springer, (2008)Implementation and performance evaluation of a communication-avoiding GMRES method for stencil-based code on GPU cluster., , , , and . J. Supercomput., 75 (12): 8115-8146 (2019)High Performance Software Systolic Array Computing of Multi-channel Convolution on a GPU., , and . ICCSA (1), volume 13375 of Lecture Notes in Computer Science, page 298-309. Springer, (2022)Matrix Multiply-Add in Min-plus Algebra on a Short-Vector SIMD Processor of Cell/B.E.., and . ICNC, page 272-274. IEEE Computer Society, (2010)Effectiveness of performance tuning techniques for general matrix multiplication on the PEZY-SC2., , and . HEART, page 8:1-8:6. ACM, (2019)Performance Tuning of Matrix Multiplication in OpenCL on Different GPUs and CPUs., , and . SC Companion, page 396-405. IEEE Computer Society, (2012)