Author of the publication

Analyzing MPI-3.0 Process-Level Shared Memory: A Case Study with Stencil Computations.

, , , , , and . CCGRID, page 1099-1106. IEEE Computer Society, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

P-DOT: a model of computation for big data., , , and . IJPEDS, 31 (3): 233-253 (2016)Development of a Scalable Solver for the Earth's Core Convection., , and . HPCA (China), volume 5938 of Lecture Notes in Computer Science, page 497-502. Springer, (2009)CRSD: Application Specific Auto-tuning of SpMV for Diagonal Sparse Matrices., , , , , and . Euro-Par (2), volume 6853 of Lecture Notes in Computer Science, page 316-327. Springer, (2011)Efficient parallel optimizations of a high-performance SIFT on GPUs., , , , , , and . J. Parallel Distributed Comput., (2019)基于OpenCL的直方图生成算法优化方法研究 (Research on Histogram Generation Algorithm Optimization Based on OpenCL)., , and . 计算机科学, 42 (11): 32-36 (2015)The static parallel distribution algorithms for hybrid density-functional calculations in HONPAS package., , , , , , and . Int. J. High Perform. Comput. Appl., (2020)Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms., , , , and . IEEE Trans. Parallel Distributed Syst., 32 (7): 1702-1712 (2021)IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM., , , , , , and . CoRR, (2022)Function Prediction of Proteins in Yeast Networks Based on the MCL Algorithm., and . J. Softw., 9 (5): 1157-1162 (2014)Memory Efficient Two-Pass 3D FFT Algorithm for Intel® Xeon PhiTM Coprocessor., , , and . J. Comput. Sci. Technol., 29 (6): 989-1002 (2014)