Author of the publication

Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor.

ICPP, page 422-431. IEEE Computer Society, (2017)

Other publications of authors with the same name

Extreme-Scale Realistic Stencil Computations on Sunway TaihuLight with Ten Million Cores. CCGrid, page 566-571. IEEE Computer Society, (2018)

26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight. IPDPS, page 535-544. IEEE Computer Society, (2017)

Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity. CoRR, (2020)

Performance Optimization of the HPCG Benchmark on the Sunway TaihuLight Supercomputer. ACM Trans. Archit. Code Optim., 15 (1): 11:1-11:20 (2018)

AutoWM: a novel domain-specific tool for universal multi-/many-core accelerations of the WRF cloud microphysics. Clust. Comput., 24 (2): 935-951 (2021)

Extreme-Scale High-Order WENO Simulations of 3-D Detonation Wave with 10 Million Cores. ACM Trans. Archit. Code Optim., 15 (2): 26:1-26:21 (2018)

End-to-end Adaptive Distributed Training on PaddlePaddle. CoRR, (2021)

Solving a trillion unknowns per second with HPGMG on Sunway TaihuLight. Clust. Comput., 23 (2): 493-507 (2020)

Performance Evaluation of HPGMG on Tianhe-2: Early Experience. ICA3PP (4), volume 9531 of Lecture Notes in Computer Science, page 230-243. Springer, (2015)

Pattern-Driven Hybrid Multi- and Many-Core Acceleration in the MPAS Shallow-Water Model. ICPP, page 71-80. IEEE Computer Society, (2015)