Author of the publication

Fast and accurate variable batch size convolution neural network training on large scale distributed systems.

, , , and . Concurr. Comput. Pract. Exp., (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Poster: revisiting virtual channel memory for performance and fairness on multi-core architecture., , , , , and . ICS, page 379. ACM, (2011)Exploiting Parallelization for RNA Secondary Structure Prediction in Cluster., , and . International Conference on Computational Science (3), volume 3516 of Lecture Notes in Computer Science, page 979-982. Springer, (2005)Optimizing stencil code via locality of computation., and . PACT, page 477-478. ACM, (2014)Implementation of the Smith-Waterman algorithm on a reconfigurable supercomputing platform., , and . HPRCTA, page 39-48. ACM Press, (2007)A coarse-grained stream architecture for cryo-electron microscopy images 3D reconstruction., , , , , , and . FPGA, page 143-152. ACM, (2012)T2HT : Traffic-Driven Machine Learning Based Hierarchical Topology Generation Model., , , , , and . ICPADS, page 275-283. IEEE, (2019)Implementation of Short Read Alignment Algorithm in OpenCL on Xeon Phi Coprocessor., , and . HPCC/CSS/ICESS, page 1633-1636. IEEE, (2015)A New Traffic Offloading Method with Slow Switching Optical Device in Exascale Computer., , , , and . ICCD, page 138-146. IEEE, (2019)边缘海静力数值预报模式并行算法研究 (Parallelization of Hydrostatic Numerical Forecasting Model of Marginal Sea)., , , , , and . 计算机科学, 43 (1): 14-17 (2016)Fast Data-Obtaining Algorithm for Data Assimilation with Large Data Set., , , , and . Int. J. Parallel Program., 48 (4): 750-770 (2020)