Author of the publication

Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU.

IPDPS, page 156-166. IEEE, (2023)

Other publications of authors with the same name

swTVM: Exploring the Automated Compilation for Deep Learning on Sunway Architecture. CoRR, (2019)

SparkOT: Diagnosing Operation Level Inefficiency in Spark. HPCC/SmartCity/DSS, page 692-699. IEEE, (2018)

Performance-Aware Based Correlated Datasets Replication Strategy. ISCTCS, volume 520 of Communications in Computer and Information Science, page 322-327. Springer, (2014)

Energy Efficiency Evaluation of Workload Execution on Intel Xeon Phi Coprocessor. ISCTCS, volume 426 of Communications in Computer and Information Science, page 268-275. Springer, (2013)

Improving the Parallelism of CESM on GPU. ICA3PP (2), volume 11945 of Lecture Notes in Computer Science, page 11-18. Springer, (2019)

BigRoots: An Effective Approach for Root-cause Analysis of Stragglers in Big Data System. CoRR, (2018)

L-DAG: Enabling Loopy Workflow in Scientific Application with Automatic DAG Transformation. DASC/PiCom/DataCom/CyberSciTech, page 946-953. IEEE, (2019)

Modeling Power Consumption of The Code Execution Using Performance Counters Statistics. PDCAT, page 381-385. IEEE, (2019)

Accelerating Sparse Cholesky Factorization on Sunway Manycore Architecture. IEEE Trans. Parallel Distributed Syst., 31 (7): 1636-1650 (2020)

Efficient detection of silent data corruption in HPC applications with synchronization-free message verification. J. Supercomput., 78 (1): 1381-1408 (2022)