Author of the publication

Monitoring strategies for scalable dynamic checkpointing.

, and . IGSC, page 1-8. IEEE Computer Society, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

An Oracle for Guiding Large-Scale Model/Hybrid Parallel Training of Convolutional Neural Networks., , , , , and . HPDC, page 161-173. ACM, (2021)LEGaTO: Low-Energy, Secure, and Resilient Toolset for Heterogeneous Computing., , , , , , , , , and 33 other author(s). DATE, page 169-174. IEEE, (2020)FPGA Checkpointing for Scientific Computing., , and . IOLTS, page 1-7. IEEE, (2021)Checkpoint Restart Support for Heterogeneous HPC Applications., , , and . CCGRID, page 242-251. IEEE, (2020)Autopsy of Ethereum's Post-Merge Reward System., , , and . ICBC, page 1-9. IEEE, (2023)A Study of Checkpointing in Large Scale Training of Deep Neural Networks., , , , and . CoRR, (2020)A Framework for Large Scale Particle Filters Validated with Data Assimilation for Weather Simulation., , , , and . CoRR, (2023)Unprotected computing: a large-scale study of DRAM raw error rate on a supercomputer., , , and . SC, page 645-655. IEEE Computer Society, (2016)Towards Ad Hoc Recovery for Soft Errors., , , and . FTXS@SC, page 1-10. IEEE, (2018)Application-Level Differential Checkpointing for HPC Applications with Dynamic Datasets., and . CCGRID, page 52-61. IEEE, (2019)