Author of the publication

Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size.

, , , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 5731-5741. PMLR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Parity Models: A General Framework for Coding-Based Resilience in ML Inference., , and . CoRR, (2019)EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding., , , , and . OSDI, page 401-417. USENIX Association, (2016)Learning-Based Coded Computation., , and . IEEE J. Sel. Areas Inf. Theory, 1 (1): 227-236 (2020)Vantage: optimizing video upload for time-shifted viewing of social live streams., , , and . SIGCOMM, page 380-393. ACM, (2019)EVT: Accelerating Deep Learning Training with Epilogue Visitor Tree., , , , , , and . ASPLOS (3), page 301-316. ACM, (2024)Parity models: erasure-coded resilience for prediction serving systems., , and . SOSP, page 30-46. ACM, (2019)Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding., , , , and . Proc. VLDB Endow., 16 (11): 3137-3150 (2023)Arithmetic-intensity-guided fault tolerance for neural network inference on GPUs., and . SC, page 79. ACM, (2021)Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size., , , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 5731-5741. PMLR, (2021)Rethinking Erasure-Coding Libraries in the Age of Optimized Machine Learning., , and . HotStorage, ACM, (2024)