Author of the publication

DeepSpeed: System Optimizations Enable Training Deep Learning Models with Over 100 Billion Parameters.

, , , and . KDD, page 3505-3506. ACM, (2020)

Other publications of authors with the same name

The Cilkview scalability analyzer., , and . SPAA, page 145-156. ACM, (2010)

Scheduling for data center interactive services., and . Allerton, page 1170-1181. IEEE, (2011)

GRIP: Multi-Store Capacity-Optimized High-Performance Nearest Neighbor Search for Vector Search Engine., and . CIKM, page 1673-1682. ACM, (2019)

Better Caching in Search Advertising Systems with Rapid Refresh Predictions., , , , and . WWW, page 1875-1884. ACM, (2018)

ZeRO-Offload: Democratizing Billion-Scale Model Training., , , , , , , and . USENIX Annual Technical Conference, page 551-564. USENIX Association, (2021)

QACO: exploiting partial execution in web servers., , , , and . CAC, page 12:1-12:10. ACM, (2013)

ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks., , , , , , , , , and 2 other author(s). CoRR, (2023)

Scalable and Efficient MoE Training for Multitask Multilingual Models., , , , , , , , and . CoRR, (2021)

Online Resource Management for Carbon-Neutral Cloud Computing., , , and . Handbook on Data Centers, Springer, (2015)

ScaLA: Accelerating Adaptation of Pre-Trained Transformer-Based Language Models via Efficient Large-Batch Adversarial Noise., , and . CoRR, (2022)