Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Sentinel: Runtime Data Management on Heterogeneous Main MemorySystems for Deep Learning., , , , and . CoRR, (2019)DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention., , , , , , , , and . CoRR, (2023)Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs., , , , , , , and . CoRR, (2022)Speed-ANN: Low-Latency and High-Accuracy Nearest Neighbor Search via Intra-Query Parallelism., , , , and . CoRR, (2022)Random-LTD: Random and Layerwise Token Dropping Brings Efficient Training for Large-scale Transformers., , , , , , and . CoRR, (2022)DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing., , , , and . CoRR, (2022)Valor: efficient, software-only region conflict exceptions., , , and . OOPSLA, page 241-259. ACM, (2015)DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing., , , , , , and . AAAI, page 18490-18498. AAAI Press, (2024)Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination., , , and . SIGMOD Conference, page 2539-2554. ACM, (2020)DUET: A Compiler-Runtime Subgraph Scheduling Approach for Tensor Programs on a Coupled CPU-GPU Architecture., , and . IPDPS, page 151-161. IEEE, (2021)