Author of the publication

SHARP: An Adaptable, Energy-Efficient Accelerator for Recurrent Neural Networks.

, , , , , and . ACM Trans. Embed. Comput. Syst., 22 (2): 30:1-30:23 (March 2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks., , , , , , , , , and 2 other author(s). CoRR, (2023)A Practical Dynamic Buffer Overflow Detector., and . NDSS, The Internet Society, (2004)ZeRO-Offload: Democratizing Billion-Scale Model Training., , , , , , , and . USENIX Annual Technical Conference, page 551-564. USENIX Association, (2021)SimiGrad: Fine-Grained Adaptive Batching for Large Scale Training using Gradient Similarity Measurement., , , , , and . NeurIPS, page 20531-20544. (2021)Guardrail: a high fidelity approach to protecting hardware devices from buggy drivers., , , and . ASPLOS, page 655-670. ACM, (2014)SERF: efficient scheduling for fast deep neural network serving via judicious parallelism., , , and . SC, page 300-311. IEEE Computer Society, (2016)Performance Modeling and Scalability Optimization of Distributed Deep Learning Systems., , , and . KDD, page 1355-1364. ACM, (2015)HyperDrive: exploring hyperparameters with POP scheduling., , , , and . Middleware, page 1-13. ACM, (2017)FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design., , , , , , , , , and 3 other author(s). CoRR, (2024)BLOOM: A 176B-Parameter Open-Access Multilingual Language Model., , , , , , , , , and 39 other author(s). CoRR, (2022)