Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks., , , , , , , , and . ISCA, page 27-40. ACM, (2017)HAMMER: Hardware-Friendly Approximate Computing for Self-Attention With Mean-Redistribution And Linearization., , , and . IEEE Comput. Archit. Lett., 22 (1): 13-16 (January 2023)Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference., , , , , , , and . CoRR, (2023)A locality-aware memory hierarchy for energy-efficient GPU architectures., , , and . MICRO, page 86-98. ACM, (2013)ARK: Fully Homomorphic Encryption Accelerator with Runtime Data Generation and Inter-Operation Key Reuse., , , , , , and . MICRO, page 1237-1254. IEEE, (2022)Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations., , , and . ISCA, page 968-981. IEEE, (2020)GPU-based Private Information Retrieval for On-Device Machine Learning Inference., , , , , , , , , and 4 other author(s). ASPLOS (1), page 197-214. ACM, (2024)Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training., , and . CoRR, (2020)PREMA: A Predictive Multi-Task Scheduling Algorithm For Preemptible Neural Processing Units., and . HPCA, page 220-233. IEEE, (2020)Understanding the Implication of Non-Volatile Memory for Large-Scale Graph Neural Network Training., , and . IEEE Comput. Archit. Lett., 20 (2): 118-121 (2021)