Author of the publication

Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks.

, , , , , and . HPCA, page 78-91. IEEE Computer Society, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Structurally Sparsified Backward Propagation for Faster Long Short-Term Memory Training., , , , , and . CoRR, (2018)Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training., , and . CoRR, (2020)FPGA-Accelerated Data Preprocessing for Personalized Recommendation Systems., , and . IEEE Comput. Archit. Lett., 23 (1): 7-10 (January 2024)Understanding the Implication of Non-Volatile Memory for Large-Scale Graph Neural Network Training., , and . IEEE Comput. Archit. Lett., 20 (2): 118-121 (2021)Pathfinding Future PIM Architectures by Demystifying a Commercial PIM Technology., , , and . CoRR, (2023)PREMA: A Predictive Multi-Task Scheduling Algorithm For Preemptible Neural Processing Units., and . HPCA, page 220-233. IEEE, (2020)Trident: A Hybrid Correlation-Collision GPU Cache Timing Attack for AES Key Recovery., , , , , , and . HPCA, page 332-344. IEEE, (2021)BTS: an accelerator for bootstrappable fully homomorphic encryption., , , , , , and . ISCA, page 711-725. ACM, (2022)Architecture design of a high-performance dual-symbol binary arithmetic coder for JPEG2000., and . ICIP, page 2665-2668. IEEE, (2009)Beyond the Memory Wall: A Case for Memory-Centric HPC System for Deep Learning., and . MICRO, page 148-161. IEEE Computer Society, (2018)