Author of the publication

SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization.

, , , , , , , and . CVPR, page 11216-11225. Computer Vision Foundation / IEEE, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

FlexSaaS: A Reconfigurable Accelerator for Web Search Selection., , , , , , , , and . ACM Trans. Reconfigurable Technol. Syst., 12 (1): 5:1-5:20 (2019)AFPQ: Asymmetric Floating Point Quantization for LLMs., , , , , , and . CoRR, (2023)Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference., , , , , , , and . CoRR, (2023)Inverse model and adaptive neighborhood search based cooperative optimizer for energy-efficient distributed flexible job shop scheduling., , , and . Swarm Evol. Comput., (December 2023)Dense-to-Sparse Gate for Mixture-of-Experts., , , , , , , , and . CoRR, (2021)Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models., , , , , , , , and . CoRR, (2023)NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors., , , , , , , and . MobiSys, page 70-83. ACM, (2023)SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization., , , , , , , and . CVPR, page 11216-11225. Computer Vision Foundation / IEEE, (2019)Information Technology Education Based on Cloud Computing., , , , and . ICICA (1), volume 391 of Communications in Computer and Information Science, page 417-426. Springer, (2013)BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation., , , , , , and . CoRR, (2024)