Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.


Other publications of authors with the same name

Implicit Neural Video Compression. CoRR, (2021)
The LLM Surgeon. CoRR, (2023)
Neural Network Quantization with AI Model Efficiency Toolkit (AIMET). CoRR, (2022)
QBitOpt: Fast and Accurate Bitwidth Reallocation during Training. ICCV (Workshops), pages 1274-1283. IEEE, (2023)
Understanding and Overcoming the Challenges of Efficient Transformer Quantization. EMNLP (1), pages 7947-7969. Association for Computational Linguistics, (2021)
Up or Down? Adaptive Rounding for Post-Training Quantization. ICML, volume 119 of Proceedings of Machine Learning Research, pages 7197-7206. PMLR, (2020)
A Practical Mixed Precision Algorithm for Post-Training Quantization. BMVC Workshop, BMVA Press, (2023)
FP8 versus INT8 for efficient deep learning inference. CoRR, (2023)
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing. CoRR, (2023)
Bayesian Bits: Unifying Quantization and Pruning. NeurIPS, (2020)