Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to that person.

 

Other publications of authors with the same name

Towards efficient vision transformer inference: a first study of transformers on mobile devices. HotMobile, pages 1-7. ACM, (2022)

Boosting Mobile CNN Inference through Semantic Memory. ACM Multimedia, pages 2362-2371. ACM, (2021)

nn-METER: Towards Accurate Latency Prediction of DNN Inference on Diverse Edge Devices. GetMobile Mob. Comput. Commun., 25 (4): 19-23 (2021)

Fast Hardware-Aware Neural Architecture Search. CVPR Workshops, pages 2959-2967. Computer Vision Foundation / IEEE, (2020)

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference. ICCV, pages 5796-5805. IEEE, (2023)

LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search. NSDI, USENIX Association, (2024)

Boosting LLM Reasoning: Push the Limits of Few-shot Learning with Reinforced In-Context Pruning. CoRR, (2023)

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices. ICCV, pages 5806-5817. IEEE, (2023)

SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance. CIKM, pages 3654-3663. ACM, (2022)

LUT-NN: Towards Unified Neural Network Inference by Table Lookup. CoRR, (2023)