Author of the publication

Adaptive Contrastive Knowledge Distillation for BERT Compression.

, , , , , , and . ACL (Findings), page 8941-8953. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

LAW: Learning to Auto Weight., , , , , , and . CoRR, (2019)Improved adaptive gray wolf genetic algorithm for photovoltaic intelligent edge terminal optimal configuration., , , , , and . Comput. Electr. Eng., (2021)E^2-LLM: Efficient and Extreme Length Extension of Large Language Models., , , , , , , , , and 4 other author(s). CoRR, (2024)A Conflict-Aware Capacity Control Mechanism for Deep Cache Hierarchy., , and . IEICE Trans. Inf. Syst., 105-D (6): 1150-1163 (2022)ICD-Face: Intra-class Compactness Distillation for Face Recognition., , , , , , and . ICCV, page 20985-20995. IEEE, (2023)Adaptive Contrastive Knowledge Distillation for BERT Compression., , , , , , and . ACL (Findings), page 8941-8953. Association for Computational Linguistics, (2023)Knowledge Distillation via Route Constrained Optimization., , , , , , , and . ICCV, page 1345-1354. IEEE, (2019)Deep 3D Vessel Segmentation based on Cross Transformer Network., , , , , , and . BIBM, page 1115-1120. IEEE, (2022)LogLG: Weakly Supervised Log Anomaly Detection via Log-Event Graph Construction., , , , , , , , and . DASFAA (4), volume 13946 of Lecture Notes in Computer Science, page 490-501. Springer, (2023)MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues., , , , , , , , , and 1 other author(s). CoRR, (2024)