Author of the publication

Repulsive Attention: Rethinking Multi-head Attention as Bayesian Inference.

EMNLP (1), page 236-255. Association for Computational Linguistics, (2020)


Other publications of authors with the same name

Architecture, Dataset and Model-Scale Agnostic Data-free Meta-Learning. CVPR, page 7736-7745. IEEE, (2023)

Energy-Efficient and QoS-Aware Computation Offloading in GEO/LEO Hybrid Satellite Networks. Remote. Sens., 15 (13): 3299 (July 2023)

Flexi-BOPI: Flexible granularity pipeline inference with Bayesian optimization for deep learning models on HMPSoC. Inf. Sci., (2024)

Customized Load Profiles Synthesis for Electricity Customers Based on Conditional Diffusion Models. IEEE Trans. Smart Grid, 15 (4): 4259-4270 (July 2024)

Improving Non-Transferable Representation Learning by Harnessing Content and Style. ICLR, OpenReview.net, (2024)

Data Augmented Flatness-aware Gradient Projection for Continual Learning. ICCV, page 5607-5616. IEEE, (2023)

Space4HGNN: A Novel, Modularized and Reproducible Platform to Evaluate Heterogeneous Graph Neural Network. SIGIR, page 2776-2789. ACM, (2022)

Task-Distributionally Robust Data-Free Meta-Learning. CoRR, (2023)

Bayesian Meta Sampling for Fast Uncertainty Adaptation. ICLR, OpenReview.net, (2020)

Representation Surgery for Multi-Task Model Merging. ICML, OpenReview.net, (2024)