Adversarially Regularized Policy Learning Guided by Trajectory Optimization.

L4DC, volume 168 of Proceedings of Machine Learning Research, pp. 844-857. PMLR, (2022)


Other publications of persons with the same name

MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation. NAACL-HLT, pp. 1610-1623. Association for Computational Linguistics, (2022)
Tensor maps for synchronizing heterogeneous shape collections. ACM Trans. Graph., 38 (4): 106:1-106:18 (2019)
A Hypergradient Approach to Robust Regression without Correspondence. CoRR, (2020)
SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process. ICML, volume 202 of Proceedings of Machine Learning Research, pp. 20210-20220. PMLR, (2023)
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach. EMNLP (1), pp. 6562-6577. Association for Computational Linguistics, (2021)
Self-Training with Differentiable Teacher. NAACL-HLT (Findings), pp. 933-949. Association for Computational Linguistics, (2022)
Context-Aware Query Rewriting for Improving Users' Search Experience on E-commerce Websites. CoRR, (2022)
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing. CoRR, (2023)
Adversarially Regularized Policy Learning Guided by Trajectory Optimization. L4DC, volume 168 of Proceedings of Machine Learning Research, pp. 844-857. PMLR, (2022)
Less is More: Task-aware Layer-wise Distillation for Language Model Compression. ICML, volume 202 of Proceedings of Machine Learning Research, pp. 20852-20867. PMLR, (2023)