From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Get Back Here: Robust Imitation by Return-to-Distribution Planning., , , , , , , , и . CoRR, (2023)QD-RL: Efficient Mixing of Quality and Diversity in Reinforcement Learning., , , , и . CoRR, (2020)HIGhER: Improving instruction following with Hindsight Generation for Experience Replay., , , и . SSCI, стр. 225-232. IEEE, (2020)Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback., , , , , , , , , и 9 other автор(ы). ACL (1), стр. 6252-6272. Association for Computational Linguistics, (2023)Self-Educated Language Agent with Hindsight Experience Replay for Instruction Following., , , и . ViGIL@NeurIPS, (2019)vec2text with Round-Trip Translations., , , , , и . CoRR, (2022)WARM: On the Benefits of Weight Averaged Reward Models., , , , , , и . CoRR, (2024)Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback., , , , , , , , , и 9 other автор(ы). CoRR, (2023)MusicRL: Aligning Music Generation to Human Preferences., , , , , , , , , и 4 other автор(ы). CoRR, (2024)Diversity policy gradient for sample efficient quality-diversity optimization., , , , , , , , и . GECCO, стр. 1075-1083. ACM, (2022)