From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MusicRL: Aligning Music Generation to Human Preferences.

G. Cideron, S. Girgin, M. Verzetti, D. Vincent, M. Kastelic, Z. Borsos, B. McWilliams, V. Ungureanu, O. Bachem, O. Pietquin, M. Geist, L. Hussenot, N. Zeghidour, и A. Agostinelli. CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Geoffrey Hinds

Geoffrey Coulson

Geoffrey Lund

Geoffrey Davis

Geoffrey Mwachala

Другие публикации лиц с тем же именем

Get Back Here: Robust Imitation by Return-to-Distribution Planning.G. Cideron, B. Tabanpour, S. Curi, S. Girgin, L. Hussenot, G. Dulac-Arnold, M. Geist, O. Pietquin, и R. Dadashi. CoRR, (2023)QD-RL: Efficient Mixing of Quality and Diversity in Reinforcement Learning.G. Cideron, T. Pierrot, N. Perrin, K. Beguir, и O. Sigaud. CoRR, (2020)HIGhER: Improving instruction following with Hindsight Generation for Experience Replay.G. Cideron, M. Seurin, F. Strub, и O. Pietquin. SSCI, стр. 225-232. IEEE, (2020)Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.P. Roit, J. Ferret, L. Shani, R. Aharoni, G. Cideron, R. Dadashi, M. Geist, S. Girgin, L. Hussenot, O. Keller и 9 other автор(ы). ACL (1), стр. 6252-6272. Association for Computational Linguistics, (2023)Self-Educated Language Agent with Hindsight Experience Replay for Instruction Following.G. Cideron, M. Seurin, F. Strub, и O. Pietquin. ViGIL@NeurIPS, (2019)vec2text with Round-Trip Translations.G. Cideron, S. Girgin, A. Raichuk, O. Pietquin, O. Bachem, и L. Hussenot. CoRR, (2022)WARM: On the Benefits of Weight Averaged Reward Models.A. Ramé, N. Vieillard, L. Hussenot, R. Dadashi, G. Cideron, O. Bachem, и J. Ferret. CoRR, (2024)Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.P. Roit, J. Ferret, L. Shani, R. Aharoni, G. Cideron, R. Dadashi, M. Geist, S. Girgin, L. Hussenot, O. Keller и 9 other автор(ы). CoRR, (2023)MusicRL: Aligning Music Generation to Human Preferences.G. Cideron, S. Girgin, M. Verzetti, D. Vincent, M. Kastelic, Z. Borsos, B. McWilliams, V. Ungureanu, O. Bachem, O. Pietquin и 4 other автор(ы). CoRR, (2024)Diversity policy gradient for sample efficient quality-diversity optimization.T. Pierrot, V. Macé, F. Chalumeau, A. Flajolet, G. Cideron, K. Beguir, A. Cully, O. Sigaud, и N. Perrin-Gilbert. GECCO, стр. 1075-1083. ACM, (2022)

Что такое BibSonomy?: С чего начать; Кнопки для браузера; Помощь
Разработчикам: Обзор; API-документация

Контакт и защита личных данных: о нас; Cookies; Сообщить о проблеме; BibSonomy Вики

Интеграция: PUMA; Расширение для TYPO3; Плагин для; Клиент Java REST; Поддерживаемые источники; далее

О BibSonomy: Команда; Блог; Список рассылки
Социальные сети: Наш Twitter