copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback.

A. Havrilla, M. Zhuravinskyi, D. Phung, A. Tiwari, J. Tow, S. Biderman, Q. Anthony, and L. Castricato. EMNLP, page 8578-8595. Association for Computational Linguistics, (2023)

BibTeX key: conf/emnlp/HavrillaZPTTBAC23
entry type: inproceedings
booktitle: EMNLP
year: 2023
pages: 8578-8595
publisher: Association for Computational Linguistics
crossref: conf/emnlp/2023
ee: https://aclanthology.org/2023.emnlp-main.530
isbn: 979-8-89176-060-8
url: http://dblp.uni-trier.de/db/conf/emnlp/emnlp2023.html#HavrillaZPTTBAC23

There is no review or comment yet. You can write one!

BibSonomy