Article,

Fine-Tuning Language Models from Human Preferences.

D. Ziegler, N. Stiennon, J. Wu, T. Brown, A. Radford, D. Amodei, P. Christiano, and G. Irving.
CoRR, (2019)

Meta data

BibTeX key: journals/corr/abs-1909-08593
entry type: article
year: 2019
journal: CoRR
volume: abs/1909.08593
ee: http://arxiv.org/abs/1909.08593
url: https://arxiv.org/pdf/1909.08593

Tags

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on