On the Exploitability of Reinforcement Learning with Human Feedback for Large Language Models.

J. Wang, J. Wu, M. Chen, Y. Vorobeychik, and C. Xiao. CoRR abs/2311.09641 (2023)

Links and resources

BibTeX key: journals/corr/abs-2311-09641
entry type: article
year: 2023
journal: CoRR
volume: abs/2311.09641
ee: https://doi.org/10.48550/arXiv.2311.09641
url: http://dblp.uni-trier.de/db/journals/corr/corr2311.html#abs-2311-09641

Meta data

Last update 6 months ago
Created 10 months ago
