Article,

Reinforcement Learning Fine-tuning of Language Models is Biased Towards More Extractable Features.

D. Cruz, E. Pona, A. Holness-Tofts, E. Schmied, V. Alonso, C. Griffin, and B. Cirstea.
CoRR, (2023)

Meta data

BibTeX key: journals/corr/abs-2311-04046
entry type: article
year: 2023
journal: CoRR
volume: abs/2311.04046
ee: https://doi.org/10.48550/arXiv.2311.04046
url: http://dblp.uni-trier.de/db/journals/corr/corr2311.html#abs-2311-04046

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on