Article,

Provably Robust DPO: Aligning Language Models with Noisy Feedback.

S. Chowdhury, A. Kini, and N. Natarajan.
CoRR, (2024)

Meta data

BibTeX key: journals/corr/abs-2403-00409
entry type: article
year: 2024
journal: CoRR
volume: abs/2403.00409
ee: https://doi.org/10.48550/arXiv.2403.00409
url: http://dblp.uni-trier.de/db/journals/corr/corr2403.html#abs-2403-00409

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on