Article,

Rlaif: Scaling reinforcement learning from human feedback with ai feedback

, , , , , , , and .
arXiv preprint arXiv:2309.00267, (2023)

Meta data

Tags

Users

  • @albinzehe
  • @dblp

Comments and Reviews