Article

Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate.

S. Chern, E. Chern, G. Neubig, and P. Liu.
CoRR, (2024)

Metadata

BibTeX key: journals/corr/abs-2401-16788
entry type: article
year: 2024
journal: CoRR
volume: abs/2401.16788
ee: https://doi.org/10.48550/arXiv.2401.16788
url: http://dblp.uni-trier.de/db/journals/corr/corr2401.html#abs-2401-16788

Tags

dblp
