Article,

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues.

G. Bai, J. Liu, X. Bu, Y. He, J. Liu, Z. Zhou, Z. Lin, W. Su, T. Ge, B. Zheng, and W. Ouyang.
CoRR, (2024)

Meta data

BibTeX key: journals/corr/abs-2402-14762
entry type: article
year: 2024
journal: CoRR
volume: abs/2402.14762
ee: https://doi.org/10.48550/arXiv.2402.14762
url: http://dblp.uni-trier.de/db/journals/corr/corr2402.html#abs-2402-14762

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on