Inproceedings,

Evaluating Conversational Recommender Systems via User Simulation

S. Zhang, and K. Balog.
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery &amp$\mathsemicolon$ Data Mining, page 1512-1520. ACM, (August 2020)
DOI: 10.1145/3394486.3403202

Abstract

Conversational information access is an emerging research area. Currently, human evaluation is used for end-to-end system evaluation, which is both very time and resource intensive at scale, and thus becomes a bottleneck of progress. As an alternative, we propose automated evaluation by means of simulating users. Our user simulator aims to generate responses that a real human would give by considering both individual preferences and the general flow of interaction with the system. We evaluate our simulation approach on an item recommendation task by comparing three existing conversational recommender systems. We show that preference modeling and task-specific interaction models both contribute to more realistic simulations, and can help achieve high correlation between automatic evaluation measures and manual human assessments.

BibTeX key: Zhang_2020
entry type: inproceedings
booktitle: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery &amp$\mathsemicolon$ Data Mining
year: 2020
month: aug
pages: 1512-1520
publisher: ACM
DOI: 10.1145/3394486.3403202
url: https://doi.org/10.1145%2F3394486.3403202

BibSonomy

Evaluating Conversational Recommender Systems via User Simulation

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on