Inproceedings

On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems.

P. Su, M. Gasic, N. Mrksic, L. Rojas-Barahona, S. Ultes, D. Vandyke, T. Wen, and S. Young.
ACL (1), The Association for Computer Linguistics (2016)

Metadata

BibTeX key: conf/acl/SuGMRUVWY16
entry type: inproceedings
booktitle: ACL (1)
year: 2016
publisher: The Association for Computer Linguistics
crossref: conf/acl/2016-1
ee: https://aclanthology.org/P16-1230/
isbn: 978-1-945626-00-5
url: http://dblp.uni-trier.de/db/conf/acl/acl2016-1.html#SuGMRUVWY16
