Inproceedings,

Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems.

P. Su, D. Vandyke, M. Gasic, N. Mrksic, T. Wen, and S. Young.
SIGDIAL Conference, page 417-421. The Association for Computer Linguistics, (2015)

Meta data

BibTeX key: conf/sigdial/SuVGMWY15
entry type: inproceedings
booktitle: SIGDIAL Conference
year: 2015
pages: 417-421
publisher: The Association for Computer Linguistics
crossref: conf/sigdial/2015
ee: https://aclanthology.org/W15-4655/
isbn: 978-1-941643-75-4
url: http://dblp.uni-trier.de/db/conf/sigdial/sigdial2015.html#SuVGMWY15

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on