Inproceedings

On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems.

, , , , , , , and .
ACL (1), The Association for Computer Linguistics, (2016)
