Inproceedings,

Deterministic MDPs with Adversarial Rewards and Bandit Feedback.

UAI, pages 93-101. AUAI Press, 2012.
