Inproceedings,

Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates.

, , , , , , , and .
NeurIPS, page 11872-11882. (2019)

Meta data

Tags

Users

  • @kirk86
  • @dblp

Comments and Reviews