Inproceedings,

Off-Policy Temporal Difference Learning with Function Approximation.

, , and .
ICML, page 417-424. Morgan Kaufmann, (2001)

Meta data

Tags

Users

  • @dblp

Comments and Reviews