Technical report

Reinforcement Learning with High-Dimensional Continuous Actions

, and .
WL-TR-93-1147. Wright Laboratory, Wright-Patterson Air Force Base (1993)
