Inproceedings,

Sample-Efficient Reinforcement Learning Based on Dynamics Models via Meta-policy Optimization.

, , , and .
ICCSIP, volume 1515 of Communications in Computer and Information Science, page 360-373. Springer, (2021)

Meta data

Tags

Users

  • @dblp

Comments and Reviews