Article,

基于KL散度的策略优化 (KL-divergence-based Policy Optimization).

, , and .
计算机科学, 46 (6): 212-217 (2019)

Meta data

Tags

Users

  • @dblp

Comments and Reviews