Article,

Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence.

, , , , , and .
SIAM J. Optim., 33 (2): 1061-1091 (June 2023)

Meta data

Tags

Users

  • @dblp

Comments and Reviews