Article,

Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence.

W. Zhan, S. Cen, B. Huang, Y. Chen, J. Lee, and Y. Chi.
SIAM J. Optim., 33 (2): 1061-1091 (June 2023)

Meta data

BibTeX key: journals/siamjo/ZhanCHCLC23
entry type: article
year: 2023
month: June
journal: SIAM J. Optim.
number: 2
pages: 1061-1091
volume: 33
ee: https://doi.org/10.1137/21m1456789
url: http://dblp.uni-trier.de/db/journals/siamjo/siamjo33.html#ZhanCHCLC23

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on