Article

Temporal-difference emphasis learning with regularized correction for off-policy evaluation and control.

J. Cao, Q. Liu, L. Wu, Q. Fu, and S. Zhong.
Appl. Intell., 53 (18): 20917-20937 (September 2023)

Metadata

BibTeX key: journals/apin/CaoLWFZ23
entry type: article
year: 2023
month: September
journal: Appl. Intell.
number: 18
pages: 20917-20937
volume: 53
ee: https://doi.org/10.1007/s10489-023-04579-4
url: http://dblp.uni-trier.de/db/journals/apin/apin53.html#CaoLWFZ23

Tags

dblp
