copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy.

Y. Xie, B. Liu, Q. Liu, Z. Wang, Y. Zhou, and J. Peng. CoRR, (2018)

Links and resources

BibTeX key: journals/corr/abs-1808-00232
entry type: article
year: 2018
journal: CoRR
volume: abs/1808.00232
ee: http://arxiv.org/abs/1808.00232
url: http://dblp.uni-trier.de/db/journals/corr/corr1808.html#abs-1808-00232

Tags

Cite this publication

search on

Meta data

Last update 18 days ago
Created 4 months ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!