From post


The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation.

M. Rowland, Y. Tang, C. Lyle, R. Munos, M. Bellemare, and W. Dabney. ICML, volume 202 of Proceedings of Machine Learning Research, pp. 29210-29231. PMLR, (2023)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

Herbert Rowland

Rowland Onyenali

Rowland Lassen

Rowland Enyinnaya Eruba

Rowland Nii-Adjei Otchwemah

Other publications of persons with the same name

Adaptive Trade-Offs in Off-Policy Learning. M. Rowland, W. Dabney, and R. Munos. CoRR, (2019)

Antithetic and Monte Carlo kernel estimators for partial rankings. M. Lomeli, M. Rowland, A. Gretton, and Z. Ghahramani. Stat. Comput., 29 (5): 1127-1147 (2019)

A General Theoretical Paradigm to Understand Learning from Human Preferences. M. Azar, M. Rowland, B. Piot, D. Guo, D. Calandriello, M. Valko, and R. Munos. CoRR, (2023)

Distributional Bellman Operators over Mean Embeddings. L. Wenliang, G. Delétang, M. Aitchison, M. Hutter, A. Ruoss, A. Gretton, and M. Rowland. CoRR, (2023)

α-Rank: Multi-Agent Evaluation by Evolution. S. Omidshafiei, C. Papadimitriou, G. Piliouras, K. Tuyls, M. Rowland, J. Lespiau, W. Czarnecki, M. Lanctot, J. Pérolat, and R. Munos. CoRR, (2019)

Orthogonal Estimation of Wasserstein Distances. M. Rowland, J. Hron, Y. Tang, K. Choromanski, T. Sarlós, and A. Weller. AISTATS, volume 89 of Proceedings of Machine Learning Research, pp. 186-195. PMLR, (2019)

Unifying Orthogonal Monte Carlo Methods. K. Choromanski, M. Rowland, W. Chen, and A. Weller. ICML, volume 97 of Proceedings of Machine Learning Research, pp. 1203-1212. PMLR, (2019)

Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model. M. Rowland, L. Wenliang, R. Munos, C. Lyle, Y. Tang, and W. Dabney. CoRR, (2024)

The Value-Improvement Path: Towards Better Representations for Reinforcement Learning. W. Dabney, A. Barreto, M. Rowland, R. Dadashi, J. Quan, M. Bellemare, and D. Silver. CoRR, (2020)

From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization. J. Pérolat, R. Munos, J. Lespiau, S. Omidshafiei, M. Rowland, P. Ortega, N. Burch, T. Anthony, D. Balduzzi, B. Vylder, and 3 other author(s). ICML, volume 139 of Proceedings of Machine Learning Research, pp. 8525-8535. PMLR, (2021)
