From post

The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation.

, , , , , и . ICML, том 202 из Proceedings of Machine Learning Research, стр. 29210-29231. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Adaptive Trade-Offs in Off-Policy Learning., , и . CoRR, (2019)Antithetic and Monte Carlo kernel estimators for partial rankings., , , и . Stat. Comput., 29 (5): 1127-1147 (2019)A General Theoretical Paradigm to Understand Learning from Human Preferences., , , , , , и . CoRR, (2023)Distributional Bellman Operators over Mean Embeddings., , , , , , и . CoRR, (2023)α-Rank: Multi-Agent Evaluation by Evolution., , , , , , , , , и . CoRR, (2019)Orthogonal Estimation of Wasserstein Distances., , , , , и . AISTATS, том 89 из Proceedings of Machine Learning Research, стр. 186-195. PMLR, (2019)Unifying Orthogonal Monte Carlo Methods., , , и . ICML, том 97 из Proceedings of Machine Learning Research, стр. 1203-1212. PMLR, (2019)Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model., , , , , и . CoRR, (2024)The Value-Improvement Path: Towards Better Representations for Reinforcement Learning., , , , , , и . CoRR, (2020)From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization., , , , , , , , , и 3 other автор(ы). ICML, том 139 из Proceedings of Machine Learning Research, стр. 8525-8535. PMLR, (2021)