From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Distributional Reinforcement Learning with Quantile Regression., , , и . CoRR, (2017)Adaptive Trade-Offs in Off-Policy Learning., , и . AISTATS, том 108 из Proceedings of Machine Learning Research, стр. 34-44. PMLR, (2020)Conditional Importance Sampling for Off-Policy Learning., , , , , , и . AISTATS, том 108 из Proceedings of Machine Learning Research, стр. 45-55. PMLR, (2020)Meta-learning of Sequential Strategies., , , , , , , , , и 14 other автор(ы). CoRR, (2019)Human Alignment of Large Language Models through Online Preference Optimisation., , , , , , , , , и 3 other автор(ы). CoRR, (2024)Nash Learning from Human Feedback., , , , , , , , , и 7 other автор(ы). CoRR, (2023)MICo: Learning improved representations via sampling-based state similarity for Markov decision processes., , , и . CoRR, (2021)Geometrically Coupled Monte Carlo Sampling., , , , , , и . NeurIPS, стр. 195-205. (2018)On the Effect of Auxiliary Tasks on Representation Dynamics., , , и . AISTATS, том 130 из Proceedings of Machine Learning Research, стр. 1-9. PMLR, (2021)Marginalized Operators for Off-policy Reinforcement Learning., , , и . AISTATS, том 151 из Proceedings of Machine Learning Research, стр. 655-679. PMLR, (2022)