Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

Other publications of authors with the same name

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal. CoRR, (2022)

An Analysis of Quantile Temporal-Difference Learning. CoRR, (2023)

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning. CoRR, (2022)

Concave Utility Reinforcement Learning: the Mean-field Game viewpoint. CoRR, (2021)

Combining policy gradient and Q-learning. ICLR (Poster), OpenReview.net, (2017)

Sample Efficient Actor-Critic with Experience Replay. ICLR (Poster), OpenReview.net, (2017)

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation. J. Mach. Learn. Res., (2006)

Minimax Regret Bounds for Reinforcement Learning. ICML, volume 70 of Proceedings of Machine Learning Research, pages 263-272. PMLR, (2017)

Leverage the Average: an Analysis of Regularization in RL. CoRR, (2020)

An Analysis of Categorical Distributional Reinforcement Learning. AISTATS, volume 84 of Proceedings of Machine Learning Research, pages 29-37. PMLR, (2018)