Author of the publication

Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space.

, , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 1753-1800. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation., , and . CoRR, (2021)Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space., , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 1753-1800. PMLR, (2023)Contributions to non-convex stochastic optimization and reinforcement learning. (Contributions à l'optimisation stochastique non convexe et à l'apprentissage par renforcement).. Institut Polytechnique de Paris, France, (2021)Independent Learning in Constrained Markov Potential Games., , and . AISTATS, volume 238 of Proceedings of Machine Learning Research, page 4024-4032. PMLR, (2024)Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies., , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 9827-9869. PMLR, (2023)Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation., , and . AISTATS, volume 151 of Proceedings of Machine Learning Research, page 991-1040. PMLR, (2022)Convergence Rates of a Momentum Algorithm with Bounded Adaptive Step Size for Nonconvex Optimization., and . ACML, volume 129 of Proceedings of Machine Learning Research, page 225-240. PMLR, (2020)Convergence of the ADAM algorithm from a Dynamical System Viewpoint., and . CoRR, (2018)Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity., , , and . CDC, page 2602-2609. IEEE, (2023)Policy Mirror Descent with Lookahead., and . CoRR, (2024)