Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space.

A. Barakat, I. Fatkhullin, and N. He. ICML, volume 202 of Proceedings of Machine Learning Research, page 1753-1800. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Safwan Barakat

Lina Barakat

Anas Almharat

Anas Elhag

Fuad Barakat

Other publications of authors with the same name

Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation.A. Barakat, P. Bianchi, and J. Lehmann. CoRR, (2021)Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space.A. Barakat, I. Fatkhullin, and N. He. ICML, volume 202 of Proceedings of Machine Learning Research, page 1753-1800. PMLR, (2023)Contributions to non-convex stochastic optimization and reinforcement learning. (Contributions à l'optimisation stochastique non convexe et à l'apprentissage par renforcement).A. Barakat. Institut Polytechnique de Paris, France, (2021)Independent Learning in Constrained Markov Potential Games.P. Jordan, A. Barakat, and N. He. AISTATS, volume 238 of Proceedings of Machine Learning Research, page 4024-4032. PMLR, (2024)Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies.I. Fatkhullin, A. Barakat, A. Kireeva, and N. He. ICML, volume 202 of Proceedings of Machine Learning Research, page 9827-9869. PMLR, (2023)Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation.A. Barakat, P. Bianchi, and J. Lehmann. AISTATS, volume 151 of Proceedings of Machine Learning Research, page 991-1040. PMLR, (2022)Convergence Rates of a Momentum Algorithm with Bounded Adaptive Step Size for Nonconvex Optimization.A. Barakat, and P. Bianchi. ACML, volume 129 of Proceedings of Machine Learning Research, page 225-240. PMLR, (2020)Convergence of the ADAM algorithm from a Dynamical System Viewpoint.A. Barakat, and P. Bianchi. CoRR, (2018)Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity.J. Wu, A. Barakat, I. Fatkhullin, and N. He. CDC, page 2602-2609. IEEE, (2023)Policy Mirror Descent with Lookahead.K. Protopapas, and A. Barakat. CoRR, (2024)

BibSonomy

Disambiguation of "Barakat, Anas"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space.

Please choose a person to relate this publication to

Safwan Barakat

Lina Barakat

Anas Almharat

Anas Elhag

Fuad Barakat

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Barakat, Anas"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space.

Please choose a person to relate this publication to

Safwan Barakat

Lina Barakat

Anas Almharat

Anas Elhag

Fuad Barakat

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space.