Author of the publication

Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent.

AISTATS, volume 238 of Proceedings of Machine Learning Research, pages 2206-2214. PMLR, (2024)


Other publications of authors with the same name

MetaCURL: Non-stationary Concave Utility Reinforcement Learning. CoRR, (2024)

Stability and approximation of nonlinear filters in the Hilbert metric, and application to particle filters. CDC, pages 1585-1590. IEEE, (2000)

Monte-Carlo algorithms for a forward Feynman-Kac-type representation for semilinear nonconservative partial differential equations. Monte Carlo Methods Appl., 24 (1): 55-70 (2018)

Large-Scale Nonconvex Optimization: Randomization, Gap Estimation, and Numerical Resolution. SIAM J. Optim., 33 (4): 3083-3113 (December 2023)

A Privacy-preserving Decentralized Algorithm for Distribution Locational Marginal Prices. CDC, pages 4143-4148. IEEE, (2022)

On the Robustness of the Snell Envelope. SIAM J. Financial Math., 2 (1): 587-626 (2011)

Stability and Uniform Particle Approximation of Nonlinear Filters in Case of Non Ergodic Signals. Stochastic Analysis and Applications, 23 (3): 421-448 (2005)

A sequential particle algorithm that keeps the particle system alive. EUSIPCO, pages 1-4. IEEE, (2005)

Approximate Nash Equilibria in Large Nonconvex Aggregative Games. Math. Oper. Res., 48 (3): 1791-1809 (August 2023)

Demand side management in the smart grid: An efficiency and fairness tradeoff. ISGT Europe, pages 1-6. IEEE, (2017)