Author of the publication

Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints.

, , and . J. Mach. Learn. Res., (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Model-Free Algorithm and Regret Analysis for MDPs with Peak Constraints., , and . CoRR, (2020)Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach., , , , and . AAAI, page 3682-3689. AAAI Press, (2022)Escaping Saddle Points for Zeroth-order Non-convex Optimization using Estimated Gradient Descent., , and . CISS, page 1-6. IEEE, (2020)Provably Sample-Efficient Model-Free Algorithm for MDPs with Peak Constraints., , and . J. Mach. Learn. Res., (2023)A Reinforcement Learning Framework for Vehicular Network Routing Under Peak and Average Constraints., , , , , , and . IEEE Trans. Veh. Technol., 72 (5): 6753-6764 (May 2023)Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm., , and . CoRR, (2021)Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm., , and . AAAI, page 6737-6744. AAAI Press, (2023)Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm., , and . J. Artif. Intell. Res., (2022)Markov Decision Processes with Long-Term Average Constraints., , and . CoRR, (2021)Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes., , and . AAAI, page 10980-10988. AAAI Press, (2024)