Author of the publication

Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning.

, , , and . CoRR, (2017)

Other publications of authors with the same name

Finite Sample Analysis for TD(0) with Linear Function Approximation., , , and . CoRR, (2017)

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning., , , , , and . CoRR, (2021)

On the Products of Stochastic and Diagonal Matrices., and . CoRR, (2023)

Reinforcement Learning for Datacenter Congestion Control., , , , , , , and . CoRR, (2021)

Hierarchical Decision Making In Electricity Grid Management., , and . ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 2197-2206. JMLR.org, (2016)

How to Combine Tree-Search Methods in Reinforcement Learning., , , and . AAAI, page 3494-3501. AAAI Press, (2019)

Supervised learning for optimal power flow as a real-time proxy., , and . ISGT, page 1-5. IEEE, (2017)

Finite Sample Analyses for TD(0) With Function Approximation., , , and . AAAI, page 6144-6160. AAAI Press, (2018)

Distributed scenario-based optimization for asset management in a hierarchical decision making environment., , and . PSCC, page 1-9. IEEE, (2016)

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search., , , , and . CoRR, (2023)