Author of the publication

Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning.

G. Dalal, B. Szörényi, G. Thoppe, and S. Mannor. CoRR, (2017)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.

Brinda Dalal

Mira Dalal

Gal Schkolnik

Steffi Gal

Andreas Gál

Other publications of authors with the same name

Finite Sample Analysis for TD(0) with Linear Function Approximation. G. Dalal, B. Szörényi, G. Thoppe, and S. Mannor. CoRR, (2017)

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning. G. Tennenholtz, A. Hallak, G. Dalal, S. Mannor, G. Chechik, and U. Shalit. CoRR, (2021)

On the Products of Stochastic and Diagonal Matrices. A. Hallak, and G. Dalal. CoRR, (2023)

Reinforcement Learning for Datacenter Congestion Control. C. Tessler, Y. Shpigelman, G. Dalal, A. Mandelbaum, D. Kazakov, B. Fuhrer, G. Chechik, and S. Mannor. CoRR, (2021)

Hierarchical Decision Making In Electricity Grid Management. G. Dalal, E. Gilboa, and S. Mannor. ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 2197-2206. JMLR.org, (2016)

How to Combine Tree-Search Methods in Reinforcement Learning. Y. Efroni, G. Dalal, B. Scherrer, and S. Mannor. AAAI, page 3494-3501. AAAI Press, (2019)

Supervised learning for optimal power flow as a real-time proxy. R. Canyasse, G. Dalal, and S. Mannor. ISGT, page 1-5. IEEE, (2017)

Finite Sample Analyses for TD(0) With Function Approximation. G. Dalal, B. Szörényi, G. Thoppe, and S. Mannor. AAAI, page 6144-6160. AAAI Press, (2018)

Distributed scenario-based optimization for asset management in a hierarchical decision making environment. G. Dalal, E. Gilboa, and S. Mannor. PSCC, page 1-9. IEEE, (2016)

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search. G. Dalal, A. Hallak, G. Thoppe, S. Mannor, and G. Chechik. CoRR, (2023)

BibSonomy

Disambiguation of "Dalal, Gal"
