Author of the publication

Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration.

, , and . AAAI, page 6566-6573. AAAI Press, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Nonstationary Reinforcement Learning with Linear Function Approximation., , , and . Trans. Mach. Learn. Res., (2022)Reinforcement Learning in Low-rank MDPs with Density Features., , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 13710-13752. PMLR, (2023)Information-Theoretic Considerations in Batch Reinforcement Learning., and . ICML, volume 97 of Proceedings of Machine Learning Research, page 1042-1051. PMLR, (2019)Accelerating Nonconvex Learning via Replica Exchange Langevin diffusion., , , , and . ICLR (Poster), OpenReview.net, (2019)Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality., , , , , and . ICLR, OpenReview.net, (2022)Offline reinforcement learning under value and density-ratio realizability: The power of gaps., and . UAI, volume 180 of Proceedings of Machine Learning Research, page 378-388. PMLR, (2022)Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration., , and . AAAI, page 6566-6573. AAAI Press, (2021)On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL., , , , and . NeurIPS, (2022)Extended Abstract: Learning in Low-rank MDPs with Density Features., , and . CISS, page 1-3. IEEE, (2023)DTAD: A Dynamic Taint Analysis Detector for Information Security., , , , , and . WAIM, page 591-597. IEEE Computer Society, (2008)