Author of the publication

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP.

, , , and . CoRR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels, , , , , and . (2019)Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?, , , and . (2019)cite arxiv:1910.03016.When is particle filtering efficient for planning in partially observed linear dynamical systems?, , , , , and . UAI, volume 161 of Proceedings of Machine Learning Research, page 728-737. AUAI Press, (2021)Q-learning with Logarithmic Regret., , and . CoRR, (2020)Provable Representation Learning for Imitation Learning via Bi-level Optimization., , , , and . ICML, volume 119 of Proceedings of Machine Learning Research, page 367-376. PMLR, (2020)Q-learning with Logarithmic Regret., , and . AISTATS, volume 130 of Proceedings of Machine Learning Research, page 1576-1584. PMLR, (2021)Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games., , , and . AISTATS, volume 151 of Proceedings of Machine Learning Research, page 2736-2761. PMLR, (2022)Efficient Nonparametric Smoothness Estimation., , and . NIPS, page 1010-1018. (2016)Hypothesis Transfer Learning via Transformation Functions., , , and . NIPS, page 574-584. (2017)On the Power of Truncated SVD for General High-rank Matrix Estimation Problems., , and . NIPS, page 445-455. (2017)