Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP.

Z. Zhang, J. Yang, X. Ji, and S. Du. CoRR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Simon S Lee

Marcus Simon

Erstellung und Validierung eines Fragebogens für die Patientenbeurteilung der perioperativen Phase, PPP-FragebogenM. Simon. Uni Marburg, (2009)

Christine Simon

Vollblutaggregation beim Hund: Vergleich der Methode nach Born mit einer errechneten Aggregation nach Thrombozytenzählung mittels konventioneller HämatologiesystemeC. Simon. Uni Gießen, (2009)

Juan Du

Fei Du

Other publications of authors with the same name

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph KernelsS. Du, K. Hou, R. Salakhutdinov, B. Poczos, R. Wang, and K. Xu. (2019)Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?S. Du, S. Kakade, R. Wang, and L. Yang. (2019)cite arxiv:1910.03016.When is particle filtering efficient for planning in partially observed linear dynamical systems?S. Du, W. Hu, Z. Li, R. Shen, Z. Song, and J. Wu. UAI, volume 161 of Proceedings of Machine Learning Research, page 728-737. AUAI Press, (2021)Q-learning with Logarithmic Regret.K. Yang, L. Yang, and S. Du. CoRR, (2020)Provable Representation Learning for Imitation Learning via Bi-level Optimization.S. Arora, S. Du, S. Kakade, Y. Luo, and N. Saunshi. ICML, volume 119 of Proceedings of Machine Learning Research, page 367-376. PMLR, (2020)Q-learning with Logarithmic Regret.K. Yang, L. Yang, and S. Du. AISTATS, volume 130 of Proceedings of Machine Learning Research, page 1576-1584. PMLR, (2021)Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games.Y. Zhao, Y. Tian, J. Lee, and S. Du. AISTATS, volume 151 of Proceedings of Machine Learning Research, page 2736-2761. PMLR, (2022)Efficient Nonparametric Smoothness Estimation.S. Singh, S. Du, and B. Póczos. NIPS, page 1010-1018. (2016)Hypothesis Transfer Learning via Transformation Functions.S. Du, J. Koushik, A. Singh, and B. Póczos. NIPS, page 574-584. (2017)On the Power of Truncated SVD for General High-rank Matrix Estimation Problems.S. Du, Y. Wang, and A. Singh. NIPS, page 445-455. (2017)

BibSonomy

Disambiguation of "Du, Simon S."

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP.

Please choose a person to relate this publication to

Simon S Lee

Marcus Simon

Christine Simon

Juan Du

Fei Du

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Du, Simon S."

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP.

Please choose a person to relate this publication to

Simon S Lee

Marcus Simon

Christine Simon

Juan Du

Fei Du

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP.