Author of the publication

First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach.

A. Wagenmaker, Y. Chen, M. Simchowitz, S. Du, and K. Jamieson. CoRR, (2021)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Yifang Cui

Yifang Sun

Chen Chen

Shih Chen Chen

Other publications of authors with the same name

A Rotation-Invariant Convolutional Neural Network for Image Enhancement Forensics. Y. Chen, Z. Lyu, X. Kang, and Z. Wang. ICASSP, page 2111-2115. IEEE, (2018)

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models. G. Bhatt, Y. Chen, A. Das, J. Zhang, S. Truong, S. Mussmann, Y. Zhu, J. Bilmes, S. Du, K. Jamieson and 2 other author(s). CoRR, (2024)

An Ultrasonic Laminated Transducer for Viscoelastic Media Detection. S. Yang, W. Song, Y. Chen, L. Yang, M. Wang, Y. Lian, and K. Liu. Sensors, 21 (21): 7188 (2021)

More Practical and Adaptive Algorithms for Online Quantum State Learning. Y. Chen, and X. Wang. CoRR, (2020)

Double Compression Detection Based on the De-Blocking Filtering of HEVC Videos. X. Kang, P. Su, Z. Huang, Y. Chen, and J. Wang. ICASSP, page 1-5. IEEE, (2023)

The Fair Contextual Multi-Armed Bandit. Y. Chen, A. Cuellar, H. Luo, J. Modi, H. Nemlekar, and S. Nikolaidis. AAMAS, page 1810-1812. International Foundation for Autonomous Agents and Multiagent Systems, (2020)

Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information. P. Auer, Y. Chen, P. Gajane, C. Lee, H. Luo, R. Ortner, and C. Wei. COLT, volume 99 of Proceedings of Machine Learning Research, page 159-163. PMLR, (2019)

Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning. Y. Wang, Y. Chen, W. Yan, K. Jamieson, and S. Du. CoRR, (2024)

Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler. Y. Chen, K. Sankararaman, A. Lazaric, M. Pirotta, D. Karamshuk, Q. Wang, K. Mandyam, S. Wang, and H. Fang. CoRR, (2022)

Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes. A. Wagenmaker, Y. Chen, M. Simchowitz, S. Du, and K. Jamieson. ICML, volume 162 of Proceedings of Machine Learning Research, page 22430-22456. PMLR, (2022)

BibSonomy

Disambiguation of "Chen, Yifang"
