Author of the publication

First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach.

, , , , and . CoRR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Rotation-Invariant Convolutional Neural Network for Image Enhancement Forensics., , , and . ICASSP, page 2111-2115. IEEE, (2018)An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models., , , , , , , , , and 2 other author(s). CoRR, (2024)An Ultrasonic Laminated Transducer for Viscoelastic Media Detection., , , , , , and . Sensors, 21 (21): 7188 (2021)More Practical and Adaptive Algorithms for Online Quantum State Learning., and . CoRR, (2020)Double Compression Detection Based on the De-Blocking Filtering of HEVC Videos., , , , and . ICASSP, page 1-5. IEEE, (2023)The Fair Contextual Multi-Armed Bandit., , , , , and . AAMAS, page 1810-1812. International Foundation for Autonomous Agents and Multiagent Systems, (2020)Achieving Optimal Dynamic Regret for Non-stationary Bandits without Prior Information., , , , , , and . COLT, volume 99 of Proceedings of Machine Learning Research, page 159-163. PMLR, (2019)Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning., , , , and . CoRR, (2024)Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler., , , , , , , , and . CoRR, (2022)Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes., , , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 22430-22456. PMLR, (2022)