Author of the publication

Uniformly Conservative Exploration in Reinforcement Learning.

, , , , and . AISTATS, volume 206 of Proceedings of Machine Learning Research, page 10856-10870. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching., , , and . CoRR, (2022)Diverse Sampling for Normalizing Flow Based Trajectory Forecasting., , , and . CoRR, (2020)VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training., , , , , and . ICLR, OpenReview.net, (2023)Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching., , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 14639-14663. PMLR, (2022)Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching., , , , and . L4DC, volume 211 of Proceedings of Machine Learning Research, page 259-271. PMLR, (2023)DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset., , , , , , , , , and 44 other author(s). CoRR, (2024)Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning., , , and . CoRR, (2021)State Relevance for Off-Policy Evaluation., , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 9537-9546. PMLR, (2021)Safe Human-Interactive Control via Shielding., , , , and . CoRR, (2021)Universal Visual Decomposer: Long-Horizon Manipulation Made Easy., , , , , , and . CoRR, (2023)