Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Uniformly Conservative Exploration in Reinforcement Learning.

W. Xu, Y. Ma, K. Xu, H. Bastani, and O. Bastani. AISTATS, volume 206 of Proceedings of Machine Learning Research, page 10856-10870. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Ian Jason

Jason Miskuly

Jason Crook

Jason Tam

Jason Pazis

Other publications of authors with the same name

SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching.Y. Ma, A. Shen, D. Jayaraman, and O. Bastani. CoRR, (2022)Diverse Sampling for Normalizing Flow Based Trajectory Forecasting.Y. Ma, J. Inala, D. Jayaraman, and O. Bastani. CoRR, (2020)VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training.Y. Ma, S. Sodhani, D. Jayaraman, O. Bastani, V. Kumar, and A. Zhang. ICLR, OpenReview.net, (2023)Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching.Y. Ma, A. Shen, D. Jayaraman, and O. Bastani. ICML, volume 162 of Proceedings of Machine Learning Research, page 14639-14663. PMLR, (2022)Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching.Y. Ma, K. Sivakumar, J. Yan, O. Bastani, and D. Jayaraman. L4DC, volume 211 of Proceedings of Machine Learning Research, page 259-271. PMLR, (2023)DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset.A. Khazatsky, K. Pertsch, S. Nair, A. Balakrishna, S. Dasari, S. Karamcheti, S. Nasiriany, M. Srirama, L. Chen, K. Ellis and 44 other author(s). CoRR, (2024)Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning.Y. Ma, A. Shen, O. Bastani, and D. Jayaraman. CoRR, (2021)State Relevance for Off-Policy Evaluation.S. Shen, Y. Ma, O. Gottesman, and F. Doshi-Velez. ICML, volume 139 of Proceedings of Machine Learning Research, page 9537-9546. PMLR, (2021)Safe Human-Interactive Control via Shielding.J. Inala, Y. Ma, O. Bastani, X. Zhang, and A. Solar-Lezama. CoRR, (2021)Universal Visual Decomposer: Long-Horizon Manipulation Made Easy.Z. Zhang, Y. Li, O. Bastani, A. Gupta, D. Jayaraman, Y. Ma, and L. Weihs. CoRR, (2023)

BibSonomy

Disambiguation of "Ma, Yecheng Jason"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Uniformly Conservative Exploration in Reinforcement Learning.

Please choose a person to relate this publication to

Ian Jason

Jason Miskuly

Jason Crook

Jason Tam

Jason Pazis

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Ma, Yecheng Jason"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Uniformly Conservative Exploration in Reinforcement Learning.

Please choose a person to relate this publication to

Ian Jason

Jason Miskuly

Jason Crook

Jason Tam

Jason Pazis

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Uniformly Conservative Exploration in Reinforcement Learning.