From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations.

H. Xu, X. Zhan, H. Yin, и H. Qin. ICML, том 162 из Proceedings of Machine Learning Research, стр. 24725-24742. PMLR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Zhan Kang

Zhan Gao

Renya Zhan

Beibei Zhan

Zhan Qiu

Другие публикации лиц с тем же именем

A Policy-Guided Imitation Approach for Offline Reinforcement Learning.H. Xu, L. Jiang, J. Li, и X. Zhan. NeurIPS, (2022)Model-Based Offline Planning with Trajectory Pruning.X. Zhan, X. Zhu, и H. Xu. IJCAI, стр. 3716-3722. ijcai.org, (2022)Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic.T. Ji, Y. Luo, F. Sun, X. Zhan, J. Zhang, и H. Xu. CoRR, (2023)Curriculum Goal-Conditioned Imitation for Offline Reinforcement Learning.X. Feng, L. Jiang, X. Yu, H. Xu, X. Sun, J. Wang, X. Zhan, и W. Chan. IEEE Trans. Games, 16 (1): 102-112 (марта 2024)OpenChat: Advancing Open-source Language Models with Mixed-Quality Data.G. Wang, S. Cheng, X. Zhan, X. Li, S. Song, и Y. Liu. CoRR, (2023)Network-Wide Traffic States Imputation Using Self-interested Coalitional Learning.H. Qin, X. Zhan, Y. Li, X. Yang, и Y. Zheng. KDD, стр. 1370-1378. ACM, (2021)DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning.X. Zhan, H. Xu, Y. Zhang, X. Zhu, H. Yin, и Y. Zheng. AAAI, стр. 4680-4688. AAAI Press, (2022)A Century of Topological Coevolution of Complex Infrastructure Networks in an Alpine City.J. Zischg, C. Klinkhamer, X. Zhan, P. Rao, и R. Sitzenfrei. Complex., (2019)A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning.J. Li, S. Lin, T. Shi, C. Tian, Y. Mei, J. Song, X. Zhan, и R. Li. CoRR, (2023)Mind the Gap: Offline Policy Optimization for Imperfect Rewards.J. Li, X. Hu, H. Xu, J. Liu, X. Zhan, Q. Jia, и Y. Zhang. ICLR, OpenReview.net, (2023)

BibSonomy

Disambiguation