
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations.

ICML, volume 162 of Proceedings of Machine Learning Research, pp. 24725-24742. PMLR, (2022)


Other publications by persons with the same name

A Policy-Guided Imitation Approach for Offline Reinforcement Learning. NeurIPS, (2022)

Model-Based Offline Planning with Trajectory Pruning. IJCAI, pp. 3716-3722. ijcai.org, (2022)

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic. CoRR, (2023)

Curriculum Goal-Conditioned Imitation for Offline Reinforcement Learning. IEEE Trans. Games, 16 (1): 102-112 (March 2024)

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data. CoRR, (2023)

Network-Wide Traffic States Imputation Using Self-interested Coalitional Learning. KDD, pp. 1370-1378. ACM, (2021)

DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning. AAAI, pp. 4680-4688. AAAI Press, (2022)

A Century of Topological Coevolution of Complex Infrastructure Networks in an Alpine City. Complex., (2019)

A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning. CoRR, (2023)

Mind the Gap: Offline Policy Optimization for Imperfect Rewards. ICLR, OpenReview.net, (2023)