From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future., , , , , , и . CoRR, (2019)SVRG for Policy Evaluation with Fewer Gradient Evaluations., , , и . IJCAI, стр. 2697-2703. ijcai.org, (2020)Scheduled for July 2020, Yokohama, Japan, postponed due to the Corona pandemic..Stable Policy Optimization via Off-Policy Divergence Regularization., , , и . UAI, том 124 из Proceedings of Machine Learning Research, стр. 1328-1337. AUAI Press, (2020)Real-time privacy-preserving model-based estimation of traffic flows., , и . ICCPS, стр. 92-102. IEEE Computer Society, (2014)Adversarial Divergences are Good Task Losses for Generative Modeling., , , , и . CoRR, (2017)Parametric Adversarial Divergences are Good Task Losses for Generative Modeling., , , , , и . ICLR (Workshop), OpenReview.net, (2018)Does Zero-Shot Reinforcement Learning Exist?, , и . CoRR, (2022)Separable value functions across time-scales., , , , , и . ICML, том 97 из Proceedings of Machine Learning Research, стр. 5468-5477. PMLR, (2019)Does Zero-Shot Reinforcement Learning Exist?, , и . ICLR, OpenReview.net, (2023)Score Models for Offline Goal-Conditioned Reinforcement Learning., , , , , и . ICLR, OpenReview.net, (2024)