From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure., , , , , , , , , и . AAAI, стр. 7078-7086. AAAI Press, (2023)Reinforcing Classical Planning for Adversary Driving Scenarios., , и . CoRR, (2019)Universal Option Models., , , , и . NIPS, стр. 990-998. (2014)Pseudo-MDPs and factored linear action models., , , и . ADPRL, стр. 1-9. IEEE, (2014)Understanding and mitigating the limitations of prioritized experience replay., , , , , , и . UAI, том 180 из Proceedings of Machine Learning Research, стр. 1561-1571. PMLR, (2022)Multi-Step Dyna Planning for Policy Evaluation and Control., , , , и . NIPS, стр. 2187-2195. Curran Associates, Inc., (2009)Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation., , , и . ICML, том 119 из Proceedings of Machine Learning Research, стр. 11204-11213. PMLR, (2020)Breaking the Deadly Triad with a Target Network., , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 12621-12631. PMLR, (2021)Minimal Residual Approaches for Policy Evaluation in Large Sparse Markov Chains., и . ISAIM, (2008)Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Embeddings., , , , и . IJCAI, стр. 860-867. ijcai.org, (2020)Scheduled for July 2020, Yokohama, Japan, postponed due to the Corona pandemic..