Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling.

, , , and . ICML, volume 119 of Proceedings of Machine Learning Research, pp. 10070-10080. PMLR, (2020)


Other publications by persons with the same name

A Comprehensive Network Restoration Model for Active Distribution Network Considering Forecast Uncertainty., , , , , , and . IEEE Access, (2021)
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning., and . CoRR, (2020)
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past., and . CoRR, (2019)
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning., , , , , , and . CoRR, (2019)
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning., , , , , and . NeurIPS, (2020)
Accurate, Diverse and Multiple Distractor Generation with Mixture of Experts., , and . NLPCC (1), volume 14302 of Lecture Notes in Computer Science, pp. 761-773. Springer, (2023)
Magnetically actuated gearbox for the wireless control of millimeter-scale robots., , , , , , , and . Sci. Robotics, (2022)
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning., , , and . ICLR, OpenReview.net, (2022)
Randomized Ensembled Double Q-Learning: Learning Fast Without a Model., , , and . ICLR, OpenReview.net, (2021)
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance., , , , , and . CoRR, (2021)