Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling.

C. Wang, Y. Wu, Q. Vuong, and K. Ross. ICML, volume 119 of Proceedings of Machine Learning Research, pp. 10070-10080. PMLR, (2020)

Other publications by persons with the same name

A Comprehensive Network Restoration Model for Active Distribution Network Considering Forecast Uncertainty. G. Wang, X. Lei, H. Wu, K. Sun, L. Wang, Y. Ding, and C. Wang. IEEE Access, (2021)
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning. C. Wang, and K. Ross. CoRR, (2020)
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past. C. Wang, and K. Ross. CoRR, (2019)
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning. X. Chen, Z. Zhou, Z. Wang, C. Wang, Y. Wu, Q. Deng, and K. Ross. CoRR, (2019)
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning. X. Chen, Z. Zhou, Z. Wang, C. Wang, Y. Wu, and K. Ross. NeurIPS, (2020)
Accurate, Diverse and Multiple Distractor Generation with Mixture of Experts. F. Qu, C. Wang, and Y. Wu. NLPCC (1), volume 14302 of Lecture Notes in Computer Science, pp. 761-773. Springer, (2023)
Magnetically actuated gearbox for the wireless control of millimeter-scale robots. C. Hong, Z. Ren, C. Wang, M. Li, Y. Wu, D. Tang, W. Hu, and M. Sitti. Sci. Robotics, (2022)
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning. C. Wang, S. Yuan, K. Shao, and K. Ross. ICLR, OpenReview.net, (2022)
Randomized Ensembled Double Q-Learning: Learning Fast Without a Model. X. Chen, C. Wang, Z. Zhou, and K. Ross. ICLR, OpenReview.net, (2021)
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance. Y. Wu, X. Chen, C. Wang, Y. Zhang, Z. Zhou, and K. Ross. CoRR, (2021)