Author of the publication

CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms.

, , , , , , and . J. Mach. Learn. Res., (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Zephyr: Direct Distillation of LM Alignment., , , , , , , , , and 4 other author(s). CoRR, (2023)Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform., , , , , and . CoRR, (2023)EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine., , , , , , , , , and 2 other author(s). NeurIPS, (2022)Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks., , , , and . CoRR, (2023)A2C is a special case of PPO., , , , , and . CoRR, (2022)Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning., , , , , , , , , and 23 other author(s). CoRR, (2024)CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms., , , , , , and . J. Mach. Learn. Res., (2022)The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization., , , , , and . CoRR, (2024)Comparing Observation and Action Representations for Deep Reinforcement Learning in MicroRTS., and . CoRR, (2019)Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games., and . CoRR, (2020)