Author of the publication

Trust Region Policy Optimization.

ICML, volume 37 of JMLR Workshop and Conference Proceedings, pages 1889-1897. JMLR.org, (2015)

Other publications of authors with the same name

RLlib: Abstractions for Distributed Reinforcement Learning. ICML, volume 80 of Proceedings of Machine Learning Research, pages 3059-3068. PMLR, (2018)
Real-Time Machine Learning: The Missing Pieces. HotOS, pages 106-110. ACM, (2017)
Discriminating between causal structures in Bayesian Networks given partial observations. Kybernetika, 50 (2): 284-295 (2014)
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees. CoRR, (2019)
ESCHER: expressive scheduling with ephemeral resources. SoCC, pages 47-62. ACM, (2022)
A Linearly-Convergent Stochastic L-BFGS Algorithm. AISTATS, volume 51 of JMLR Workshop and Conference Proceedings, pages 249-258. JMLR.org, (2016)
Ray: A Distributed Framework for Emerging AI Applications. CoRR, (2017)
Hoplite: Efficient Collective Communication for Task-Based Distributed Systems. CoRR, (2020)
Ray: A Distributed Execution Engine for the Machine Learning Ecosystem. University of California, Berkeley, USA, (2019)
Ray: A Distributed Framework for Emerging AI Applications. OSDI, pages 561-577. USENIX Association, (2018)