Author of the publication

On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning.

ICLR, OpenReview.net, (2022)


Other publications of authors with the same name

Magnetically actuated gearbox for the wireless control of millimeter-scale robots. Sci. Robotics, (2022)
Randomized Ensembled Double Q-Learning: Learning Fast Without a Model. ICLR, OpenReview.net, (2021)
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning. ICLR, OpenReview.net, (2022)
Dynamic constrained multi-objective optimization algorithm based on co-evolution and diversity enhancement. Swarm Evol. Comput., (2024)
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning. CoRR, (2020)
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning. NeurIPS, (2020)
Accurate, Diverse and Multiple Distractor Generation with Mixture of Experts. NLPCC (1), volume 14302 of Lecture Notes in Computer Science, page 761-773. Springer, (2023)
Portfolio Online Evolution in StarCraft. AIIDE, page 114-121. AAAI Press, (2016)
Multiple Frequency Bands Temporal State Representation for Deep Reinforcement Learning. CACML, page 309-315. ACM, (2023)
Reinforcement Learning with Automated Auxiliary Loss Search. NeurIPS, (2022)