Author of the publication

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.

AAAI, page 11390-11398. AAAI Press, (2024)

Other publications of authors with the same name

Recognition in-the-Tail: Training Detectors for Unusual Pedestrians with Synthetic Imposters. CoRR, (2017)

Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning. Nat. Mac. Intell., 5 (7): 714-723 (July 2023)

Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models. CoRR, (2023)

TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations. CoRR, (2021)

Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic. PRICAI (3), volume 13033 of Lecture Notes in Computer Science, page 46-59. Springer, (2021)

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization. AAAI, page 11390-11398. AAAI Press, (2024)

MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment. CoRR, (2024)

SVQN: Sequential Variational Soft Q-Learning Networks. ICLR, OpenReview.net, (2020)

Deep reinforcement learning with credit assignment for combinatorial optimization. Pattern Recognit., (2022)

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks. CoRR, (2023)