Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.

W. Chen, S. Huang, Y. Chiang, T. Pearce, W. Tu, T. Chen, and J. Zhu. AAAI, page 11390-11398. AAAI Press, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Shiyu Zhang

Shiyu Song

Shiyu Yang

Haishi Huang

Zhida Huang

Other publications of authors with the same name

Recognition in-the-Tail: Training Detectors for Unusual Pedestrians with Synthetic Imposters.S. Huang, and D. Ramanan. CoRR, (2017)Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning.Y. Lin, Q. Zhang, B. Gao, J. Tang, P. Yao, C. Li, S. Huang, Z. Liu, Y. Zhou, Y. Liu and 4 other author(s). Nat. Mac. Intell., 5 (7): 714-723 (July 2023)Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models.H. Song, S. Huang, Y. Dong, and W. Tu. CoRR, (2023)TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations.S. Huang, W. Chen, L. Zhang, Z. Li, F. Zhu, D. Ye, T. Chen, and J. Zhu. CoRR, (2021)Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic.S. Huang, B. Wang, H. Su, D. Li, J. Hao, J. Zhu, and T. Chen. PRICAI (3), volume 13033 of Lecture Notes in Computer Science, page 46-59. Springer, (2021)DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.W. Chen, S. Huang, Y. Chiang, T. Pearce, W. Tu, T. Chen, and J. Zhu. AAAI, page 11390-11398. AAAI Press, (2024)MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment.Z. Xiong, B. Chen, S. Huang, W. Tu, Z. He, and Y. Gao. CoRR, (2024)SVQN: Sequential Variational Soft Q-Learning Networks.S. Huang, H. Su, J. Zhu, and T. Chen. ICLR, OpenReview.net, (2020)Deep reinforcement learning with credit assignment for combinatorial optimization.D. Yan, J. Weng, S. Huang, C. Li, Y. Zhou, H. Su, and J. Zhu. Pattern Recognit., (2022)SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.B. Lin, Y. Fu, K. Yang, P. Ammanabrolu, F. Brahman, S. Huang, C. Bhagavatula, Y. Choi, and X. Ren. CoRR, (2023)

BibSonomy

Disambiguation of "Huang, Shiyu"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.

Please choose a person to relate this publication to

Shiyu Zhang

Shiyu Song

Shiyu Yang

Haishi Huang

Zhida Huang

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Huang, Shiyu"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.

Please choose a person to relate this publication to

Shiyu Zhang

Shiyu Song

Shiyu Yang

Haishi Huang

Zhida Huang

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization.