Author of the publication

Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information.

Y. Zhou, J. Li, and J. Zhu. ICLR, OpenReview.net, (2020)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

Zhou Zhou

Jun Zhou

Wei Zhou

Dayu Zhou

Other publications of authors with the same name

Lazy-CFR: fast and near-optimal regret minimization for extensive games with imperfect information. Y. Zhou, T. Ren, J. Li, D. Yan, and J. Zhu. ICLR, OpenReview.net, (2020)
Selective Verification Strategy for Learning From Crowds. T. Tian, Y. Zhou, and J. Zhu. AAAI, page 4147-4154. AAAI Press, (2018)
Online Label Aggregation: A Variational Bayesian Approach. C. Hong, A. Ghiassi, Y. Zhou, R. Birke, and L. Chen. WWW, page 1904-1915. ACM / IW3C2, (2021)
Identify the Nash Equilibrium in Static Games with Random Payoffs. Y. Zhou, J. Li, and J. Zhu. ICML, volume 70 of Proceedings of Machine Learning Research, page 4160-4169. PMLR, (2017)
Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback. F. Kong, Y. Zhou, and S. Li. ICML, volume 162 of Proceedings of Machine Learning Research, page 11473-11482. PMLR, (2022)
Exploration Analysis in Finite-Horizon Turn-based Stochastic Games. J. Li, Y. Zhou, T. Ren, and J. Zhu. UAI, volume 124 of Proceedings of Machine Learning Research, page 201-210. AUAI Press, (2020)
Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process. X. Zhou, L. Wang, and Y. Zhou. ICML, OpenReview.net, (2024)
Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information. Y. Zhou, J. Li, and J. Zhu. ICLR, OpenReview.net, (2020)
Regularized OFU: an Efficient UCB Estimator for Non-linear Contextual Bandit. Y. Zhou, S. Song, H. Zhang, J. Zhu, W. Chen, and T. Liu. CoRR, (2021)
Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors. Y. Zhou, J. Zhu, and J. Zhuo. CoRR, (2017)

BibSonomy

Disambiguation of "Zhou, Yichi"
