Author of the publication

Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information.

, , and . ICLR, OpenReview.net, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Lazy-CFR: fast and near-optimal regret minimization for extensive games with imperfect information., , , , and . ICLR, OpenReview.net, (2020)Selective Verification Strategy for Learning From Crowds., , and . AAAI, page 4147-4154. AAAI Press, (2018)Online Label Aggregation: A Variational Bayesian Approach., , , , and . WWW, page 1904-1915. ACM / IW3C2, (2021)Identify the Nash Equilibrium in Static Games with Random Payoffs., , and . ICML, volume 70 of Proceedings of Machine Learning Research, page 4160-4169. PMLR, (2017)Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback., , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 11473-11482. PMLR, (2022)Exploration Analysis in Finite-Horizon Turn-based Stochastic Games., , , and . UAI, volume 124 of Proceedings of Machine Learning Research, page 201-210. AUAI Press, (2020)Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process., , and . ICML, OpenReview.net, (2024)Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information., , and . ICLR, OpenReview.net, (2020)Regularized OFU: an Efficient UCB Estimator forNon-linear Contextual Bandit., , , , , and . CoRR, (2021)Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors., , and . CoRR, (2017)