Author of the publication

Concave Utility Reinforcement Learning: The Mean-field Game Viewpoint.

, , , , , , , and . AAMAS, page 489-497. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations., , , , , and . CoRR, (2018)Coresets for Nonparametric Estimation - the Case of DP-Means., , and . ICML, volume 37 of JMLR Workshop and Conference Proceedings, page 209-217. JMLR.org, (2015)Uniform Deviation Bounds for k-Means Clustering., , , and . ICML, volume 70 of Proceedings of Machine Learning Research, page 283-291. PMLR, (2017)Distributed and Provably Good Seedings for k-Means in Constant Rounds., , and . ICML, volume 70 of Proceedings of Machine Learning Research, page 292-300. PMLR, (2017)Evaluating Generative Models Using Divergence Frontiers., , , , , and . CoRR, (2019)Google Research Football: A Novel Reinforcement Learning Environment., , , , , , , , , and 1 other author(s). AAAI, page 4501-4510. AAAI Press, (2020)Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback., , , , , , , , , and 9 other author(s). ACL (1), page 6252-6272. Association for Computational Linguistics, (2023)On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes., , , , , , and . ICLR, OpenReview.net, (2024)Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning., , , , , , , , , and 10 other author(s). CoRR, (2024)Scalable k -Means Clustering via Lightweight Coresets., , and . KDD, page 1119-1127. ACM, (2018)