Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Concave Utility Reinforcement Learning: The Mean-field Game Viewpoint.

M. Geist, J. Pérolat, M. Laurière, R. Elie, S. Perrin, O. Bachem, R. Munos, and O. Pietquin. AAMAS, page 489-497. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Otto Bachem

Carl Bachem

Gisela Bachem

Achim Bachem

Ulrich Bachem

Other publications of authors with the same name

Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations.F. Locatello, S. Bauer, M. Lucic, S. Gelly, B. Schölkopf, and O. Bachem. CoRR, (2018)Coresets for Nonparametric Estimation - the Case of DP-Means.O. Bachem, M. Lucic, and A. Krause. ICML, volume 37 of JMLR Workshop and Conference Proceedings, page 209-217. JMLR.org, (2015)Uniform Deviation Bounds for k-Means Clustering.O. Bachem, M. Lucic, S. Hassani, and A. Krause. ICML, volume 70 of Proceedings of Machine Learning Research, page 283-291. PMLR, (2017)Distributed and Provably Good Seedings for k-Means in Constant Rounds.O. Bachem, M. Lucic, and A. Krause. ICML, volume 70 of Proceedings of Machine Learning Research, page 292-300. PMLR, (2017)Evaluating Generative Models Using Divergence Frontiers.J. Djolonga, M. Lucic, M. Cuturi, O. Bachem, O. Bousquet, and S. Gelly. CoRR, (2019)Google Research Football: A Novel Reinforcement Learning Environment.K. Kurach, A. Raichuk, P. Stanczyk, M. Zajac, O. Bachem, L. Espeholt, C. Riquelme, D. Vincent, M. Michalski, O. Bousquet and 1 other author(s). AAAI, page 4501-4510. AAAI Press, (2020)Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.P. Roit, J. Ferret, L. Shani, R. Aharoni, G. Cideron, R. Dadashi, M. Geist, S. Girgin, L. Hussenot, O. Keller and 9 other author(s). ACL (1), page 6252-6272. Association for Computational Linguistics, (2023)On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes.R. Agarwal, N. Vieillard, Y. Zhou, P. Stanczyk, S. Garea, M. Geist, and O. Bachem. ICLR, OpenReview.net, (2024)Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning.K. Wang, R. Kidambi, R. Sullivan, A. Agarwal, C. Dann, A. Michi, M. Gelmi, Y. Li, R. Gupta, A. Dubey and 10 other author(s). CoRR, (2024)Scalable k -Means Clustering via Lightweight Coresets.O. Bachem, M. Lucic, and A. Krause. KDD, page 1119-1127. ACM, (2018)

BibSonomy

Disambiguation of "Bachem, Olivier"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Concave Utility Reinforcement Learning: The Mean-field Game Viewpoint.

Please choose a person to relate this publication to

Otto Bachem

Carl Bachem

Gisela Bachem

Achim Bachem

Ulrich Bachem

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Bachem, Olivier"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Concave Utility Reinforcement Learning: The Mean-field Game Viewpoint.

Please choose a person to relate this publication to

Otto Bachem

Carl Bachem

Gisela Bachem

Achim Bachem

Ulrich Bachem

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Concave Utility Reinforcement Learning: The Mean-field Game Viewpoint.