Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?

M. Raghu, A. Irpan, J. Andreas, R. Kleinberg, Q. Le, and J. Kleinberg. (2017)cite arxiv:1711.02301Comment: Accepted to ICML 2018, code opensourced at: https://github.com/rubai5/ESS_Game.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Andreas Jacob

Other publications of authors with the same name

A Generative Model of Vector Space Semantics.J. Andreas, and Z. Ghahramani. CVSM@ACL, page 91-99. Association for Computational Linguistics, (2013)Joint Modeling of Chest Radiographs and Radiology Reports for Pulmonary Edema Assessment.G. Chauhan, R. Liao, W. III, J. Andreas, X. Wang, S. Berkowitz, S. Horng, P. Szolovits, and P. Golland. MICCAI (2), volume 12262 of Lecture Notes in Computer Science, page 529-539. Springer, (2020)Guiding Policies with Language via Meta-Learning.J. Co-Reyes, A. Gupta, S. Sanjeev, N. Altieri, J. Andreas, J. DeNero, P. Abbeel, and S. Levine. ICLR (Poster), OpenReview.net, (2019)Eliciting Human Preferences with Language Models.B. Li, A. Tamkin, N. Goodman, and J. Andreas. CoRR, (2023)Unified Pragmatic Models for Generating and Following Instructions.D. Fried, J. Andreas, and D. Klein. NAACL-HLT, page 1951-1963. Association for Computational Linguistics, (2018)Hierarchical Phrase-Based Sequence-to-Sequence Learning.B. Wang, I. Titov, J. Andreas, and Y. Kim. EMNLP, page 8211-8229. Association for Computational Linguistics, (2022)On the Accuracy of Self-Normalized Log-Linear Models.J. Andreas, M. Rabinovich, M. Jordan, and D. Klein. NIPS, page 1783-1791. (2015)Teachable Reinforcement Learning via Advice Distillation.O. Watkins, A. Gupta, T. Darrell, P. Abbeel, and J. Andreas. NeurIPS, page 6920-6933. (2021)Unsupervised Transcription of Piano Music.T. Berg-Kirkpatrick, J. Andreas, and D. Klein. NIPS, page 1538-1546. (2014)Guiding Pretraining in Reinforcement Learning with Large Language Models.Y. Du, O. Watkins, Z. Wang, C. Colas, T. Darrell, P. Abbeel, A. Gupta, and J. Andreas. ICML, volume 202 of Proceedings of Machine Learning Research, page 8657-8677. PMLR, (2023)

BibSonomy

Disambiguation of "Andreas, Jacob"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?

Please choose a person to relate this publication to

Andreas Jacob

Andreas Jacob

Andreas Jacob

Andreas Jacob

Andreas Jacob

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Andreas, Jacob"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?

Please choose a person to relate this publication to

Andreas Jacob

Andreas Jacob

Andreas Jacob

Andreas Jacob

Andreas Jacob

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?