Author of the publication

Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?

, , , , , and . (2017)cite arxiv:1711.02301Comment: Accepted to ICML 2018, code opensourced at: https://github.com/rubai5/ESS_Game.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Generative Model of Vector Space Semantics., and . CVSM@ACL, page 91-99. Association for Computational Linguistics, (2013)Joint Modeling of Chest Radiographs and Radiology Reports for Pulmonary Edema Assessment., , , , , , , , and . MICCAI (2), volume 12262 of Lecture Notes in Computer Science, page 529-539. Springer, (2020)Guiding Policies with Language via Meta-Learning., , , , , , , and . ICLR (Poster), OpenReview.net, (2019)Eliciting Human Preferences with Language Models., , , and . CoRR, (2023)Unified Pragmatic Models for Generating and Following Instructions., , and . NAACL-HLT, page 1951-1963. Association for Computational Linguistics, (2018)Hierarchical Phrase-Based Sequence-to-Sequence Learning., , , and . EMNLP, page 8211-8229. Association for Computational Linguistics, (2022)On the Accuracy of Self-Normalized Log-Linear Models., , , and . NIPS, page 1783-1791. (2015)Teachable Reinforcement Learning via Advice Distillation., , , , and . NeurIPS, page 6920-6933. (2021)Unsupervised Transcription of Piano Music., , and . NIPS, page 1538-1546. (2014)Guiding Pretraining in Reinforcement Learning with Large Language Models., , , , , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 8657-8677. PMLR, (2023)