Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Deep Reinforcement Learning that Matters

P. Henderson, R. Islam, P. Bachman, J. Pineau, D. Precup, and D. Meger. (2017)cite arxiv:1709.06560Comment: Accepted to the Thirthy-Second AAAI Conference On Artificial Intelligence (AAAI), 2018.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Peter Henderson

Duane Henderson

Gemma Henderson

Neil Henderson

Joyce Henderson

Other publications of authors with the same name

A Survey of Available Corpora For Building Data-Driven Dialogue Systems: The Journal Version.I. Serban, R. Lowe, P. Henderson, L. Charlin, and J. Pineau. D&D, 9 (1): 1-49 (2018)Separating value functions across time-scales.J. Romoff, P. Henderson, A. Touati, Y. Ollivier, E. Brunskill, and J. Pineau. CoRR, (2019)Adversarial Gain.P. Henderson, K. Sinha, N. Ke, and J. Pineau. CoRR, (2018)LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models.N. Guha, J. Nyarko, D. Ho, C. Ré, A. Chilton, A. Narayana, A. Chohlas-Wood, A. Peters, B. Waldon, D. Rockmore and 30 other author(s). CoRR, (2023)Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications.B. Wei, K. Huang, Y. Huang, T. Xie, X. Qi, M. Xia, P. Mittal, M. Wang, and P. Henderson. CoRR, (2024)Entropy Regularization for Population Estimation.B. Chugg, P. Henderson, J. Goldin, and D. Ho. CoRR, (2022)The RLLChatbot: a solution to the ConvAI challenge.N. Gontier, K. Sinha, P. Henderson, I. Serban, M. Noseworthy, P. Parthasarathi, and J. Pineau. CoRR, (2018)Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection.P. Henderson, B. Chugg, B. Anderson, K. Altenburger, A. Turk, J. Guyton, J. Goldin, and D. Ho. AAAI, page 5087-5095. AAAI Press, (2023)When does pretraining help?: assessing self-supervised learning for law and the CaseHOLD dataset of 53, 000+ legal holdings.L. Zheng, N. Guha, B. Anderson, P. Henderson, and D. Ho. ICAIL, page 159-168. ACM, (2021)Beyond Ads: Sequential Decision-Making Algorithms in Law and Public Policy.P. Henderson, B. Chugg, B. Anderson, and D. Ho. CSLAW, page 87-100. ACM, (2022)

BibSonomy

Disambiguation of "Henderson, Peter"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Deep Reinforcement Learning that Matters

Please choose a person to relate this publication to

Peter Henderson

Duane Henderson

Gemma Henderson

Neil Henderson

Joyce Henderson

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Henderson, Peter"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Deep Reinforcement Learning that Matters

Please choose a person to relate this publication to

Peter Henderson

Duane Henderson

Gemma Henderson

Neil Henderson

Joyce Henderson

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Deep Reinforcement Learning that Matters