Author of the publication

Deep Reinforcement Learning that Matters

, , , , , and . (2017)cite arxiv:1709.06560Comment: Accepted to the Thirthy-Second AAAI Conference On Artificial Intelligence (AAAI), 2018.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Survey of Available Corpora For Building Data-Driven Dialogue Systems: The Journal Version., , , , and . D&D, 9 (1): 1-49 (2018)Separating value functions across time-scales., , , , , and . CoRR, (2019)Adversarial Gain., , , and . CoRR, (2018)LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models., , , , , , , , , and 30 other author(s). CoRR, (2023)Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications., , , , , , , , and . CoRR, (2024)Entropy Regularization for Population Estimation., , , and . CoRR, (2022)The RLLChatbot: a solution to the ConvAI challenge., , , , , , and . CoRR, (2018)Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection., , , , , , , and . AAAI, page 5087-5095. AAAI Press, (2023)When does pretraining help?: assessing self-supervised learning for law and the CaseHOLD dataset of 53, 000+ legal holdings., , , , and . ICAIL, page 159-168. ACM, (2021)Beyond Ads: Sequential Decision-Making Algorithms in Law and Public Policy., , , and . CSLAW, page 87-100. ACM, (2022)