Author of the publication

Data-Efficient Policy Evaluation Through Behavior Policy Search.

ICML, volume 70 of Proceedings of Machine Learning Research, page 1394-1403. PMLR, (2017)

Other publications of authors with the same name

Learning Fair Representations with High-Confidence Guarantees. CoRR, (2023)
Optimization using Parallel Gradient Evaluations on Multiple Parameters. CoRR, (2023)
Reinforcement Learning When All Actions Are Not Always Available. AAAI, page 3381-3388. AAAI Press, (2020)
Increasing the Action Gap: New Operators for Reinforcement Learning. AAAI, page 1476-1483. AAAI Press, (2016)
Is the Policy Gradient a Gradient? AAMAS, page 939-947. International Foundation for Autonomous Agents and Multiagent Systems, (2020)
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments. AISTATS, volume 206 of Proceedings of Machine Learning Research, page 5474-5492. PMLR, (2023)
Evaluating the Performance of Reinforcement Learning Algorithms. ICML, volume 119 of Proceedings of Machine Learning Research, page 4962-4973. PMLR, (2020)
Optimizing for the Future in Non-Stationary MDPs. ICML, volume 119 of Proceedings of Machine Learning Research, page 1414-1425. PMLR, (2020)
SOPE: Spectrum of Off-Policy Estimators. NeurIPS, page 18958-18969. (2021)
Analyzing the Relationship Between Difference and Ratio-Based Fairness Metrics. FAccT, page 518-528. ACM, (2024)