Author of the publication

Combining parametric and nonparametric models for off-policy evaluation.

, , , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 2366-2375. PMLR, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning to be Fair: A Consequentialist Approach to Equitable Decision-Making., , , and . CoRR, (2021)Sublinear Optimal Policy Value Estimation in Contextual Bandits., , and . CoRR, (2019)Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization., , , and . CoRR, (2023)Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy., , , and . AAAI, page 4436-4443. AAAI Press, (2020)Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding., , , and . NeurIPS, (2020)Reinforcement Learning with State Observation Costs in Action-Contingent Noiselessly Observable Markov Decision Processes., , and . NeurIPS, page 15650-15666. (2021)Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning., , and . NeurIPS, page 13626-13640. (2021)Fairer but Not Fair Enough On the Equitability of Knowledge Tracing., and . LAK, page 335-339. ACM, (2019)Value Driven Representation for Human-in-the-Loop Reinforcement Learning., and . UMAP, page 176-180. ACM, (2019)PLOTS: Procedure Learning from Observations using subTask Structure., , and . AAMAS, page 1007-1015. International Foundation for Autonomous Agents and Multiagent Systems, (2019)