Author of the publication

Combining parametric and nonparametric models for off-policy evaluation.

, , , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 2366-2375. PMLR, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Sublinear Optimal Policy Value Estimation in Contextual Bandits., , and . CoRR, (2019)Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization., , , and . CoRR, (2023)Learning to be Fair: A Consequentialist Approach to Equitable Decision-Making., , , and . CoRR, (2021)Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy., , , and . AAAI, page 4436-4443. AAAI Press, (2020)Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding., , , and . NeurIPS, (2020)Reinforcement Learning with State Observation Costs in Action-Contingent Noiselessly Observable Markov Decision Processes., , and . NeurIPS, page 15650-15666. (2021)Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning., , and . NeurIPS, page 13626-13640. (2021)Fairer but Not Fair Enough On the Equitability of Knowledge Tracing., and . LAK, page 335-339. ACM, (2019)PLOTS: Procedure Learning from Observations using subTask Structure., , and . AAMAS, page 1007-1015. International Foundation for Autonomous Agents and Multiagent Systems, (2019)Sublinear Optimal Policy Value Estimation in Contextual Bandits., , and . AISTATS, volume 108 of Proceedings of Machine Learning Research, page 4377-4387. PMLR, (2020)