Author of the publication

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.

, , , , and . NIPS, page 2627-2636. (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems, , , and . (2020)cite arxiv:2005.01643.Particle Value Functions., , , , , , and . ICLR (Workshop), OpenReview.net, (2017)Smoothed Action Value Functions for Learning Gaussian Policies., , , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 3689-3697. PMLR, (2018)A sampling framework for incorporating quantitative mass spectrometry data in protein interaction analysis., , and . BMC Bioinform., (2013)An online sequence-to-sequence model for noisy speech recognition., , , , , , and . CoRR, (2017)Learning Hard Alignments with Variational Inference., , , , , and . ICASSP, page 5799-5803. IEEE, (2018)Gemini: A Family of Highly Capable Multimodal Models., , , , , , , , , and 42 other author(s). CoRR, (2023)Model Selection in Batch Policy Optimization., , , and . CoRR, (2021)Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios., , , , , , , , , and 2 other author(s). CoRR, (2022)Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction., , , and . CoRR, (2019)