Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.

G. Tucker, A. Mnih, C. Maddison, D. Lawson, and J. Sohl-Dickstein. NIPS, page 2627-2636. (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Dewey Tucker

Richard Tucker

Tucker Hermans

Bernard Tucker

Lisa Tucker-Kellogg

Other publications of authors with the same name

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open ProblemsS. Levine, A. Kumar, G. Tucker, and J. Fu. (2020)cite arxiv:2005.01643.Particle Value Functions.C. Maddison, D. Lawson, G. Tucker, N. Heess, A. Doucet, A. Mnih, and Y. Teh. ICLR (Workshop), OpenReview.net, (2017)Smoothed Action Value Functions for Learning Gaussian Policies.O. Nachum, M. Norouzi, G. Tucker, and D. Schuurmans. ICML, volume 80 of Proceedings of Machine Learning Research, page 3689-3697. PMLR, (2018)A sampling framework for incorporating quantitative mass spectrometry data in protein interaction analysis.G. Tucker, P. Loh, and B. Berger. BMC Bioinform., (2013)An online sequence-to-sequence model for noisy speech recognition.C. Chiu, D. Lawson, Y. Luo, G. Tucker, K. Swersky, I. Sutskever, and N. Jaitly. CoRR, (2017)Learning Hard Alignments with Variational Inference.D. Lawson, C. Chiu, G. Tucker, C. Raffel, K. Swersky, and N. Jaitly. ICASSP, page 5799-5803. IEEE, (2018)Gemini: A Family of Highly Capable Multimodal Models.R. Anil, S. Borgeaud, Y. Wu, J. Alayrac, J. Yu, R. Soricut, J. Schalkwyk, A. Dai, A. Hauth, K. Millican and 42 other author(s). CoRR, (2023)Model Selection in Batch Policy Optimization.J. Lee, G. Tucker, O. Nachum, and B. Dai. CoRR, (2021)Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios.Y. Lu, J. Fu, G. Tucker, X. Pan, E. Bronstein, B. Roelofs, B. Sapp, B. White, A. Faust, S. Whiteson and 2 other author(s). CoRR, (2022)Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction.A. Kumar, J. Fu, G. Tucker, and S. Levine. CoRR, (2019)

BibSonomy

Disambiguation of "Tucker, George"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.

Please choose a person to relate this publication to

Dewey Tucker

Richard Tucker

Tucker Hermans

Bernard Tucker

Lisa Tucker-Kellogg

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Tucker, George"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.

Please choose a person to relate this publication to

Dewey Tucker

Richard Tucker

Tucker Hermans

Bernard Tucker

Lisa Tucker-Kellogg

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models.