Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step., , , , , and . ICLR (Poster), OpenReview.net, (2018)Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity., , and . J. Mach. Learn. Res., (2022)Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts., , , , , , , , , and 10 other author(s). CoRR, (2023)Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?, , , , , , , , , and . EMNLP (Findings), page 12342-12364. Association for Computational Linguistics, (2023)Disentangling the independently controllable factors of variation by interacting with the world., , , , , , , , and . CoRR, (2018)Recall Traces: Backtracking Models for Efficient Reinforcement Learning., , , , , , and . CoRR, (2018)MaskGAN: Better Text Generation via Filling in the _______., , and . ICLR (Poster), OpenReview.net, (2018)Recall Traces: Backtracking Models for Efficient Reinforcement Learning., , , , , , , and . ICLR (Poster), OpenReview.net, (2019)Hyperbolic Discounting and Learning over Multiple Horizons, , , , and . (2019)cite arxiv:1902.06865.Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction., , , , and . CoRR, (2019)