Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Data Distributional Properties Drive Emergent In-Context Learning in Transformers., , , , , , , and . NeurIPS, (2022)Combining learning rate decay and weight decay with complexity gradient descent - Part I., and . CoRR, (2019)BYOL works even without batch statistics., , , , , , , , , and 1 other author(s). CoRR, (2020)On Wasserstein Reinforcement Learning and the Fokker-Planck equation., and . CoRR, (2017)Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning., , , , , , , , , and 4 other author(s). NeurIPS, (2020)A short variational proof of equivalence between policy gradients and soft Q learning., and . CoRR, (2017)Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit., and . CoRR, (2017)Biologically inspired architectures for sample-efficient deep reinforcement learning., , and . CoRR, (2019)Static Activation Function Normalization., and . CoRR, (2019)Bootstrap your own latent: A new approach to self-supervised learning, , , , , , , , , and 1 other author(s). arXiv preprint arXiv:2006.07733, (2020)