Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

On Wasserstein Reinforcement Learning and the Fokker-Planck equation.

P. Richemond, and B. Maginnis. CoRR, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

H -H Kaatz

Florian H H Giesert

Lobna H airall h

Christoph H -H Traulsen

Albrecht H -H Meyer

Other publications of authors with the same name

Data Distributional Properties Drive Emergent In-Context Learning in Transformers.S. Chan, A. Santoro, A. Lampinen, J. Wang, A. Singh, P. Richemond, J. McClelland, and F. Hill. NeurIPS, (2022)Combining learning rate decay and weight decay with complexity gradient descent - Part I.P. Richemond, and Y. Guo. CoRR, (2019)BYOL works even without batch statistics.P. Richemond, J. Grill, F. Altché, C. Tallec, F. Strub, A. Brock, S. Smith, S. De, R. Pascanu, B. Piot and 1 other author(s). CoRR, (2020)On Wasserstein Reinforcement Learning and the Fokker-Planck equation.P. Richemond, and B. Maginnis. CoRR, (2017)Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning.J. Grill, F. Strub, F. Altché, C. Tallec, P. Richemond, E. Buchatskaya, C. Doersch, B. Pires, Z. Guo, M. Azar and 4 other author(s). NeurIPS, (2020)A short variational proof of equivalence between policy gradients and soft Q learning.P. Richemond, and B. Maginnis. CoRR, (2017)Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit.B. Maginnis, and P. Richemond. CoRR, (2017)Biologically inspired architectures for sample-efficient deep reinforcement learning.P. Richemond, A. Kolbeinsson, and Y. Guo. CoRR, (2019)Static Activation Function Normalization.P. Richemond, and Y. Guo. CoRR, (2019)Bootstrap your own latent: A new approach to self-supervised learningJ. Grill, F. Strub, F. Altché, C. Tallec, P. Richemond, E. Buchatskaya, C. Doersch, B. Pires, Z. Guo, M. Azar and 1 other author(s). arXiv preprint arXiv:2006.07733, (2020)

BibSonomy

Disambiguation of "Richemond, Pierre H."

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

On Wasserstein Reinforcement Learning and the Fokker-Planck equation.

Please choose a person to relate this publication to

H -H Kaatz

Florian H H Giesert

Lobna H airall h

Christoph H -H Traulsen

Albrecht H -H Meyer

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Richemond, Pierre H."

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML On Wasserstein Reinforcement Learning and the Fokker-Planck equation.

Please choose a person to relate this publication to

H -H Kaatz

Florian H H Giesert

Lobna H airall h

Christoph H -H Traulsen

Albrecht H -H Meyer

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

On Wasserstein Reinforcement Learning and the Fokker-Planck equation.