Author of the publication

The Value Function Polytope in Reinforcement Learning.

R. Dadashi, M. Bellemare, A. Taïga, N. Le Roux, and D. Schuurmans. ICML, volume 97 of Proceedings of Machine Learning Research, pages 1486-1495. PMLR, (2019)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Dale Adams

Aklilu Dalelo

Dale Rubbra

Dale Pearson

Jonathan Dale

Other publications of authors with the same name

Strictly Lexicalised Dependency Parsing. Q. Wang, D. Schuurmans, and D. Lin. Trends in Parsing Technology, Springer, (2010)

Stochastic Neural Networks with Monotonic Activation Functions. S. Ravanbakhsh, B. Poczos, J. Schneider, D. Schuurmans, and R. Greiner. (2015) arXiv:1601.00034v2. Comment: AISTATS 2016.

Learning Gene Regulatory Networks via Globally Regularized Risk Minimization. Y. Guo, and D. Schuurmans. RECOMB-CG, volume 4751 of Lecture Notes in Computer Science, page 83-95. Springer, (2007)

Variational Rejection Sampling. A. Grover, R. Gummadi, M. Lázaro-Gredilla, D. Schuurmans, and S. Ermon. AISTATS, volume 84 of Proceedings of Machine Learning Research, page 823-832. PMLR, (2018)

Data Perturbation for Escaping Local Maxima in Learning. G. Elidan, M. Ninio, N. Friedman, and D. Schuurmans. AAAI/IAAI, page 132-139. AAAI Press / The MIT Press, (2002)

Sparse Learning Based Linear Coherent Bi-clustering. Y. Shi, X. Liao, X. Zhang, G. Lin, and D. Schuurmans. WABI, volume 7534 of Lecture Notes in Computer Science, page 346-364. Springer, (2012)

Self-Supervised Chinese Word Segmentation. F. Peng, and D. Schuurmans. IDA, volume 2189 of Lecture Notes in Computer Science, page 238-247. Springer, (2001)

Divergence based graph estimation for manifold learning. K. Abou-Moustafa, F. Ferrie, and D. Schuurmans. GlobalSIP, page 447-450. IEEE, (2013)

Combining Statistical Language Models via the Latent Maximum Entropy Principle. S. Wang, D. Schuurmans, F. Peng, and Y. Zhao. Mach. Learn., 60 (1-3): 229-250 (2005)

Trust-PCL: An Off-Policy Trust Region Method for Continuous Control. O. Nachum, M. Norouzi, K. Xu, and D. Schuurmans. CoRR, (2017)

BibSonomy

Disambiguation of "Schuurmans, Dale"
