Author of the publication

The Value Function Polytope in Reinforcement Learning.

ICML, volume 97 of Proceedings of Machine Learning Research, pages 1486-1495. PMLR, (2019)


Other publications of authors with the same name

Strictly Lexicalised Dependency Parsing. Trends in Parsing Technology, Springer, (2010)

Stochastic Neural Networks with Monotonic Activation Functions. (2015). arXiv:1601.00034v2. Comment: AISTATS 2016.

Learning Gene Regulatory Networks via Globally Regularized Risk Minimization. RECOMB-CG, volume 4751 of Lecture Notes in Computer Science, pages 83-95. Springer, (2007)

Variational Rejection Sampling. AISTATS, volume 84 of Proceedings of Machine Learning Research, pages 823-832. PMLR, (2018)

Data Perturbation for Escaping Local Maxima in Learning. AAAI/IAAI, pages 132-139. AAAI Press / The MIT Press, (2002)

Sparse Learning Based Linear Coherent Bi-clustering. WABI, volume 7534 of Lecture Notes in Computer Science, pages 346-364. Springer, (2012)

Self-Supervised Chinese Word Segmentation. IDA, volume 2189 of Lecture Notes in Computer Science, pages 238-247. Springer, (2001)

Divergence based graph estimation for manifold learning. GlobalSIP, pages 447-450. IEEE, (2013)

Combining Statistical Language Models via the Latent Maximum Entropy Principle. Mach. Learn., 60 (1-3): 229-250 (2005)

Trust-PCL: An Off-Policy Trust Region Method for Continuous Control. CoRR, (2017)