Author of the publication

Nonparametric Return Distribution Approximation for Reinforcement Learning.

T. Morimura, M. Sugiyama, H. Kashima, H. Hachiya, and T. Tanaka. ICML, page 799-806. Omnipress, (2010)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. The button next to each name shows some of the publications already assigned to that person.

Tetsuro Samata

Other publications of authors with the same name

Large-Scale Nonparametric Estimation of Vehicle Travel Time Distributions. R. Takahashi, T. Osogami, and T. Morimura. SDM, page 12-23. SIAM / Omnipress, (2012)
Nonparametric Return Distribution Approximation for Reinforcement Learning. T. Morimura, M. Sugiyama, H. Kashima, H. Hachiya, and T. Tanaka. ICML, page 799-806. Omnipress, (2010)
Natural actor-critic with baseline adjustment for variance reduction. T. Morimura, E. Uchibe, and K. Doya. Artif. Life Robotics, 13 (1): 275-279 (2008)
Predicting halfway through simulation: early scenario evaluation using intermediate features of agent-based simulations. S. Hara, R. Raymond, T. Morimura, and H. Muta. WSC, page 334-343. IEEE/ACM, (2014)
Sampler for Composition Ratio by Markov Chain Monte Carlo. Y. Obara, T. Morimura, and H. Yanagisawa. CoRR, (2019)
Least Absolute Policy Iteration-A Robust Approach to Value Function Approximation. M. Sugiyama, H. Hachiya, H. Kashima, and T. Morimura. IEICE Trans. Inf. Syst., 93-D (9): 2555-2565 (2010)
Least absolute policy iteration for robust value function approximation. M. Sugiyama, H. Hachiya, H. Kashima, and T. Morimura. ICRA, page 2904-2909. IEEE, (2009)
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning. T. Morimura, E. Uchibe, J. Yoshimoto, J. Peters, and K. Doya. Neural Comput., 22 (2): 342-376 (2010)
Frugal signal control using low resolution web-camera and traffic flow estimation. K. Maeda, T. Morimura, T. Katsuki, and M. Teraguchi. WSC, page 2082-2091. IEEE/ACM, (2014)
Solving inverse problem of Markov chain with partial observations. T. Morimura, T. Osogami, and T. Idé. NIPS, page 1655-1663. (2013)

BibSonomy

Disambiguation of "Morimura, Tetsuro"
