Author of the publication

Nonparametric Return Distribution Approximation for Reinforcement Learning.

, , , , and . ICML, page 799-806. Omnipress, (2010)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Large-Scale Nonparametric Estimation of Vehicle Travel Time Distributions., , and . SDM, page 12-23. SIAM / Omnipress, (2012)Nonparametric Return Distribution Approximation for Reinforcement Learning., , , , and . ICML, page 799-806. Omnipress, (2010)Natural actor-critic with baseline adjustment for variance reduction., , and . Artif. Life Robotics, 13 (1): 275-279 (2008)Predicting halfway through simulation: early scenario evaluation using intermediate features of agent-based simulations., , , and . WSC, page 334-343. IEEE/ACM, (2014)Sampler for Composition Ratio by Markov Chain Monte Carlo., , and . CoRR, (2019)Least Absolute Policy Iteration-A Robust Approach to Value Function Approximation., , , and . IEICE Trans. Inf. Syst., 93-D (9): 2555-2565 (2010)Least absolute policy iteration for robust value function approximation., , , and . ICRA, page 2904-2909. IEEE, (2009)Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning., , , , and . Neural Comput., 22 (2): 342-376 (2010)Frugal signal control using low resolution web-camera and traffic flow estimation., , , and . WSC, page 2082-2091. IEEE/ACM, (2014)Solving inverse problem of Markov chain with partial observations., , and . NIPS, page 1655-1663. (2013)