Author of the publication

Natural Actor-Critic.

, , and . ECML, volume 3720 of Lecture Notes in Computer Science, page 280-291. Springer, (2005)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Building a Library of Tactile Skills Based on FingerVision., , , , , and . Humanoids, page 717-722. IEEE, (2019)Robust policy updates for stochastic optimal control., , , and . Humanoids, page 388-393. IEEE, (2014)Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation., , , and . AAAI, page 1351-1356. AAAI Press, (2008)Stable reinforcement learning with autoencoders for tactile and visual data., , , , and . IROS, page 3928-3934. IEEE, (2016)Actuation and stiffening in fluid-driven soft robots using low-melting-point material., , , , , , , and . IROS, page 4692-4698. IEEE, (2019)Multimodal Uncertainty Reduction for Intention Recognition in Human-Robot Interaction., , , and . IROS, page 7009-7016. IEEE, (2019)Hierarchical Tactile-Based Control Decomposition of Dexterous In-Hand Manipulation Tasks., , and . Frontiers Robotics AI, (2020)Policy evaluation with temporal differences: a survey and comparison., , and . J. Mach. Learn. Res., 15 (1): 809-883 (2014)Multi-agent active information gathering in discrete and continuous-state decentralized POMDPs by policy graph improvement., , and . Auton. Agents Multi Agent Syst., 34 (2): 42 (2020)Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts., , and . CoRR, (2023)