Author of the publication

Combination of learning from non-optimal demonstrations and feedbacks using inverse reinforcement learning and Bayesian policy improvement.

, , , and . Expert Syst. Appl., (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Effects of a Bio-mimicked Flapping Path on Propulsion Efficiency of Two-segmental Fish Robots., , and . IROS, page 1721-1726. IEEE, (2019)A multi-robot system for dome inspection and maintenance: Concept and stability analysis., , and . ROBIO, page 853-858. IEEE, (2011)Flow visualization over a thick blunt trailing-edge airfoil with base cavity at low Reynolds numbers using PIV technique., , , and . J. Vis., 20 (4): 695-710 (2017)Compliance: encoded information and behavior in a team of cooperative object-handling robots., , and . Adv. Robotics, 17 (5): 427-446 (2003)A Distributed Q-Learning Approach for Variable Attention to Multiple Critics., , , and . ICONIP (3), volume 7665 of Lecture Notes in Computer Science, page 244-251. Springer, (2012)FPGA Implementation of a Cortical Network Based on the Hodgkin-Huxley Neuron Model., , , , and . ICONIP (1), volume 7663 of Lecture Notes in Computer Science, page 243-250. Springer, (2012)Learning to Integrate an Artificial Sensory Device: How Bayesian Integration May Lead to Nonoptimal Perception., , , and . IEEE Trans. Cogn. Dev. Syst., 14 (4): 1755-1765 (2022)Learning sequential visual attention control through dynamic state space discretization., , and . ICRA, page 2258-2263. IEEE, (2009)Reduction of Learning Time for Robots Using Automatic State Abstraction., , and . EUROS, volume 22 of Springer Tracts in Advanced Robotics, page 79-92. Springer, (2006)Comparing Learning Attention Control in Perceptual and Decision Space., , , and . WAPCV, volume 5395 of Lecture Notes in Computer Science, page 242-256. Springer, (2008)