Author of the publication

Piecewise constant reinforcement learning for robotic applications.

, , and . ICINCO-ICSO, page 214-221. INSTICC Press, (2007)978-972-8865-82-5.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game., , , and . Adaptive Agents and Multi-Agents Systems, volume 4865 of Lecture Notes in Computer Science, page 129-144. Springer, (2007)ARLO: A framework for Automated Reinforcement Learning., , , , and . Expert Syst. Appl., (August 2023)Risk-Averse Trust Region Optimization for Reward-Volatility Reduction., , , , and . IJCAI, page 4583-4589. ijcai.org, (2020)Special Track on AI in FinTech.A Novel Confidence-Based Algorithm for Structured Bandits., , and . AISTATS, volume 108 of Proceedings of Machine Learning Research, page 3175-3185. PMLR, (2020)Importance Weighted Transfer of Samples in Reinforcement Learning., , , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 4943-4952. PMLR, (2018)A Probabilistic Framework for Weighting Different Sensor Data in MUREA., , and . RoboCup, volume 3020 of Lecture Notes in Computer Science, page 678-685. Springer, (2003)Filling the Gap among Coordination, Planning, and Reaction Using a Fuzzy Cognitive Model., , and . RoboCup, volume 3020 of Lecture Notes in Computer Science, page 662-669. Springer, (2003)A Framework for Robust Sensing in Multi-agent Systems., , and . RoboCup, volume 2377 of Lecture Notes in Computer Science, page 287-292. Springer, (2001)Best Arm Identification for Stochastic Rising Bandits., , , , and . CoRR, (2023)Fast direct calibration of interest rate derivatives pricing models., , and . ICAIF, page 6:1-6:8. ACM, (2020)