Author of the publication

Safe Policy Iteration.

, , , and . ICML (3), volume 28 of JMLR Workshop and Conference Proceedings, page 307-315. JMLR.org, (2013)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Risk-Averse Trust Region Optimization for Reward-Volatility Reduction., , , , and . IJCAI, page 4583-4589. ijcai.org, (2020)Special Track on AI in FinTech.ARLO: A framework for Automated Reinforcement Learning., , , , and . Expert Syst. Appl., (August 2023)Safe Policy Iteration: A Monotonically Improving Approximate Policy Iteration Approach., , , and . J. Mach. Learn. Res., (2021)Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation., , and . J. Artif. Intell. Res., (2016)Policy gradient approaches for multi-objective sequential decision making., , , , and . IJCNN, page 2323-2330. IEEE, (2014)Piecewise constant reinforcement learning for robotic applications., , and . ICINCO-ICSO, page 214-221. INSTICC Press, (2007)978-972-8865-82-5.Equilibrium approximation in simulation-based extensive-form games., and . AAMAS, page 199-206. IFAAMAS, (2011)Extensive-form games with heterogeneous populations: solution concepts, equilibria characterization, learning dynamics., , and . Intelligenza Artificiale, 10 (1): 19-31 (2016)Time-Variant Variational Transfer for Value Functions., , , and . CoRR, (2020)A Policy Gradient Method for Task-Agnostic Exploration., , and . CoRR, (2020)