Author of the publication

Shaping Proto-Value Functions Using Rewards.

, , , and . ECAI, volume 285 of Frontiers in Artificial Intelligence and Applications, page 1690-1691. IOS Press, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A time aggregation approach to Markov decision processes., , , , and . Autom., 38 (6): 929-943 (2002)General-sum stochastic games: Verifiability conditions for Nash equilibria., and . Autom., 48 (11): 2923-2930 (2012)On tight bounds for function approximation error in risk-sensitive reinforcement learning., and . Syst. Control. Lett., (2021)Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning., , , , , , , , and . ICRA, page 1631-1637. IEEE, (2022)Parametrized Actor-Critic Algorithms for Finite-Horizon MDPs., and . ACC, page 534-539. IEEE, (2007)Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm., , , and . IJCNN, page 1-10. IEEE, (2022)Fuzzy Clustering Based Ad Recommendation for TV Programs., , , and . EuroITV, volume 4471 of Lecture Notes in Computer Science, page 175-184. Springer, (2007)Two-Timescale Algorithms for Learning Nash Equilibria in General-Sum Stochastic Games., , and . AAMAS, page 1371-1379. ACM, (2015)Mechanisms for hostile agents with capacity constraints., , , and . AAMAS, page 659-666. IFAAMAS, (2013)A model based search method for prediction in model-free Markov decision process., and . IJCNN, page 170-177. IEEE, (2017)