Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning Instance-Independent Value Functions to Enhance Local Search., , , and . NIPS, page 1017-1023. The MIT Press, (1998)Incremental Natural Actor-Critic Algorithms., , , and . NIPS, page 105-112. Curran Associates, Inc., (2007)Off-policy learning based on weighted importance sampling with linear computational complexity., and . UAI, page 552-561. AUAI Press, (2015)Stimulus Representation and the Timing of Reward-Prediction Errors in Models of the Dopamine System., , and . Neural Comput., 20 (12): 3034-3054 (2008)Reward is enough., , , and . Artif. Intell., (2021)A new Q(lambda) with interim forward view and Monte Carlo equivalence., , , and . ICML, volume 32 of JMLR Workshop and Conference Proceedings, page 568-576. JMLR.org, (2014)An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task., and . CoRR, (2021)Online Real-Time Recurrent Learning Using Sparse Connections and Selective Learning., , , and . CoRR, (2023)DYNA, an integrated architecture for learning, planning, and reacting. Working Notes of the 1991 AAAI Spring Symposium on Integrated Intelligent Architectures, (1991)Reinforcement Learning of Local Shape in the Game of Go, , and . IJCAI, page 1053-1058. (2007)