Q-learning

Abstract

Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states.
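
As an illustration of the incremental improvement the abstract describes, here is a minimal sketch of tabular Q-learning. It uses the standard one-step update Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_b Q(s',b) - Q(s,a)). The environment (a five-state chain with a reward of 1 for reaching the right end), the function name `q_learning`, and every parameter value are assumptions made for this example and are not taken from this record.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain (illustrative only).

    States are 0..n_states-1. Action 0 moves left, action 1 moves right.
    Reaching the last state ends the episode with reward 1; all other
    transitions give reward 0.
    """
    rng = random.Random(seed)
    moves = (-1, +1)
    goal = n_states - 1
    # One value per (state, action) pair, initialised to zero.
    q = [[0.0, 0.0] for _ in range(n_states)]

    for _ in range(episodes):
        s = 0
        while s != goal:
            # Epsilon-greedy action selection with random tie-breaking.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                best = max(q[s])
                a = rng.choice([i for i, v in enumerate(q[s]) if v == best])

            # Deterministic transition, clipped to the ends of the chain.
            s_next = min(max(s + moves[a], 0), goal)
            r = 1.0 if s_next == goal else 0.0

            # One-step update: move Q(s, a) toward the bootstrapped target.
            # q[goal] is never updated, so the terminal value stays 0.
            target = r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q


if __name__ == "__main__":
    for state, values in enumerate(q_learning()):
        print(state, values)
```

In this toy setting, each update touches a single state-action entry, which is why the per-step computational cost stays small. With the assumed settings, the "right" action should end up with the higher value in every non-terminal state.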

BibTeX key: Watkins1992
entry type: article
year: 1992
month: may
day: 01
journal: Machine Learning
number: 3
pages: 279--292
volume: 8
issn: 1573-0565
DOI: 10.1007/BF00992698
url: https://doi.org/10.1007/BF00992698
