Machine Learning 8 (3): 279–292 (May 1, 1992)

Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states.
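
The abstract describes the method only in words. As a rough illustration of "successively improving evaluations of the quality of particular actions at particular states", here is a minimal sketch of the commonly stated one-step tabular update. The function name `q_update`, the state and action labels, and the values of the learning rate `alpha` and discount factor `gamma` are assumptions chosen for illustration and do not come from this record.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning step to the table `q` in place.

    `q` maps (state, action) pairs to estimated action values.
    `alpha` (learning rate) and `gamma` (discount factor) are
    illustrative defaults, not values taken from the paper.
    """
    # Value of the best action currently known in the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    # One-step target: immediate reward plus discounted best next value.
    target = reward + gamma * best_next
    # Move the old estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


# Hypothetical two-state, two-action example; unseen entries start at 0.0.
q = defaultdict(float)
actions = ["left", "right"]
q_update(q, "s0", "right", 1.0, "s1", actions)
q_update(q, "s1", "left", 0.0, "s0", actions)
```

Each call touches a single table entry, which is one way to read the abstract's remark that the method is incremental and imposes limited computational demands.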