copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Generalized Markov Decision Processes: Dynamic-programming and reinforcement-learning algorithms

{. Szepesvári, and M. Littman. CS-96-11. Brown University, Department of Computer Science, Providence, RI, (November 1996)

Abstract

Reinforcement learning is the process by which an autonomous agent uses its experience interacting with an environment to improve its behavior. The Markov decision process (MDP) model is a popular way of formalizing the reinforcement-learning problem, but it is by no means the only way. In this paper, we show how many of the important theoretical results concerning reinforcement learning in MDPs extend to a generalized MDP model that includes MDPs, two-player games and MDPs under a worst-case optimality criterion as special cases. The basis of this extension is a stochastic-approximation theorem that reduces asynchronous convergence to synchronous convergence.

Links and resources

BibTeX key: szepesvari1996h
entry type: techreport
address: Providence, RI
year: 1996
month: November
institution: Brown University, Department of Computer Science
number: CS-96-11
pdf: papers/gmdp.ps.pdf
date-modified: 2010-09-02 13:09:16 -0600
date-added: 2010-08-28 17:38:14 -0600

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Generalized Markov Decision Processes: Dynamic-programming and reinforcement-learning algorithms

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Generalized Markov Decision Processes: Dynamic-programming and reinforcement-learning algorithms

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Generalized Markov Decision Processes: Dynamic-programming and reinforcement-learning algorithms

Comments and Reviews
(0)