We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-iteration with penalized (or regularized) least-squares regression as the regression subroutine to address the problem of controlling model-complexity. The algorithm is presented in detail for the case when the function space is a reproducing kernel Hilbert space underlying a user-chosen kernel function. We derive bounds on the quality of the solution and argue that data-dependent penalties can lead to almost optimal performance. A simple example is used to illustrate the benefits of using a penalized procedure.
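For readers who want a concrete picture of the procedure sketched in the abstract, below is a minimal Python illustration of regularized (penalized) fitted Q-iteration, with kernel ridge regression in the RKHS of a Gaussian kernel as the regression subroutine. This is a sketch under simplifying assumptions, not the authors' implementation: the generative-model helper sample_transitions, the class KernelRidgeQ, the fixed penalty coefficient reg_coef, and all hyper-parameter values are hypothetical, and the data-dependent choice of the penalty analysed in the paper is not reproduced here.

import numpy as np

def rbf_kernel(X, Y, bandwidth=1.0):
    # Gaussian (RBF) kernel matrix between the rows of X and the rows of Y.
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-d2 / (2.0 * bandwidth ** 2))

class KernelRidgeQ:
    # Penalized least-squares fit of a Q-function in the RKHS of an RBF kernel.
    def __init__(self, reg_coef=1e-2, bandwidth=1.0):
        self.reg_coef = reg_coef
        self.bandwidth = bandwidth
        self.X = None
        self.alpha = None

    def fit(self, X, y):
        # Kernel ridge regression: alpha = (K + n * lambda * I)^{-1} y.
        n = X.shape[0]
        K = rbf_kernel(X, X, self.bandwidth)
        self.alpha = np.linalg.solve(K + n * self.reg_coef * np.eye(n), y)
        self.X = X
        return self

    def predict(self, X_new):
        if self.alpha is None:                     # before the first fit, Q is identically 0
            return np.zeros(X_new.shape[0])
        return rbf_kernel(X_new, self.X, self.bandwidth) @ self.alpha

def fitted_q_iteration(sample_transitions, actions, n_samples=500,
                       n_iterations=50, gamma=0.95):
    # Regularized fitted Q-iteration with a generative model.
    # sample_transitions(n) is assumed (hypothetically) to return arrays
    # (states, chosen_actions, rewards, next_states) of n transitions drawn
    # with the generative model of the MDP.
    q = KernelRidgeQ()
    states, acts, rewards, next_states = sample_transitions(n_samples)
    X = np.column_stack([states, acts])            # regressor input: (state, action)
    for _ in range(n_iterations):
        # Bellman targets: r + gamma * max over a' of Q_k(s', a').
        next_q = np.column_stack([
            q.predict(np.column_stack([next_states, np.full(len(next_states), a)]))
            for a in actions
        ])
        y = rewards + gamma * next_q.max(axis=1)
        q = KernelRidgeQ().fit(X, y)               # penalized regression step
    return q

# Example use with a hypothetical toy MDP (illustrative only):
#   q = fitted_q_iteration(my_generative_model, actions=[-1.0, +1.0])
#   greedy action at state s: argmax over a of q.predict(np.array([[s, a]]))

Reusing a single sample batch across iterations and fixing the penalty coefficient are simplifications of the setup; the paper analyses the penalized fitting step per iteration and argues that data-dependent penalties can lead to almost optimal performance.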
@inproceedings{farahmand2008,
author = {Farahmand, A.{m}. and Ghavamzadeh, M. and Szepesv{\'a}ri, {Cs}. and Mannor, S.},
booktitle = {EWRL},
crossref = {EWRL2008},
doi = {10.1007/978-3-540-89722-4_5},
keywords = {approximation, function iteration learning, nonparametrics, planning, regularization, reinforcement theory, value},
pages = {55--68},
pdf = {papers/RegFQI-Plan-EWRL08.pdf},
title = {Regularized Fitted {Q}-Iteration: Application to Planning},
year = 2008
}