Inproceedings

Agnostic KWIK learning and efficient approximate reinforcement learning

István Szita and Csaba Szepesvári.
COLT, pages 739--772. (July 2011)

Abstract

A popular approach in reinforcement learning is to use a model-based algorithm, i.e., an algorithm that utilizes a model learner to learn an approximate model of the environment. It has been shown that such a model-based learner is efficient if the model learner is efficient in the so-called ``knows what it knows'' (KWIK) framework. A major limitation of the standard KWIK framework is that, by its very definition, it covers only the case when the (model) learner can represent the actual environment with no errors. In this paper, we introduce the agnostic KWIK learning model, where we relax this assumption by allowing nonzero approximation errors. We show that, under the new definition, an efficient model learner still leads to an efficient reinforcement learning algorithm. At the same time, though, we find that learning within the new framework can be substantially slower than in the standard framework, even for simple learning problems.
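
The abstract refers to the KWIK (``knows what it knows'') framework, in which, at each step, the learner must either return an accurate prediction or admit "I don't know" and only then observe the label. As an illustration, here is a minimal Python sketch of that protocol using the standard enumeration learner over a finite hypothesis class; the function name `enumeration_kwik_learner`, the `eps` tolerance, and the toy threshold example are illustrative choices, not taken from the paper (which studies the agnostic setting, where no hypothesis in the class may fit the environment exactly).

```python
def enumeration_kwik_learner(hypotheses, stream, eps=0.0):
    """Sketch of the KWIK protocol with the enumeration learner.

    On each input x, either commit to a prediction that all still-consistent
    hypotheses (approximately) agree on, or output None ("I don't know") and
    only then use the observed label to shrink the version space. KWIK
    efficiency means the number of None answers is polynomially bounded.
    """
    version_space = list(hypotheses)  # hypotheses consistent with observations so far
    for x, true_label in stream:      # labels are consulted only after a None answer
        predictions = [h(x) for h in version_space]
        if max(predictions) - min(predictions) <= eps:
            yield x, predictions[0]   # safe to predict: all consistent hypotheses agree
        else:
            yield x, None             # "I don't know": request the label
            version_space = [h for h in version_space
                             if abs(h(x) - true_label) <= eps]


# Toy usage: learning an unknown threshold on {0,...,9} from a finite class.
hypotheses = [lambda x, t=t: float(x >= t) for t in range(10)]
data = [(x, float(x >= 6)) for x in [2, 8, 5, 6, 7, 3]]
for x, pred in enumeration_kwik_learner(hypotheses, data):
    print(x, "I don't know" if pred is None else pred)
```

In the realizable (standard KWIK) case the version space never becomes empty; the agnostic relaxation discussed in the paper is precisely about what can be guaranteed when every hypothesis has some nonzero approximation error.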

Users

  • @csaba
