Inproceedings,

Pseudo-MDPs and Factored Linear Action Models

H. Yao, C. Szepesvári, B. Pires, and X. Zhang.
IEEE ADPRL, pages 189--197. (October 2014)

Abstract

In this paper we introduce the concept of pseudo-MDPs to develop abstractions. Pseudo-MDPs relax the requirement that the transition kernel be a probability kernel. We show that the new framework captures many existing abstractions. We also introduce the concept of factored linear action models, a special case, and discuss their relation to existing work. We use the general framework to develop a theory for bounding the suboptimality of policies derived from pseudo-MDPs. Specializing the framework, we recover existing results. We give a least-squares approach and a constrained optimization approach to learning the factored linear model, as well as efficient computation methods. We demonstrate that the constrained optimization approach gives better performance than the least-squares approach with normalization.

BibTeX key: YaoSze14
entry type: inproceedings
booktitle: IEEE ADPRL
year: 2014
month: October
pages: 189--197
pdf: papers/ieee_adprl2014.pdf
date-modified: 2016-05-09 08:36:59 +0000
date-added: 2014-10-11 19:26:42 -0600
