Inproceedings,

Multi-criteria Reinforcement Learning

Z. Gábor, {. Kalmár, and {. Szepesvári.
ICML, (1998)Revised on 05/04/2004.

Abstract

We consider multi-criteria sequential decision making problems where the vector-valued evaluations are compared by a given, fixed total ordering. Conditions for the optimality of stationary policies and the Bellman optimality equation are given for a special, but important class of problems when the evaluation of policies can be computed for the criteria independently of each other. The analysis requires special care as the topology introduced by pointwise convergence and the order-topology introduced by the preference order are in general incompatible. Reinforcement learning algorithms are proposed and analyzed. Preliminary computer experiments confirm the validity of the derived algorithms. These type of multi-criteria problems are most useful when there are several optimal solutions to a problem and one wants to choose the one among these which is optimal according to another fixed criterion. Possible application in robotics and repeated games are outlined.

BibTeX key: z.gabor1998
entry type: inproceedings
booktitle: ICML
year: 1998
owner: Beata
pdf: papers/multi98.ps.pdf
date-modified: 2010-09-02 13:09:16 -0600
note: Revised on 05/04/2004

BibSonomy

Multi-criteria Reinforcement Learning

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on