Constructing Temporal Abstractions Autonomously in Reinforcement Learning

Abstract

The idea of temporal abstraction, i.e. learning, planning and representing the world at multiple time scales, has been a constant thread in AI research, spanning sub-fields from classical planning and search to control and reinforcement learning. For example, programming a robot typically involves making decisions over a set of controllers, rather than working at the level of motor torques. While temporal abstraction is a very natural concept, learning such abstractions with no human input has proved quite daunting. In this paper, we present a general architecture, called option-critic, which allows learning temporal abstractions automatically, end-to-end, simply from the agent's experience. This approach allows continual learning and provides interesting qualitative and quantitative results in several tasks.

BibTeX key: BaconPrecup18aimag
entry type: article
year: 2018
month: March
journal: AI Magazine
number: 1
pages: 39--50
volume: 39
groups: public
file: AAAI online:2018/BaconPrecup18aimag.pdf:PDF
issn: 0738-4602
DOI: 10.1609/aimag.v39i1.2780
username: flint63
