копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Constructing Temporal Abstractions Autonomously in Reinforcement Learning

P. Bacon, и D. Precup. AI Magazine, 39 (1): 39--50 (марта 2018)
DOI: 10.1609/aimag.v39i1.2780

Аннотация

The idea of temporal abstraction, i.e. learning, planning and representing the world at multiple time scales, has been a constant thread in AI research, spanning sub-fields from classical planning and search to control and reinforcement learning. For example, programming a robot typically involves making decisions over a set of controllers, rather than working at the level of motor torques. While temporal abstraction is a very natural concept, learning such abstractions with no human input has proved quite daunting. In this paper, we present a general architecture, called option-critic, which allows learning temporal abstractions automatically, end-to-end, simply from the agent's experience. This approach allows continual learning and provides interesting qualitative and quantitative results in several tasks.

Линки и ресурсы

ключ BibTeX

BaconPrecup18aimag

тип записи

article

год

2018

месяц

#mar#

журнал

AI Magazine

номер

страницы

39--50

том

groups

public

file

AAAI online:2018/BaconPrecup18aimag.pdf:PDF

issn

0738-4602

DOI

10.1609/aimag.v39i1.2780

username

flint63

дополнительные URL-адреса

AAAI Page

тэги

Цитировать эту публикацию

искать в

Метаданные

Последнее изменение 7 лет назад
Создан 7 лет назад

Комментарии и рецензии
(0)

Комментарии, или рецензии отсутствуют. Вы можете их написать!