Sample Efficient Actor-Critic with Experience Replay.

Abstract

This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several continuous control problems. To achieve this, the paper introduces several innovations, including truncated importance sampling with bias correction, stochastic dueling network architectures, and a new trust region policy optimization method.

BibTeX key: wang2016acer
entry type: article
year: 2016
journal: CoRR
volume: abs/1611.01224
ee: http://arxiv.org/abs/1611.01224
url: http://dblp.uni-trier.de/db/journals/corr/corr1611.html#WangBHMMKF16

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

BibSonomy

Sample Efficient Actor-Critic with Experience Replay.

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on