Abstract
Continuous reinforcement learning methods such as DDPG and A3C are widely used
in robot control and autonomous driving. However, both methods have theoretical
weaknesses: DDPG cannot control the noise in the control process, and A3C does
not satisfy the continuity conditions under a Gaussian policy. To address
these concerns, we propose a new continuous reinforcement learning method based
on stochastic differential equations, which we call Incremental Reinforcement
Learning (IRL). This method not only guarantees the continuity of actions
within any time interval, but also controls the variance of actions during
training. In addition, our method does not assume Markov control in agents'
action control, and it allows agents to predict scene changes when selecting
actions. With our method, agents no longer passively adapt to the environment;
instead, they actively interact with the environment to maximize rewards.
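To make the contrast with i.i.d. Gaussian policy noise concrete, here is a minimal illustrative sketch (not the paper's actual algorithm, whose SDE and parameters are not given in the abstract): an action signal driven by an Ornstein-Uhlenbeck SDE, dX = θ(μ − X) dt + σ dW, discretized with Euler-Maruyama. Consecutive actions differ by O(√dt), so the control signal is continuous in time, and the long-run variance is bounded by σ²/(2θ), so it can be controlled through the SDE coefficients. All parameter values below are arbitrary choices for illustration.

```python
import math
import random

def simulate_ou(theta=2.0, mu=0.0, sigma=0.3, dt=1e-3,
                steps=20000, x0=0.0, seed=0):
    """Euler-Maruyama simulation of dX = theta*(mu - X) dt + sigma dW."""
    rng = random.Random(seed)
    x = x0
    path = [x]
    for _ in range(steps):
        dw = rng.gauss(0.0, math.sqrt(dt))       # Brownian increment
        x += theta * (mu - x) * dt + sigma * dw  # Euler-Maruyama step
        path.append(x)
    return path

path = simulate_ou()

# Continuity: successive actions change by O(sqrt(dt)), never by large jumps.
max_jump = max(abs(b - a) for a, b in zip(path, path[1:]))

# Variance control: the empirical variance of the tail of the trajectory
# stays near the stationary value sigma^2 / (2*theta) = 0.0225 here.
tail = path[len(path) // 2:]
mean = sum(tail) / len(tail)
var = sum((v - mean) ** 2 for v in tail) / len(tail)
```

Under a plain Gaussian policy, by contrast, each action is drawn independently, so the action sequence has jump discontinuities of the order of the policy's standard deviation regardless of how small the time step is.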