This article aims to provide a concise yet comprehensive introduction to one of the most important class of control algorithms in Reinforcement Learning - Policy Gradients. I will discuss these…
Y. Zhao, I. Borovikov, J. Rupert, C. Somers, und A. Beirami. (2019)cite arxiv:1906.10124Comment: Presented at ICML 2019 Workshop on Imitation, Intent, and Interaction (I3). arXiv admin note: substantial text overlap with arXiv:1903.10545.
Y. Li, H. Chang, Y. Lin, P. Wu, und Y. Wang. 2018 25th IEEE International Conference on Image Processing (ICIP), Seite 3778-3782. (Oktober 2018)cite arxiv:1805.02070Comment: ICIP 2018.
R. Sutton, D. McAllester, S. Singh, und Y. Mansour. Proceedings of the 12th International Conference on Neural Information Processing Systems, Seite 1057--1063. Cambridge, MA, USA, MIT Press, (1999)