This article aims to provide a concise yet comprehensive introduction to one of the most important class of control algorithms in Reinforcement Learning - Policy Gradients. I will discuss these…
J. Tritscher, D. Schlör, F. Gwinner, A. Krause, and A. Hotho. Machine Learning and Principles and Practice of Knowledge Discovery in Databases. ECML PKDD 2022, Communications in Computer and Information Science (1753):
79-96(2022)