This article aims to provide a concise yet comprehensive introduction to one of the most important class of control algorithms in Reinforcement Learning - Policy Gradients. I will discuss these…
Y. Su, R. Zhang, S. Erfani, and J. Gan. Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
, ACM, (July 2021)