This article aims to provide a concise yet comprehensive introduction to one of the most important class of control algorithms in Reinforcement Learning - Policy Gradients. I will discuss these…
D. Benchert, S. Meßlinger, S. Goller, J. Kaiser, J. Pfister, and A. Hotho. Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), page 1623--1638. Vienna, Austria, Association for Computational Linguistics, (July 2025)
T. Völker, J. Pfister, and A. Hotho. Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), page 852--864. Vienna, Austria, Association for Computational Linguistics, (July 2025)
J. Pfister, T. Völker, A. Vlasjuk, and A. Hotho. Proceedings of the 1st Joint Workshop on Large Language Models and Structure Modeling (XLLM 2025), page 115--128. Vienna, Austria, Association for Computational Linguistics, (August 2025)
J. Pfister, J. Wunderle, and A. Hotho. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), page 2227--2246. Vienna, Austria, Association for Computational Linguistics, (July 2025)
K. Kobs, T. Koopmann, A. Zehe, D. Fernes, P. Krop, and A. Hotho. Findings of the Association for Computational Linguistics: EMNLP 2020, page 878--883. Online, Association for Computational Linguistics, (November 2020)
T. Liang, C. Jin, L. Wang, W. Fan, C. Xia, K. Chen, and Y. Yin. Findings of the Association for Computational Linguistics: ACL 2024, page 8926--8939. Bangkok, Thailand, Association for Computational Linguistics, (August 2024)