From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards.

U. Siddique, P. Weng, и M. Zimmer. ICML, том 119 из Proceedings of Machine Learning Research, стр. 8905-8915. PMLR, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Matthieu Zimmer

Friedrich Matthieu

Matthieu Saubanère

Matthieu Hammer

Matthieu Felsinger

Другие публикации лиц с тем же именем

Invariant Transform Experience Replay.Y. Lin, J. Huang, M. Zimmer, J. Rojas, и P. Weng. CoRR, (2019)Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning.M. Zimmer, U. Siddique, и P. Weng. CoRR, (2020)Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains.M. Zimmer, и P. Weng. IJCAI, стр. 4496-4502. ijcai.org, (2019)Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning.M. Zimmer, C. Glanois, U. Siddique, и P. Weng. ICML, том 139 из Proceedings of Machine Learning Research, стр. 12967-12978. PMLR, (2021)Neuro-Symbolic Hierarchical Rule Induction.C. Glanois, Z. Jiang, X. Feng, P. Weng, M. Zimmer, D. Li, W. Liu, и J. Hao. ICML, том 162 из Proceedings of Machine Learning Research, стр. 7583-7615. PMLR, (2022)Hyperparameter Auto-tuning in Self-Supervised Robotic Learning.J. Huang, J. Rojas, M. Zimmer, H. Wu, Y. Guan, и P. Weng. CoRR, (2020)Neuro-Symbolic Hierarchical Rule Induction.C. Glanois, X. Feng, Z. Jiang, P. Weng, M. Zimmer, D. Li, и W. Liu. CoRR, (2021)Lightweight Structural Choices Operator for Technology Mapping.A. Grosnit, M. Zimmer, R. Tutunov, X. Li, L. Chen, F. Yang, M. Yuan, и H. Bou-Ammar. DAC, стр. 1-6. IEEE, (2023)Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis.P. Gorinski, M. Zimmer, G. Lampouras, D. Deik, и I. Iacobacci. EMNLP (Findings), стр. 370-384. Association for Computational Linguistics, (2023)Differentiable Logic Machines.M. Zimmer, X. Feng, C. Glanois, Z. Jiang, J. Zhang, P. Weng, J. Hao, D. Li, и W. Liu. CoRR, (2021)

BibSonomy

Disambiguation