From post

Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards.

, , и . ICML, том 119 из Proceedings of Machine Learning Research, стр. 8905-8915. PMLR, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Invariant Transform Experience Replay., , , , и . CoRR, (2019)Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning., , и . CoRR, (2020)Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains., и . IJCAI, стр. 4496-4502. ijcai.org, (2019)Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning., , , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 12967-12978. PMLR, (2021)Neuro-Symbolic Hierarchical Rule Induction., , , , , , , и . ICML, том 162 из Proceedings of Machine Learning Research, стр. 7583-7615. PMLR, (2022)Hyperparameter Auto-tuning in Self-Supervised Robotic Learning., , , , , и . CoRR, (2020)Neuro-Symbolic Hierarchical Rule Induction., , , , , , и . CoRR, (2021)Lightweight Structural Choices Operator for Technology Mapping., , , , , , , и . DAC, стр. 1-6. IEEE, (2023)Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis., , , , и . EMNLP (Findings), стр. 370-384. Association for Computational Linguistics, (2023)Differentiable Logic Machines., , , , , , , , и . CoRR, (2021)