From post

Crossprop: Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks.

, , и . ECML/PKDD (1), том 10534 из Lecture Notes in Computer Science, стр. 445-459. Springer, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search., , и . CoRR, (2018)Learning Expected Emphatic Traces for Deep RL., , , , и . CoRR, (2021)mlpack 3: a fast, flexible machine learning library., , , , , и . J. Open Source Softw., 3 (26): 726 (2018)Deep Residual Reinforcement Learning (Extended Abstract)., , и . IJCAI, стр. 4869-4873. ijcai.org, (2021)A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms., , , , и . AAMAS, стр. 1491-1499. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), (2022)Average-Reward Off-Policy Policy Evaluation with Function Approximation., , , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 12578-12588. PMLR, (2021)Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards., , , , , и . CoRR, (2019)A Deeper Look at Experience Replay., и . CoRR, (2017)A Deep Neural Network for Modeling Music., , , , , , , и . ICMR, стр. 379-386. ACM, (2015)Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards., , , , , , и . AAAI, стр. 5826-5833. AAAI Press, (2020)