Author of the publication

Meta-learning of Exploration/Exploitation Strategies: The Multi-armed Bandit Case.

, , and . ICAART (Revised Selected Papers), volume 358 of Communications in Computer and Information Science, page 100-115. Springer, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Reinforcement Learning with Raw Image Pixels as Input State., , and . IWICPAS, volume 4153 of Lecture Notes in Computer Science, page 446-454. Springer, (2006)Clinical data based optimal STI strategies for HIV: a reinforcement learning approach., , , and . CDC, page 667-672. IEEE, (2006)Optimal discovery with probabilistic expert advice., , and . CDC, page 6808-6812. IEEE, (2012)Meta-learning of Exploration/Exploitation Strategies: The Multi-armed Bandit Case., , and . ICAART (Revised Selected Papers), volume 358 of Communications in Computer and Information Science, page 100-115. Springer, (2012)Impacts of spatial and temporal resolutions on the near-optimal spaces of energy system optimisation models., and . ISGT EUROPE, page 1-5. IEEE, (2023)Imitative Learning for Online Planning in Microgrids., , , , and . DARE, volume 9518 of Lecture Notes in Computer Science, page 1-15. Springer, (2015)Assessing the Economic Value of Renewable Resource Complementarity for Power Systems: an ENTSO-E Study., , , , , , , and . CoRR, (2020)Recurrent networks, hidden states and beliefs in partially observable environments., , and . CoRR, (2022)Warming-up recurrent neural networks to maximize reachable multi-stability greatly improves learning., , and . CoRR, (2021)On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract)., , , , and . IJCAI, page 5055-5059. ijcai.org, (2020)Journal track.