Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Meta-learning of Exploration/Exploitation Strategies: The Multi-armed Bandit Case.

F. Maes, L. Wehenkel, and D. Ernst. ICAART (Revised Selected Papers), volume 358 of Communications in Computer and Information Science, page 100-115. Springer, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Damien Guironnet

Catalytic polymerization of acrylates and in supercritical carbon dioxideD. Guironnet. Uni Konstanz, (2009)

Damien Neyret

Damien Ciabrini

Damien Cassou

Damien Stehlé

Other publications of authors with the same name

Reinforcement Learning with Raw Image Pixels as Input State.D. Ernst, R. Marée, and L. Wehenkel. IWICPAS, volume 4153 of Lecture Notes in Computer Science, page 446-454. Springer, (2006)Clinical data based optimal STI strategies for HIV: a reinforcement learning approach.D. Ernst, G. Stan, J. Gonçalves, and L. Wehenkel. CDC, page 667-672. IEEE, (2006)Optimal discovery with probabilistic expert advice.S. Bubeck, D. Ernst, and A. Garivier. CDC, page 6808-6812. IEEE, (2012)Meta-learning of Exploration/Exploitation Strategies: The Multi-armed Bandit Case.F. Maes, L. Wehenkel, and D. Ernst. ICAART (Revised Selected Papers), volume 358 of Communications in Computer and Information Science, page 100-115. Springer, (2012)Impacts of spatial and temporal resolutions on the near-optimal spaces of energy system optimisation models.A. Dubois, and D. Ernst. ISGT EUROPE, page 1-5. IEEE, (2023)Imitative Learning for Online Planning in Microgrids.S. Aittahar, V. François-Lavet, S. Lodeweyckx, D. Ernst, and R. Fonteneau. DARE, volume 9518 of Lecture Notes in Computer Science, page 1-15. Springer, (2015)Assessing the Economic Value of Renewable Resource Complementarity for Power Systems: an ENTSO-E Study.D. Radu, M. Berger, A. Dubois, R. Fonteneau, H. Pandzic, Y. Dvorkin, Q. Louveaux, and D. Ernst. CoRR, (2020)Recurrent networks, hidden states and beliefs in partially observable environments.G. Lambrechts, A. Bolland, and D. Ernst. CoRR, (2022)Warming-up recurrent neural networks to maximize reachable multi-stability greatly improves learning.N. Vecoven, D. Ernst, and G. Drion. CoRR, (2021)On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract).V. François-Lavet, G. Rabusseau, J. Pineau, D. Ernst, and R. Fonteneau. IJCAI, page 5055-5059. ijcai.org, (2020)Journal track.

BibSonomy

Disambiguation of "Ernst, Damien"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Meta-learning of Exploration/Exploitation Strategies: The Multi-armed Bandit Case.

Please choose a person to relate this publication to

Damien Guironnet

Damien Neyret

Damien Ciabrini

Damien Cassou

Damien Stehlé

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Ernst, Damien"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Meta-learning of Exploration/Exploitation Strategies: The Multi-armed Bandit Case.

Please choose a person to relate this publication to

Damien Guironnet

Damien Neyret

Damien Ciabrini

Damien Cassou

Damien Stehlé

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Meta-learning of Exploration/Exploitation Strategies: The Multi-armed Bandit Case.