Author of the publication

Batch mode reinforcement learning based on the synthesis of artificial trajectories.

R. Fonteneau, S. Murphy, L. Wehenkel, and D. Ernst. Ann. Oper. Res., 208 (1): 383-416 (2013)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Raphael Paschke

Banking regulation with value-at-risk. R. Paschke. Uni Mannheim, (2009)

Raphael Feinberg

Willy Raphael

Ernst Raphael

Raphael Leonhardt

Other publications of authors with the same name

On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract). V. François-Lavet, G. Rabusseau, J. Pineau, D. Ernst, and R. Fonteneau. IJCAI, page 5055-5059. ijcai.org, (2020) Journal track.

Imitative Learning for Online Planning in Microgrids. S. Aittahar, V. François-Lavet, S. Lodeweyckx, D. Ernst, and R. Fonteneau. DARE, volume 9518 of Lecture Notes in Computer Science, page 1-15. Springer, (2015)

Assessing the Economic Value of Renewable Resource Complementarity for Power Systems: an ENTSO-E Study. D. Radu, M. Berger, A. Dubois, R. Fonteneau, H. Pandzic, Y. Dvorkin, Q. Louveaux, and D. Ernst. CoRR, (2020)

On overfitting and asymptotic bias in batch reinforcement learning with partial observability. V. François-Lavet, D. Ernst, and R. Fonteneau. CoRR, (2017)

A Gaussian mixture approach to model stochastic processes in power systems. Q. Gemine, B. Cornélusse, M. Glavic, R. Fonteneau, and D. Ernst. PSCC, page 1-7. IEEE, (2016)

Active exploration by searching for experiments that falsify the computed control policy. R. Fonteneau, S. Murphy, L. Wehenkel, and D. Ernst. ADPRL, page 40-47. IEEE, (2011)

How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies. V. François-Lavet, R. Fonteneau, and D. Ernst. CoRR, (2015)

Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device. V. François-Lavet, R. Fonteneau, and D. Ernst. ADPRL, page 1-8. IEEE, (2014)

Optimistic planning for belief-augmented Markov Decision Processes. R. Fonteneau, L. Busoniu, and R. Munos. ADPRL, page 77-84. IEEE, (2013)

Batch mode reinforcement learning based on the synthesis of artificial trajectories. R. Fonteneau, S. Murphy, L. Wehenkel, and D. Ernst. Ann. Oper. Res., 208 (1): 383-416 (2013)

BibSonomy

Disambiguation of "Fonteneau, Raphael"
