Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Fast Reinforcement Learning with Large Action Sets using Error-Correcting Output Codes for MDP Factorization

G. Dulac-Arnold, L. Denoyer, P. Preux, and P. Gallinari. CoRR, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Philippe Destouches

Philippe Staib

Philippe Kersting

Philippe Dreuw

Philippe Zysset

Other publications of authors with the same name

Preference-Like Score to Cope with Cold-Start User in Recommender Systems.C. Felício, K. Paixao, C. Barcelos, and P. Preux. ICTAI, page 62-69. IEEE Computer Society, (2016)A Multi-Armed Bandit Model Selection for Cold-Start User Recommendation.C. Felício, K. Paixão, C. Barcelos, and P. Preux. UMAP, page 32-40. ACM, (2017)A generic architecture for adaptive agents based on reinforcement learning.P. Preux, S. Delepoulle, and J. Darcheville. Inf. Sci., 161 (1-2): 37-55 (2004)Improving offline evaluation of contextual bandit algorithms via bootstrapping techniques.J. Mary, P. Preux, and O. Nicol. ICML, volume 32 of JMLR Workshop and Conference Proceedings, page 172-180. JMLR.org, (2014)Consistent Algorithms for Clustering Time Series.A. Khaleghi, D. Ryabko, J. Mary, and P. Preux. J. Mach. Learn. Res., (2016)"I'm sorry Dave, I'm afraid I can't do that" Deep Q-learning from forbidden action.M. Seurin, P. Preux, and O. Pietquin. CoRR, (2019)Reinforcement learning for crop management support: Review, prospects and challenges.R. Gautron, O. Maillard, P. Preux, M. Corbeels, and R. Sabbadin. Comput. Electron. Agric., (2022)Soft Action Priors: Towards Robust Policy Transfer.M. Centa, and P. Preux. CoRR, (2022)gym-DSSAT: a crop model turned into a Reinforcement Learning environment.R. Gautron, E. Padrón, P. Preux, J. Bigot, O. Maillard, and D. Emukpere. CoRR, (2022)General System Architecture and COTS Prototyping of an AIoT-Enabled Sailboat for Autonomous Aquatic Ecosystem Monitoring.A. de Araújo, D. Daniel, R. Guerra, D. Brandão, E. Vasconcellos, Á. de Negreiros, E. Clua, L. Gonçalves, and P. Preux. IEEE Internet Things J., 11 (3): 3801-3811 (February 2024)

BibSonomy

Disambiguation of "Preux, Philippe"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Fast Reinforcement Learning with Large Action Sets using Error-Correcting Output Codes for MDP Factorization

Please choose a person to relate this publication to

Philippe Destouches

Philippe Staib

Philippe Kersting

Philippe Dreuw

Philippe Zysset

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Preux, Philippe"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Fast Reinforcement Learning with Large Action Sets using Error-Correcting Output Codes for MDP Factorization

Please choose a person to relate this publication to

Philippe Destouches

Philippe Staib

Philippe Kersting

Philippe Dreuw

Philippe Zysset

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Fast Reinforcement Learning with Large Action Sets using Error-Correcting Output Codes for MDP Factorization