Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration.

M. Papini, A. Battistello, and M. Restelli. AISTATS, volume 108 of Proceedings of Machine Learning Research, page 1188-1199. PMLR, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Alfredo Marcello Marcello

Guglielmo Restelli

Marcello Andrea

Marcello Mariucci

Marcello Bisotti

Other publications of authors with the same name

Importance Weighted Transfer of Samples in Reinforcement Learning.A. Tirinzoni, A. Sessa, M. Pirotta, and M. Restelli. ICML, volume 80 of Proceedings of Machine Learning Research, page 4943-4952. PMLR, (2018)A Probabilistic Framework for Weighting Different Sensor Data in MUREA.M. Restelli, D. Sorrenti, and F. Marchese. RoboCup, volume 3020 of Lecture Notes in Computer Science, page 678-685. Springer, (2003)Filling the Gap among Coordination, Planning, and Reaction Using a Fuzzy Cognitive Model.A. Bonarini, M. Matteucci, and M. Restelli. RoboCup, volume 3020 of Lecture Notes in Computer Science, page 662-669. Springer, (2003)A Framework for Robust Sensing in Multi-agent Systems.A. Bonarini, M. Matteucci, and M. Restelli. RoboCup, volume 2377 of Lecture Notes in Computer Science, page 287-292. Springer, (2001)Tree-based Fitted Q-iteration for Multi-Objective Markov Decision problems.A. Castelletti, F. Pianosi, and M. Restelli. IJCNN, page 1-8. IEEE, (2012)Estimating Maximum Expected Value through Gaussian Approximation.C. D'Eramo, M. Restelli, and A. Nuara. ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 1032-1040. JMLR.org, (2016)Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game.A. Lazaric, E. de Cote, F. Dercole, and M. Restelli. Adaptive Agents and Multi-Agents Systems, volume 4865 of Lecture Notes in Computer Science, page 129-144. Springer, (2007)Inverse Reinforcement Learning with Sub-optimal Experts.R. Poiani, G. Curti, A. Metelli, and M. Restelli. CoRR, (2024)A Practical Guide to Multi-Objective Reinforcement Learning and Planning.C. Hayes, R. Radulescu, E. Bargiacchi, J. Källström, M. Macfarlane, M. Reymond, T. Verstraeten, L. Zintgraf, R. Dazeley, F. Heintz and 8 other author(s). CoRR, (2021)Simultaneously Updating All Persistence Values in Reinforcement Learning.L. Sabbioni, L. Daire, L. Bisi, A. Metelli, and M. Restelli. CoRR, (2022)

BibSonomy

Disambiguation of "Restelli, Marcello"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration.

Please choose a person to relate this publication to

Alfredo Marcello Marcello

Guglielmo Restelli

Marcello Andrea

Marcello Mariucci

Marcello Bisotti

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Restelli, Marcello"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration.

Please choose a person to relate this publication to

Alfredo Marcello Marcello

Guglielmo Restelli

Marcello Andrea

Marcello Mariucci

Marcello Bisotti

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Balancing Learning Speed and Stability in Policy Gradient via Adaptive Exploration.