Multi-agent Learning and the Reinforcement Gradient
M. Kaisers and K. Tuyls. Proc. of the 9th European Workshop on Multi-agent Systems (EUMAS 2011), Maastricht University, 2011
Abstract
The number of proposed reinforcement learning algorithms appears to be ever-growing. This article tackles the diversification by showing a persistent principle in several independent reinforcement learning algorithms that have been applied to multi-agent settings. While their learning structure may look very diverse, algorithms such as Gradient Ascent, Cross learning, variations of Q-learning and Regret minimization all follow the same basic pattern. Variations of Gradient Ascent can be described by the projection dynamics and the other algorithms follow the replicator dynamics. In combination with some modulations of the learning rate and deviations for the sake of exploration, they are primarily different implementations of learning in the direction of the reinforcement gradient.
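The replicator dynamics and projection dynamics named in the abstract are standard objects from evolutionary game theory. As background (this formula is standard and not quoted from the paper), the two-player replicator dynamics for a learner with mixed strategy $x$ playing against an opponent with mixed strategy $y$ under payoff matrix $A$ read:

```latex
\dot{x}_i = x_i \left[ (A y)_i - x^{\top} A y \right]
```

That is, the probability of action $i$ grows in proportion to how much its expected payoff $(Ay)_i$ exceeds the current average payoff $x^{\top} A y$, which is the sense in which the algorithms discussed are said to follow the reinforcement gradient.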
@inproceedings{Kaisers2011,
abstract = {The number of proposed reinforcement learning algorithms appears to be ever-growing. This article tackles the diversification by showing a persistent principle in several independent reinforcement learning algorithms that have been applied to multi-agent settings. While their learning structure may look very diverse, algorithms such as Gradient Ascent, Cross learning, variations of Q-learning and Regret minimization all follow the same basic pattern. Variations of Gradient Ascent can be described by the projection dynamics and the other algorithms follow the replicator dynamics. In combination with some modulations of the learning rate and deviations for the sake of exploration, they are primarily different implementations of learning in the direction of the reinforcement gradient.},
author = {Kaisers, Michael and Tuyls, Karl},
biburl = {https://www.bibsonomy.org/bibtex/226d0f40788734a24a15d595bff326d4a/swarmlab},
booktitle = {Proc. of 9th European Workshop on Multi-agent Systems (EUMAS 2011)},
keywords = {dynamical systems, evolutionary game theory, gradient learning, reinforcement learning},
publisher = {Maastricht University},
title = {{Multi-agent Learning and the Reinforcement Gradient}},
url = {http://michaelkaisers.com/publications/2011_EUMAS_MKaisers.pdf},
year = 2011
}