Distributed Reinforcement Learning for Coordinate Multi-Robot Foraging

Abstract

In this paper, we propose a distributed dynamic correlation matrix based multi-Q (D-DCM-Multi-Q) learning method for multi-robot systems. First, a dynamic correlation matrix is proposed for multi-agent reinforcement learning, which not only considers each individual robot's Q-value, but also the correlated Q-values of neighboring robots. Then, the theoretical analysis of the system convergence for this D-DCM-Multi-Q method is provided. Various simulations for multi-robot foraging as well as a proof-of-concept experiment with a physical multi-robot system have been conducted to evaluate the proposed D-DCM-Multi-Q method. The extensive simulation/experimental results show the effectiveness, robustness, and stability of the proposed method.

BibTeX key: distRL
entry type: article
year: 2010
month: dec
day: 01
journal: Journal of Intelligent & Robotic Systems
number: 3
pages: 531--551
volume: 60
issn: 1573-0409
DOI: 10.1007/s10846-010-9429-4
url: https://doi.org/10.1007/s10846-010-9429-4

BibSonomy

Distributed Reinforcement Learning for Coordinate Multi-Robot Foraging

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on