Asymptotic Convergence of Deep Multi-Agent Actor-Critic Algorithms

Redder, Adrian; Ramaswamy, Arunselvan; Karl, Holger

A. Redder, A. Ramaswamy, H. Karl, arXiv:2201.00570 (2022).

Preprint | English

Author

Redder, Adrian; Ramaswamy, Arunselvan; Karl, Holger

Department

Rechnernetze

Project

SFB 901 - C4: SFB 901 - Subproject C4
SFB 901: SFB 901
SFB 901 - C: SFB 901 - Project Area C

Abstract

We present sufficient conditions that ensure convergence of the multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm. DDPG is an instance of the actor-critic paradigm, one of the most popular paradigms of Deep Reinforcement Learning (DeepRL) for tackling continuous action spaces. In the setting considered herein, each agent observes a part of the global state space in order to take local actions, for which it receives local rewards. For every agent, DDPG trains a local actor (policy) and a local critic (Q-function). The analysis shows that multi-agent DDPG, using neural networks to approximate the local policies and critics, converges to limits with the following properties: the critic limits minimize the average squared Bellman loss; the actor limits parameterize a policy that maximizes the local critic's approximation of $Q_i^*$, where $i$ is the agent index. The average is taken with respect to a probability distribution over the global state-action space that captures the asymptotics of all local training processes. Finally, we extend the analysis to a fully decentralized setting where agents communicate over a wireless network prone to delays and losses, a typical scenario in, e.g., robotic applications.
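
To make the two limit properties concrete, the following is a minimal sketch of the quantities one agent would evaluate on a batch of sampled transitions. It is an illustration under stated assumptions, not the authors' implementation: the function names, the batch-average stand-in for the probability distribution, and the use of a precomputed next-state critic value (as in standard DDPG with target networks) are all hypothetical choices made here.

```python
import numpy as np


def squared_bellman_loss(q, reward, q_next, gamma=0.99):
    """Average squared Bellman error over a batch of sampled transitions.

    q       -- local critic values Q_i(s, a) for the sampled pairs
    reward  -- local rewards received by agent i
    q_next  -- critic values at the next state and next action
               (assumed to be precomputed, e.g. by target networks)
    gamma   -- discount factor (assumed value; not given in the abstract)
    """
    q = np.asarray(q, dtype=float)
    target = np.asarray(reward, dtype=float) + gamma * np.asarray(q_next, dtype=float)
    # The batch mean stands in for the average over the state-action distribution.
    return float(np.mean((q - target) ** 2))


def actor_objective(critic, policy, states):
    """Average local critic value of the actions chosen by the local policy.

    critic -- hypothetical callable (state, action) -> float
    policy -- hypothetical callable state -> action
    The actor is trained to make this quantity as large as possible.
    """
    return float(np.mean([critic(s, policy(s)) for s in states]))
```

In this sketch the critic update would decrease `squared_bellman_loss`, while the actor update would increase `actor_objective`; how the gradients are computed and how the step sizes are chosen is left open here.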

Publishing Year

2022

Journal Title

arXiv:2201.00570

LibreCat-ID

30791

Cite this

Redder A, Ramaswamy A, Karl H. Asymptotic Convergence of Deep Multi-Agent Actor-Critic Algorithms. arXiv:2201.00570. Published online 2022.

Redder, A., Ramaswamy, A., & Karl, H. (2022). Asymptotic Convergence of Deep Multi-Agent Actor-Critic Algorithms. In arXiv:2201.00570.

@article{Redder_Ramaswamy_Karl_2022, title={Asymptotic Convergence of Deep Multi-Agent Actor-Critic Algorithms}, journal={arXiv:2201.00570}, author={Redder, Adrian and Ramaswamy, Arunselvan and Karl, Holger}, year={2022} }

Redder, Adrian, Arunselvan Ramaswamy, and Holger Karl. “Asymptotic Convergence of Deep Multi-Agent Actor-Critic Algorithms.” ArXiv:2201.00570, 2022.

A. Redder, A. Ramaswamy, and H. Karl, “Asymptotic Convergence of Deep Multi-Agent Actor-Critic Algorithms,” arXiv:2201.00570. 2022.

Redder, Adrian, et al. “Asymptotic Convergence of Deep Multi-Agent Actor-Critic Algorithms.” ArXiv:2201.00570, 2022.

Sources

arXiv 2201.00570