Multi Agent Deep Learning with Cooperative Communication

We consider the problem of multi agents cooperating in a partially-observable environment. Agents must learn to coordinate and share relevant information to solve the tasks successfully. This article describes Asynchronous Advantage Actor-Critic with Communication (A3C2), an end-to-end differentiable approach where agents learn policies and communication protocols simultaneously. A3C2 uses a centralized learning, distributed execution paradigm, supports independent agents, dynamic team sizes, partially-observable environments, and noisy communications. We compare and show that A3C2 outperforms other state-of-the-art proposals in multiple environments.

eISSN:: 2083-2567
Language:: English

Publication timeframe:: 4 times per year
Journal Subjects:: Computer Sciences, Databases and Data Mining, Artificial Intelligence

Journal RSS Feed

Multi Agent Deep Learning with Cooperative Communication

Published Online: May 23, 2020

Page range: 189 - 207

Received: Nov 01, 2019

Accepted: Mar 26, 2020

DOI: https://doi.org/10.2478/jaiscr-2020-0013

Keywords
multi-agent systems, deep reinforcement learning, centralized learning

© 2020 David Simões et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Multi Agent Deep Learning with Cooperative Communication

Published Online: May 23, 2020

Page range: 189 - 207

Received: Nov 01, 2019

Accepted: Mar 26, 2020

DOI: https://doi.org/10.2478/jaiscr-2020-0013

Keywordsmulti-agent systems, deep reinforcement learning, centralized learning

© 2020 David Simões et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Keywords
multi-agent systems, deep reinforcement learning, centralized learning