ESP: Exploiting Symmetry Prior for Multi-Agent
Reinforcement Learning

Xin Yu, Rongye Shi*, Pu Feng, Yongkai Tian, Jie Luo, Wenjun Wu

European Conference on Artificial Intelligence (ECAI 2023)

[Paper]   [Supplementary]

Abstract

Multi-agent reinforcement learning (MARL) has achieved promising results in recent years. However, most existing reinforcement learning methods require a large amount of data for model training. In addition, data-efficient reinforcement learning requires the construction of strong inductive biases, which are ignored in the current MARL approaches. Inspired by the symmetry phenomenon in multi-agent systems, this paper proposes a framework for exploiting prior knowledge by integrating data augmentation and a well-designed consistency loss into the existing MARL methods. In addition, the proposed framework is model-agnostic and can be applied to most of the current MARL algorithms. Experimental tests on multiple challenging tasks demonstrate the effectiveness of the proposed framework. Moreover, the proposed framework is applied to a physical multi-robot testbed to show its superiority.

Videos

MAPPO

MAPPO-ESP

MAPPO

MAPPO-ESP

Code

Coming soon.

Citation

Coming soon.

Contact

If you have any questions, please feel free to contact Xin Yu at nlsdeyuxin[at]buaa[dot]edu[dot]cn[dot].