arXiv Open Access 2025

CORD: Generalizable Cooperation via Role Diversity

Kanefumi Matsuyama Kefan Su Jiangxing Wang Deheng Ye Zongqing Lu
Lihat Sumber

Abstrak

Cooperative multi-agent reinforcement learning (MARL) aims to develop agents that can collaborate effectively. However, most cooperative MARL methods overfit training agents, making learned policies not generalize well to unseen collaborators, which is a critical issue for real-world deployment. Some methods attempt to address the generalization problem but require prior knowledge or predefined policies of new teammates, limiting real-world applications. To this end, we propose a hierarchical MARL approach to enable generalizable cooperation via role diversity, namely CORD. CORD's high-level controller assigns roles to low-level agents by maximizing the role entropy with constraints. We show this constrained objective can be decomposed into causal influence in role that enables reasonable role assignment, and role heterogeneity that yields coherent, non-redundant role clusters. Evaluated on a variety of cooperative multi-agent tasks, CORD achieves better performance than baselines, especially in generalization tests. Ablation studies further demonstrate the efficacy of the constrained objective in generalizable cooperation.

Topik & Kata Kunci

Penulis (5)

K

Kanefumi Matsuyama

K

Kefan Su

J

Jiangxing Wang

D

Deheng Ye

Z

Zongqing Lu

Format Sitasi

Matsuyama, K., Su, K., Wang, J., Ye, D., Lu, Z. (2025). CORD: Generalizable Cooperation via Role Diversity. https://arxiv.org/abs/2501.02221

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓