arXiv Open Access 2025

CORD: Generalizable Cooperation via Role Diversity

Kanefumi Matsuyama Kefan Su Jiangxing Wang Deheng Ye Zongqing Lu

Lihat Sumber

Abstrak

Cooperative multi-agent reinforcement learning (MARL) aims to develop agents that can collaborate effectively. However, most cooperative MARL methods overfit training agents, making learned policies not generalize well to unseen collaborators, which is a critical issue for real-world deployment. Some methods attempt to address the generalization problem but require prior knowledge or predefined policies of new teammates, limiting real-world applications. To this end, we propose a hierarchical MARL approach to enable generalizable cooperation via role diversity, namely CORD. CORD's high-level controller assigns roles to low-level agents by maximizing the role entropy with constraints. We show this constrained objective can be decomposed into causal influence in role that enables reasonable role assignment, and role heterogeneity that yields coherent, non-redundant role clusters. Evaluated on a variety of cooperative multi-agent tasks, CORD achieves better performance than baselines, especially in generalization tests. Ablation studies further demonstrate the efficacy of the constrained objective in generalizable cooperation.

Topik & Kata Kunci

cs.AI cs.LG cs.MA

Penulis (5)

Kanefumi Matsuyama

Kefan Su

Jiangxing Wang

Deheng Ye

Zongqing Lu

Format Sitasi

APA MLA BibTeX

Matsuyama, K., Su, K., Wang, J., Ye, D., Lu, Z. (2025). CORD: Generalizable Cooperation via Role Diversity. https://arxiv.org/abs/2501.02221

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓