Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning

Lee, Min Whoo; Kim, Kibeom; Shin, Soo Wung; Lee, Minsu; Zhang, Byoung-Tak

计算机科学 > 多智能体系统

arXiv:2501.01140 (cs)

[提交于 2025年1月2日 ]

标题：用于分布外多智能体强化学习的意外性通信

标题： Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning

Authors:Min Whoo Lee, Kibeom Kim, Soo Wung Shin, Minsu Lee, Byoung-Tak Zhang

摘要：将多智能体强化学习方法应用于现实场景具有挑战性，因为它可能需要智能体快速适应训练中很少或从未遇到的意外情况。针对此类分布外设置的泛化方法仅限于更具体、受限的分布偏移实例。为了解决分布偏移的适应问题，我们提出了Unexpected Encoding Scheme，这是一种新颖的去中心化多智能体强化学习算法，其中智能体通信“意外性”，即环境中的令人惊讶的方面。除了由原始奖励驱动通信产生的信息外，每个智能体还会根据先前经验预测下一个观察结果，测量预测与实际遇到的观察结果之间的差异，并将此差异作为信息进行编码。在多机器人仓库环境中的实验表明，我们提出的方法能够稳健地适应动态变化的训练环境以及分布外环境。

摘要： Applying multi-agent reinforcement learning methods to realistic settings is challenging as it may require the agents to quickly adapt to unexpected situations that are rarely or never encountered in training. Recent methods for generalization to such out-of-distribution settings are limited to more specific, restricted instances of distribution shifts. To tackle adaptation to distribution shifts, we propose Unexpected Encoding Scheme, a novel decentralized multi-agent reinforcement learning algorithm where agents communicate "unexpectedness," the aspects of the environment that are surprising. In addition to a message yielded by the original reward-driven communication, each agent predicts the next observation based on previous experience, measures the discrepancy between the prediction and the actually encountered observation, and encodes this discrepancy as a message. Experiments on multi-robot warehouse environment support that our proposed method adapts robustly to dynamically changing training environments as well as out-of-distribution environment.

评论：	7页，3图，发表于AAAI 2024研讨会（合作多智能体系统决策与学习：从个体需求到群体智能）
主题：	多智能体系统 (cs.MA)
引用方式：	arXiv:2501.01140 [cs.MA]
	(或者 arXiv:2501.01140v1 [cs.MA] 对于此版本)
	https://doi.org/10.48550/arXiv.2501.01140

提交历史

来自： Min Whoo Lee [查看电子邮件]
[v1] 星期四， 2025 年 1 月 2 日 08:47:12 UTC (457 KB)

计算机科学 > 多智能体系统

标题：用于分布外多智能体强化学习的意外性通信

标题： Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 多智能体系统

标题： 用于分布外多智能体强化学习的意外性通信 显示英文标题

标题： Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：用于分布外多智能体强化学习的意外性通信