Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey

Jiang, Bowen; Xie, Yangxinyu; Wang, Xiaomeng; Su, Weijie J.; Taylor, Camillo J.; Mallick, Tanwi

计算机科学 > 人工智能

arXiv:2406.00252v2 (cs)

[提交于 2024年6月1日 (v1) ，修订后的 2024年6月5日 (此版本， v2) ， 最新版本 2025年2月16日 (v6) ]

标题：多模态和多智能体系统与理性相遇：综述

标题： Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey

Authors:Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Weijie J. Su, Camillo J. Taylor, Tanwi Mallick

摘要：理性是指受理性指导的品质，其特点是符合证据和逻辑规则的逻辑思维和决策。这种品质对于有效的解决问题至关重要，因为它确保解决方案有坚实的基础并系统地得出。尽管大型语言模型（LLMs）在生成具有显著准确性的类人文本方面取得了进展，但它们会继承训练数据中的偏见，在不同上下文中表现出不一致，并且在理解涉及多层上下文的复杂场景时存在困难。因此，最近的研究尝试利用多个代理协同工作，结合不同类型的数据和工具，以提高一致性和可靠性。为此，本文旨在通过调查最新研究，了解多模态和多代理系统是否正在向理性发展，识别在理性方面相对于单代理和单模态系统的进步，并讨论开放问题和未来方向。我们在 https://github.com/bowen-upenn/MMMA_Rationality 保持一个开放的代码库。

摘要： Rationality is the quality of being guided by reason, characterized by logical thinking and decision-making that align with evidence and logical rules. This quality is essential for effective problem-solving, as it ensures that solutions are well-founded and systematically derived. Despite the advancements of large language models (LLMs) in generating human-like text with remarkable accuracy, they present biases inherited from the training data, inconsistency across different contexts, and difficulty understanding complex scenarios involving multiple layers of context. Therefore, recent research attempts to leverage the strength of multiple agents working collaboratively with various types of data and tools for enhanced consistency and reliability. To that end, this paper aims to understand whether multi-modal and multi-agent systems are advancing toward rationality by surveying the state-of-the-art works, identifying advancements over single-agent and single-modal systems in terms of rationality, and discussing open problems and future directions. We maintain an open repository at https://github.com/bowen-upenn/MMMA_Rationality.

主题：	人工智能 (cs.AI) ; 计算与语言 (cs.CL); 计算机视觉与模式识别 (cs.CV); 多智能体系统 (cs.MA)
引用方式：	arXiv:2406.00252 [cs.AI]
	(或者 arXiv:2406.00252v2 [cs.AI] 对于此版本)
	https://doi.org/10.48550/arXiv.2406.00252

提交历史

来自： Bowen Jiang [查看电子邮件]
[v1] 星期六， 2024 年 6 月 1 日 01:17:25 UTC (2,332 KB)
[v2] 星期三， 2024 年 6 月 5 日 19:39:56 UTC (2,332 KB)
[v3] 星期二， 2024 年 6 月 18 日 04:22:39 UTC (4,650 KB)
[v4] 星期二， 2024 年 10 月 15 日 20:11:42 UTC (3,975 KB)
[v5] 星期六， 2025 年 2 月 8 日 20:48:13 UTC (3,980 KB)
[v6] 星期日， 2025 年 2 月 16 日 05:30:30 UTC (3,980 KB)

计算机科学 > 人工智能

标题：多模态和多智能体系统与理性相遇：综述

标题： Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 人工智能

标题： 多模态和多智能体系统与理性相遇：综述 显示英文标题

标题： Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：多模态和多智能体系统与理性相遇：综述