The Serial Scaling Hypothesis

Liu, Yuxi; Preechakul, Konpat; Kuwaranancharoen, Kananart; Bai, Yutong

计算机科学 > 机器学习

arXiv:2507.12549 (cs)

[提交于 2025年7月16日 ]

标题：串行缩放假说

标题： The Serial Scaling Hypothesis

Authors:Yuxi Liu, Konpat Preechakul, Kananart Kuwaranancharoen, Yutong Bai

摘要：虽然机器学习通过大规模并行化取得了进展，但我们发现了一个关键的盲点：某些问题本质上是顺序的。这些“本质上串行”的问题——从数学推理到物理模拟再到顺序决策——需要依赖的计算步骤无法进行并行化。借鉴复杂性理论，我们正式明确了这一区别，并证明当前以并行为中心的架构在这些任务上存在根本性的局限。我们认为，认识到计算的串行性质对机器学习、模型设计和硬件开发具有深远的影响。随着人工智能处理越来越复杂的推理，有意识地扩展串行计算——而不仅仅是并行计算——对于持续进步至关重要。

摘要： While machine learning has advanced through massive parallelization, we identify a critical blind spot: some problems are fundamentally sequential. These "inherently serial" problems-from mathematical reasoning to physical simulations to sequential decision-making-require dependent computational steps that cannot be parallelized. Drawing from complexity theory, we formalize this distinction and demonstrate that current parallel-centric architectures face fundamental limitations on such tasks. We argue that recognizing the serial nature of computation holds profound implications on machine learning, model design, hardware development. As AI tackles increasingly complex reasoning, deliberately scaling serial computation-not just parallel computation-is essential for continued progress.

评论：	28页（13页正文+附录与参考文献），8张图，共同第一作者
主题：	机器学习 (cs.LG) ; 计算复杂性 (cs.CC); 机器学习 (stat.ML)
MSC 类：	68Q15, 68Q10, 68T07
ACM 类：	F.1.1; F.1.3; I.2.6
引用方式：	arXiv:2507.12549 [cs.LG]
	(或者 arXiv:2507.12549v1 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2507.12549

提交历史

来自： Konpat Preechakul [查看电子邮件]
[v1] 星期三， 2025 年 7 月 16 日 18:01:26 UTC (1,927 KB)

计算机科学 > 机器学习

标题：串行缩放假说

标题： The Serial Scaling Hypothesis

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： 串行缩放假说 显示英文标题

标题： The Serial Scaling Hypothesis

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：串行缩放假说