Flash normalization: fast RMSNorm for LLMs

Graef, Nils; Clapp, Matthew; Wasielewski, Andrew

Computer Science > Machine Learning

arXiv:2407.09577v1 (cs)

[Submitted on 12 Jul 2024 (this version) , latest version 1 Jun 2025 (v3) ]

Title: Flash normalization: fast RMSNorm for LLMs

Title: 快速 RMSNorm 用于 LLMs 的归一化

Authors:Nils Graef, Matthew Clapp, Andrew Wasielewski

Abstract: RMSNorm is used by many LLMs such as Llama, Mistral, and OpenELM. This paper details FlashNorm, which is an exact but faster implementation of RMSNorm followed by linear layers. See https://huggingface.co/open-machine/FlashNorm for code and more transformer tricks.

Abstract: RMSNorm被许多大型语言模型如Llama、Mistral和OpenELM所使用。本文详细介绍了FlashNorm，这是RMSNorm的一种精确但更快的实现，随后是线性层。请访问https://huggingface.co/open-machine/FlashNorm查看代码和更多变换器技巧。

Comments:	7 pages, 8 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2407.09577 [cs.LG]
	(or arXiv:2407.09577v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.09577

Submission history

From: Nils Graef [view email]
[v1] Fri, 12 Jul 2024 00:37:55 UTC (440 KB)
[v2] Tue, 1 Apr 2025 23:19:22 UTC (449 KB)
[v3] Sun, 1 Jun 2025 22:12:10 UTC (584 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-07

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title: Flash normalization: fast RMSNorm for LLMs

Title: 快速 RMSNorm 用于 LLMs 的归一化

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title: Flash normalization: fast RMSNorm for LLMs Show Chinese title

Title: 快速 RMSNorm 用于 LLMs 的归一化

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Title: Flash normalization: fast RMSNorm for LLMs