cs.CL

A Single Layer to Explain Them All:Understanding Massive Activations in Large Language Models

arXiv:2605.08504v1 Announce Type: new
Abstract: We investigate the origins of massive activations in large language models (LLMs) and identify a specific layer named the \textbf{Massive Emergence Layer (ME Layer)}, that is consistently observed across…