The Truth Lies Somewhere in the Middle (of the Generated Tokens)
arXiv:2605.09969v1 Announce Type: cross
Abstract: How should hidden states generated autoregressively be collapsed into a representation that reflects a language model’s internal state? Despite tokens being generated under causal masking, we find that…