cs.CL, cs.LG

Improving LLM Predictions via Inter-Layer Structural Encoders

arXiv:2603.22665v2 Announce Type: replace
Abstract: The standard practice in Large Language Models (LLMs) is to base predictions on final-layer representations. However, intermediate layers encode complementary task-relevant signals, and the optimal l…