Correcting Suppressed Log-Probabilities in Language Models with Post-Transformer Adapters
arXiv:2604.14174v2 Announce Type: replace-cross
Abstract: Alignment-tuned language models frequently suppress factual log-probabilities on politically sensitive topics despite retaining the knowledge in their hidden representations. We show that a 786…
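The abstract does not specify the adapter architecture, but the idea of a post-transformer correction can be sketched as a small additive map from the model's final hidden state to a logit adjustment, applied after the frozen backbone. Everything below (dimensions, zero initialization, the additive form) is an illustrative assumption, not the paper's method:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes (not from the abstract): hidden width and vocab size.
d_model, vocab = 16, 32

# Frozen base model pieces: a final hidden state h and the LM head W_lm.
h = rng.standard_normal(d_model)
W_lm = rng.standard_normal((vocab, d_model))

# Post-transformer adapter: a trainable map from the hidden state to an
# additive logit correction. Zero initialization means the adapter starts
# as a no-op, leaving the base model's distribution untouched.
W_adapter = np.zeros((vocab, d_model))

def log_probs(hidden, adapter):
    # Base logits plus the adapter's additive correction.
    logits = W_lm @ hidden + adapter @ hidden
    logits -= logits.max()                      # stabilize the softmax
    return logits - np.log(np.exp(logits).sum())

base = log_probs(h, np.zeros_like(W_adapter))
corrected = log_probs(h, W_adapter)
# With the adapter at zero, corrected log-probabilities match the base model.
assert np.allclose(base, corrected)
```

Training such an adapter would then only update `W_adapter`, raising the suppressed log-probabilities the hidden state still encodes while leaving the backbone's weights fixed.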