Masha Fedzechkina, Eleonora Gualdoni, Rita Ramos, Sinead Williamson

What do your logits know? (The answer may surprise you!)

Masha Fedzechkina, Eleonora Gualdoni, Rita Ramos, Sinead Williamson / April 14, 2026

arXiv:2604.09885v1 Announce Type: new
Abstract: Recent work has shown that probing model internals can reveal a wealth of information not apparent from the model generations. This poses the risk of unintentional or malicious information leakage, where…

Author name: Masha Fedzechkina, Eleonora Gualdoni, Rita Ramos, Sinead Williamson

What do your logits know? (The answer may surprise you!)