cs.LG, stat.AP

Steer-to-Detect: Probing Hidden Representations for Detection of LLM-Generated Texts

arXiv:2605.12890v1 Announce Type: cross
Abstract: The rapid advancement of large language models (LLMs) has made machine-generated text increasingly difficult to distinguish from human-written text. While recent studies explore leveraging internal rep…