Paraphrasing Is (At Best) a Partial Defence Against Steganography in LLMs
Within the AI Safety community, paraphrasing, which, in the context of this post, simply means using another LLM (with nonzero temperature) to rewrite a given piece of content, is generally considered a viable defence and detection method for steganogr…