The Unintelligibility is Ours: Notes on Chain-of-Thought
Many people seem to think that the chains of thought in RL-trained LLMs are under a great deal of “pressure” to cease being English. The idea is that, as LLMs solve harder and harder problems, they will eventually slide into inventing a “new language” …