MachineLearning

What should happen when you feed impossible moves into a chess-playing language model? [D]

I'd appreciate some input on an experiment I've been mulling over. You can treat it as straight-up interpretability, but it would have theoretical implications. Karvonen (2024) trained a 50M-parameter transformer on chess game transcripts. Just…