Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO
arXiv:2511.09780v2 Announce Type: replace
Abstract: Group Relative Policy Optimization (GRPO) has demonstrated wide adoption in the post-training of Large Language Models (LLMs). In GRPO, prompts are answered by the model and preferred behaviour is le…