cs.CL

Stochasticity in Tokenisation Improves Robustness

arXiv:2604.16037v1 Announce Type: new
Abstract: The widespread adoption of large language models (LLMs) has heightened concerns about their robustness. Vulnerabilities to perturbations in the tokenisation of the input indicate that models trained with a de…