cs.CL

PromptSuite: A Task-Agnostic Framework for Multi-Prompt Generation

arXiv:2507.14913v5
Abstract: Evaluating LLMs with a single prompt has proven unreliable: small changes to the prompt can lead to significant performance differences. However, generating the prompt variations needed for a more robust multi…