SkillTester: Benchmarking Utility and Security of Agent Skills
arXiv:2603.28815v1 Announce Type: cross
Abstract: This technical report presents SkillTester, a tool for evaluating the utility and security of agent skills. Its evaluation framework combines paired baseline and with-skill execution conditions with a …