cs.AI, cs.CR

CyberCertBench: Evaluating LLMs in Cybersecurity Certification Knowledge

arXiv:2604.20389v1 Announce Type: cross
Abstract: The rapid evolution and use of Large Language Models (LLMs) in professional workflows require an evaluation of their domain-specific knowledge against industry standards. We introduce CyberCertBench, a …