cs.AI, cs.CR

CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios

arXiv:2605.07830v1 Announce Type: cross
Abstract: Large language models (LLMs) are increasingly deployed as autonomous agents in offensive cybersecurity. In this paper, we reveal an interesting phenomenon: different agents exhibit distinct attack patt…