cs.AI, cs.SE

AgentForge: Execution-Grounded Multi-Agent LLM Framework for Autonomous Software Engineering

arXiv:2604.13120v1 Announce Type: cross
Abstract: Large language models generate plausible code but cannot verify correctness. Existing multi-agent systems simulate execution or leave verification optional. We introduce execution-grounded verification…