cs.CL

Perception-Aware Policy Optimization for Multimodal Reasoning

arXiv:2507.06448v5 Announce Type: replace
Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has proven to be a highly effective strategy for endowing Large Language Models (LLMs) with robust multi-step reasoning abilities. However, its d…

cs.AI, cs.CR

LLM-Guided Prompt Evolution for Password Guessing

arXiv:2604.12601v1 Announce Type: cross
Abstract: Passwords still remain a dominant authentication method, yet their security is routinely subverted by predictable user choices and large-scale credential leaks. Automated password guessing is a key too…

Scroll to Top