cs.CL

Hierarchical Alignment: Enforcing Hierarchical Instruction-Following in LLMs through Logical Consistency

arXiv:2604.09075v1 Announce Type: new
Abstract: Large language models increasingly operate under multiple instructions from heterogeneous sources with different authority levels, including system policies, user requests, tool outputs, and retrieved co…