cs.LG

ContextPilot: Fast Long-Context Inference via Context Reuse

arXiv:2511.03475v4 Announce Type: replace
Abstract: AI applications increasingly depend on long-context inference, where LLMs consume substantial context to support stronger reasoning. Common examples include retrieval-augmented generation, agent memo…