cs.DB, cs.DC, cs.LG

DisCEdge: Distributed Context Management for Large Language Models at the Edge

arXiv:2511.22599v2 Announce Type: replace-cross
Abstract: Deploying Large Language Model (LLM) services at the edge benefits latency-sensitive and privacy-aware applications. However, the stateless nature of LLMs makes managing user context (e.g., ses…