Knowledge Packs: Zero-Token Knowledge Delivery via KV Cache Injection
arXiv:2604.03270v1 Announce Type: new
Abstract: Retrieval-augmented generation (RAG) spends prompt tokens on every retrieved passage. We propose Knowledge Packs: pre-computed KV caches that deliver the same knowledge at zero token cost. For causal transformers, the KV cache from a forward pass on text F is identical …
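
The abstract is cut off mid-sentence here, so the exact equivalence the authors state is not visible. As a hedged illustration of the general property that KV cache reuse relies on, the sketch below builds a toy causal self-attention stack in plain Python and checks that the keys and values computed for a prefix F alone match the first len(F) entries computed when F is followed by more tokens. Everything in it (`make_model`, `forward`, `prefix_cache_matches`, the width `D`, the random weights, the token ids) is hypothetical and chosen for illustration; it is not the paper's implementation and omits layer norm, MLP blocks, multiple heads, and positional encodings.

```python
import math
import random

D = 4  # toy model width (assumption, for illustration only)


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


def make_model(seed=0, vocab=16, layers=2):
    """Hypothetical toy model: an embedding table plus Wq/Wk/Wv per layer."""
    rng = random.Random(seed)
    return {
        "embed": rand_matrix(rng, vocab, D),
        "layers": [
            {name: rand_matrix(rng, D, D) for name in ("Wq", "Wk", "Wv")}
            for _ in range(layers)
        ],
    }


def forward(model, tokens):
    """Run causal self-attention; return (hidden states, per-layer KV cache)."""
    h = [list(model["embed"][t]) for t in tokens]
    cache = []
    for layer in model["layers"]:
        Q = [matvec(layer["Wq"], x) for x in h]
        K = [matvec(layer["Wk"], x) for x in h]
        V = [matvec(layer["Wv"], x) for x in h]
        cache.append((K, V))
        new_h = []
        for i, q in enumerate(Q):
            # Causal mask: position i attends only to positions 0..i.
            scores = [
                sum(a * b for a, b in zip(q, K[j])) / math.sqrt(D)
                for j in range(i + 1)
            ]
            m = max(scores)
            weights = [math.exp(s - m) for s in scores]
            z = sum(weights)
            attn = [
                sum(w * V[j][k] for j, w in enumerate(weights)) / z
                for k in range(D)
            ]
            # Residual connection keeps each position's own state in the mix.
            new_h.append([x + a for x, a in zip(h[i], attn)])
        h = new_h
    return h, cache


def prefix_cache_matches(model, facts, query, tol=1e-12):
    """Compare the cache for `facts` alone with the prefix of the joint cache."""
    _, cache_f = forward(model, facts)
    _, cache_fq = forward(model, facts + query)
    n = len(facts)
    for (Kf, Vf), (Kfq, Vfq) in zip(cache_f, cache_fq):
        for a, b in zip(Kf + Vf, Kfq[:n] + Vfq[:n]):
            if any(abs(x - y) > tol for x, y in zip(a, b)):
                return False
    return True


model = make_model()
facts = [3, 1, 4, 1, 5]   # stand-in token ids for the knowledge text F
query = [9, 2, 6]         # stand-in token ids for whatever follows F
print(prefix_cache_matches(model, facts, query))
```

In this toy setting the check passes because no prefix position ever reads from a later position, so appending tokens cannot change the prefix's keys and values at any layer. Whether and how this carries over to real models with positional encodings and chat templates is a question for the full paper, not something this sketch establishes.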