Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio
arXiv:2603.25926v1 Announce Type: new
Abstract: Soft context compression reduces the computational workload of processing long contexts in LLMs by encoding them into a smaller number of latent tokens. However, existing frameworks apply uniform…