PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection
arXiv:2603.21576v2 Announce Type: replace-cross
Abstract: Long-context LLM inference is bottlenecked not by compute but by the O(n) memory bandwidth cost of scanning the KV cache at every decode step — a wall that no amount of arithmetic scaling can …
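The truncated abstract names the core bottleneck: at every decode step, full attention must scan the entire KV cache, so memory traffic grows linearly with context length n. A minimal back-of-the-envelope sketch of that cost follows; the model dimensions are illustrative assumptions, not PRISM's configuration or anything stated in the paper.

```python
# Sketch (not from the paper): why full-attention decode is O(n) in memory
# traffic. All dimensions below are assumed, illustrative values.

def kv_bytes_per_token(layers=32, kv_heads=8, head_dim=128, dtype_bytes=2):
    """Bytes of KV cache stored per generated token (K and V, all layers, fp16)."""
    return 2 * layers * kv_heads * head_dim * dtype_bytes

def decode_step_traffic_gb(context_len, **kw):
    """Memory read per decode step if attention scans the whole KV cache."""
    return context_len * kv_bytes_per_token(**kw) / 1e9

for n in (8_192, 131_072, 1_048_576):
    print(f"n={n:>9,}: {decode_step_traffic_gb(n):7.2f} GB read per decoded token")
```

Under these assumed dimensions, a 1M-token context forces on the order of 130 GB of KV-cache reads per decoded token, which is the memory wall the abstract describes: no amount of extra arithmetic throughput helps if every step is bandwidth-bound on the scan. A block-selection scheme that picks the relevant cache blocks in O(1), as the title claims PRISM does photonically, would cap that per-step traffic at the size of the selected blocks rather than the full cache.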