cs.AI, cs.CR

Privacy Guard & Token Parsimony by Prompt and Context Handling and LLM Routing

arXiv:2603.28972v1 Announce Type: cross
Abstract: The large-scale adoption of Large Language Models (LLMs) forces a trade-off between operational cost (OpEx) and data privacy. Current routing frameworks reduce costs but ignore prompt sensitivity, expo…