cs.AI, cs.LG

Distributed Interpretability and Control for Large Language Models

arXiv:2604.06483v1 Announce Type: cross
Abstract: Large language models that require multiple GPU cards to host are usually the most capable models. It is necessary to understand and steer these models, but the current technologies do not support the …