Nengneng Yu, Sixian Xiong, Yibo Zhao, Wei Wang, Zaoxing Liu

cs.AI, cs.LG, cs.PF, cs.SE, cs.SY, eess.SY

Enabling Performant and Flexible Model-Internal Observability for LLM Inference

May 13, 2026

arXiv:2605.11093v1
Abstract: Today’s inference-time workloads increasingly depend on timely access to a model’s internal states. We present DMI-Lib, a high-speed deep model inspector that treats internal observability as a first-cla…
