SemanticDialect: Semantic-Aware Mixed-Format Quantization for Video Diffusion Transformers
arXiv:2603.02883v3 Announce Type: replace
Abstract: Diffusion Transformers (DiTs) achieve state-of-the-art video generation quality, but their substantial memory and computational footprints hinder edge deployment. Quantization can reduce these costs,…