Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers
arXiv:2603.27666v1 Announce Type: new
Abstract: Recent advances in diffusion-based controllable visual generation have led to remarkable improvements in image quality. However, these powerful models are typically deployed on cloud servers due to their…