PokeFusion Attention: A Lightweight Cross-Attention Mechanism for Style-Conditioned Image Generation
arXiv:2602.03220v3 Announce Type: replace
Abstract: Style-conditioned text-to-image (T2I) generation with diffusion models requires both stable character structure and consistent, fine-grained style expression across diverse prompts. Existing approach…