cs.AI, cs.CV

Latent-Compressed Variational Autoencoder for Video Diffusion Models

arXiv:2604.16479v1 Announce Type: new
Abstract: Video variational autoencoders (VAEs) used in latent diffusion models typically require a sufficiently large number of latent channels to ensure high-quality video reconstruction. However, recent studies…