SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
We present Stable Video 4D 2.0 (SV4D 2.0), a multi-view video diffusion
model for dynamic 3D asset generation. Compared to its predecessor SV4D,
SV4D 2.0 is more robust to occlusions and large motion, generalizes better
to real-world videos, and produces higher-quality outputs in terms of
detail sharpness and spatio-temporal consistency.