cs.AI, cs.CV, cs.LG, cs.RO

Seeking Physics in Diffusion Noise

arXiv:2603.14294v2 Announce Type: replace-cross
Abstract: Do video diffusion models encode signals predictive of physical plausibility? We probe intermediate denoising representations of a pretrained Diffusion Transformer (DiT) and find that physicall…