Seeking Physics in Diffusion Noise
arXiv:2603.14294v2 Announce Type: replace-cross
Abstract: Do video diffusion models encode signals predictive of physical plausibility? We probe intermediate denoising representations of a pretrained Diffusion Transformer (DiT) and find that physicall…