PhysNote: Self-Knowledge Notes for Evolvable Physical Reasoning in Vision-Language Model
arXiv:2604.24443v1 Announce Type: new
Abstract: Vision-Language Models (VLMs) have demonstrated strong performance on textbook-style physics problems, yet they frequently fail when confronted with dynamic real-world scenarios that require temporal con…