Speech-Synchronized Whiteboard Generation via VLM-Driven Structured Drawing Representations
arXiv:2603.25870v1 Announce Type: new
Abstract: Creating whiteboard-style educational videos demands precise coordination between freehand illustrations and spoken narration, yet no existing method addresses this multimodal synchronization problem wit…