ProfVLM: A lightweight video-language model for multi-view proficiency estimation
arXiv:2509.26278v4 Announce Type: replace-cross
Abstract: Most existing approaches formulate action quality assessment and skill proficiency estimation as discriminative prediction tasks, typically producing discrete labels or scores without explicitl…