Multi-Modality Distillation via Learning the teacher’s modality-level Gram Matrix
arXiv:2112.11447v2 Announce Type: replace
Abstract: In multi-modality knowledge distillation research, existing methods mainly focus on learning only the teacher’s final output. Thus, there are still deep differences be…