cs.CL, cs.IR

AFMRL: Attribute-Enhanced Fine-Grained Multi-Modal Representation Learning in E-commerce

arXiv:2604.20135v1 Announce Type: new
Abstract: Multimodal representation is crucial for E-commerce tasks such as identical product retrieval. Large representation models (e.g., VLM2Vec) demonstrate strong multimodal understanding capabilities, yet th…