Multi-Modal LLM based Image Captioning in ICT: Bridging the Gap Between General and Industry Domain
arXiv:2601.09298v2
Abstract: In the information and communications technology (ICT) industry, training a domain-specific large language model (LLM) or constructing a retrieval-augmented generation system requires a substantial a…