cs.AI, cs.CV, eess.IV

Towards a Large Language-Vision Question Answering Model for MSTAR Automatic Target Recognition

arXiv:2605.10772v1 Announce Type: cross
Abstract: Large language-vision models (LLVM), such as OpenAI’s ChatGPT and GPT-4, have gained prominence as powerful tools for analyzing text and imagery. The merging of these data domains represents a signific…