VLM 10

Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment May 21, 2024
RemoteCLIP A Vision Language Foundation Model for Remote Sensing Apr 11, 2024
Large Language Models for Captioning and Retrieving Remote Sensing Images Mar 12, 2024
SkyScript A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing Mar 1, 2024
Language-aware domain generalization network for cross-scene hyperspectral image classification Nov 22, 2023
Vlca vision-language aligning model with cross-modal attention for bilingual remote sensing image captioning Jun 5, 2023
Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning Apr 22, 2023
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval Feb 13, 2023
S-CLIP Semi-supervised Vision-Language Learning using Few Specialist Captions Jan 16, 2023
RS5M and GeoRSCLIP A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing Jan 3, 2023

Trending Tags

dataset 图像 paper Pretrain 图像、文本 Other VLM MLLM 视频 Agent