GeoChat Grounded Large Vision-Language Model for Remote Sensing
论文名称: GeoChat: Grounded Large Vision-Language Model for Remote Sensing 模型架构: MLLM Visual Encoder: Transformer Text Encoder: Transformer Model Details: Vision Encoder:CLIP-ViTText Encoder...