CtxMIM Context-Enhanced Masked Image Modeling for Remote Sensing Image Understanding
- 论文名称: CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image Understanding
- Link: https://arxiv.org/abs/2310.00022
- Published in: Arxiv 2023
- Type: Pretrain
- Code/Project: —
- 备注: 自监督,掩码
- Backbone: Swin Transformer
- Backbone 1: Swin-B
- 下游任务: Classification, Semantic Segmentation, Object Detection, Instance Segmentation
- 下游任务 1: classification, semantic segmentation, object detection, instance segmentation
- Short Summary: CtxMIM将原始图像块形式化为重建模板,并采用Siamese框架处理两组图像块。引入了一个上下文增强的生成分支,通过重建中的上下文一致性约束提供上下文信息。通过这种简单且优雅的设计,CtxMIM鼓励预训练模型在大规模数据集上学习对象级或像素级特征,而无需特定的时间或地理约束。
This post is licensed under CC BY 4.0 by the author.