**摘要**
Remote sensing vision-language models have advanced Earth observation understanding, but most existing work remains centered on RGB imagery, leaving the complementary information in infrared data underexplored. Infrared images provide distinctive cues, including thermal intensity structures, object boundaries, and illumination-invariant scene features, which can enrich visual-language learning bey
👤 作者: Jiaju Han, Ben Zhang, Xuemeng Sun, Qike Zhang, Yuxian Dong, Chengyin Hu, Fengyu Zhang, Yiwei Wei, Jiujiang Guo
---
🔗 **[FusionRS :用于双模态视觉语言基础模型的大规模RGB红外遥感数据集](https://arxiv.org/abs/2606.17020v1)**
> FusionRS: A Large-Scale RGB-Infrared Remote Sensing Dataset for Dual-Modal Vision-Language Foundation Models
🏷️ 来源: ArXiv cs.AI
⏱️ 2026-06-16 14:00
加载回复中...