IGARSS 2026 || Washington, D.C. || 9

WE1.R15.5

Training-Free KV-Cache for Zero-Shot Remote Sensing Scene Classification Based on Pre-trained Multi-modal Foundation Model

Guiying Zhu, Yin Zhuang, Beijing Institute of Technology, China; Tong Zhang, Peking University, China; Guanqun Wang, He Chen, Beijing Institute of Technology, China; Lianlin Li, Peking University, China

Session:

WE1.R15: Vision-Language Models for Remote Sensing: Foundations, Applications, and Challenges (1/3) Oral

Location:

TBD

Presentation Time:

Wed, 12 Aug, 09:30 - 09:45

Session Co-Chairs:

Linlin Xu, and Hongjie He,

Session WE1.R15

WE1.R15.1: Adapting Vision Language Models for High Resolution Range Profile Recognition with LoRA

Xinyi Niu, Lingfeng Chen, Xiaolong Su, Panhe Hu, College of Electronic Science and Technology, National University of Defense Technology, China

WE1.R15.2: Exploring the Setting of Normalization Statistics on Tuning Multispectral-Text Models

Huiying Yao, Jingtao Li, Yanfei Zhong, Wuhan university, China

WE1.R15.3: UNLOCKING MULTI-SPECTRAL DATA FOR MULTI-MODAL MODELS WITH GUIDED INPUTS AND CHAIN-OF-THOUGHT REASONING

Dahun Kim, Ganesh Mallya, Anelia Angelova, Google DeepMind, United States

WE1.R15.4: AIRSPATIALVLM++: TOWARDS STRONGER SPATIAL AWARENESS IN REMOTE SENSING LARGE VISION-LANGUAGE MODELS

Shujun Zhao, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Nomal University, China; Penghui Huang, Shanghai Jiao Tong University, China; Yue Zhou, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Nomal University, China; Xue Yang, Xue Jiang, Shanghai Jiao Tong University, China; Hongxin Yang, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Nomal University, China; Jonathan Li, School of GeoAI and Hindon STAI Institute, East China Normal University; Key Laboratory of Geographic Information Science (Ministry of Education), East China Nomal University, China

WE1.R15.5: Training-Free KV-Cache for Zero-Shot Remote Sensing Scene Classification Based on Pre-trained Multi-modal Foundation Model