WE1.R15: Vision-Language Models for Remote Sensing: Foundations, Applications, and Challenges (1/3)
Wed, 12 Aug, 08:30 - 09:45
Location: TBD
Session Type: Oral
Session Co-Chairs: Linlin Xu and Hongjie He
Track: Community Contributed Themes
Wed, 12 Aug, 08:30 - 08:45

WE1.R15.1: Adapting Vision Language Models for High Resolution Range Profile Recognition with LoRA

Xinyi Niu, Lingfeng Chen, Xiaolong Su, Panhe Hu, College of Electronic Science and Technology, National University of Defense Technology, China
Wed, 12 Aug, 08:45 - 09:00

WE1.R15.2: Exploring the Setting of Normalization Statistics on Tuning Multispectral-Text Models

Huiying Yao, Jingtao Li, Yanfei Zhong, Wuhan University, China
Wed, 12 Aug, 09:00 - 09:15

WE1.R15.3: Unlocking Multi-Spectral Data for Multi-Modal Models with Guided Inputs and Chain-of-Thought Reasoning

Dahun Kim, Ganesh Mallya, Anelia Angelova, Google DeepMind, United States
Wed, 12 Aug, 09:15 - 09:30

WE1.R15.4: AIRSPATIALVLM++: Towards Stronger Spatial Awareness in Remote Sensing Large Vision-Language Models

Shujun Zhao, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, China; Penghui Huang, Shanghai Jiao Tong University, China; Yue Zhou, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, China; Xue Yang, Xue Jiang, Shanghai Jiao Tong University, China; Hongxin Yang, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, China; Jonathan Li, School of GeoAI and Hindon STAI Institute, East China Normal University; Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, China
Wed, 12 Aug, 09:30 - 09:45

WE1.R15.5: Training-Free KV-Cache for Zero-Shot Remote Sensing Scene Classification Based on Pre-trained Multi-modal Foundation Model

Guiying Zhu, Yin Zhuang, Beijing Institute of Technology, China; Tong Zhang, Peking University, China; Guanqun Wang, He Chen, Beijing Institute of Technology, China; Lianlin Li, Peking University, China