WE1.R15.5
Training-Free KV-Cache for Zero-Shot Remote Sensing Scene Classification Based on Pre-trained Multi-modal Foundation Model
Guiying Zhu, Yin Zhuang, Beijing Institute of Technology, China; Tong Zhang, Peking University, China; Guanqun Wang, He Chen, Beijing Institute of Technology, China; Lianlin Li, Peking University, China
Session:
WE1.R15: Vision-Language Models for Remote Sensing: Foundations, Applications, and Challenges (1/3) Oral
Track:
Community Contributed Themes
Location:
TBD
Presentation Time:
Wed, 12 Aug, 09:30 - 09:45
Session Co-Chairs:
Linlin Xu and Hongjie He
Session WE1.R15
WE1.R15.1: Adapting Vision Language Models for High Resolution Range Profile Recognition with LoRA
Xinyi Niu, Lingfeng Chen, Xiaolong Su, Panhe Hu, College of Electronic Science and Technology, National University of Defense Technology, China
WE1.R15.2: Exploring the Setting of Normalization Statistics on Tuning Multispectral-Text Models
Huiying Yao, Jingtao Li, Yanfei Zhong, Wuhan University, China
WE1.R15.3: UNLOCKING MULTI-SPECTRAL DATA FOR MULTI-MODAL MODELS WITH GUIDED INPUTS AND CHAIN-OF-THOUGHT REASONING
Dahun Kim, Ganesh Mallya, Anelia Angelova, Google DeepMind, United States
WE1.R15.4: AIRSPATIALVLM++: TOWARDS STRONGER SPATIAL AWARENESS IN REMOTE SENSING LARGE VISION-LANGUAGE MODELS
Shujun Zhao, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, China; Penghui Huang, Shanghai Jiao Tong University, China; Yue Zhou, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, China; Xue Yang, Xue Jiang, Shanghai Jiao Tong University, China; Hongxin Yang, School of GeoAI and Hindon STAI Institute, Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, China; Jonathan Li, School of GeoAI and Hindon STAI Institute, East China Normal University; Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, China
WE1.R15.5: Training-Free KV-Cache for Zero-Shot Remote Sensing Scene Classification Based on Pre-trained Multi-modal Foundation Model
Guiying Zhu, Yin Zhuang, Beijing Institute of Technology, China; Tong Zhang, Peking University, China; Guanqun Wang, He Chen, Beijing Institute of Technology, China; Lianlin Li, Peking University, China