CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language Models

arXiv — cs.CVThursday, November 13, 2025 at 5:00:00 AM
The introduction of the CHOICE benchmark marks a significant advancement in the evaluation of Large Vision-Language Models (VLMs) specifically for remote sensing applications. As these models have shown remarkable capabilities in Earth observation, the absence of a systematic evaluation framework has been a notable gap. CHOICE aims to bridge this gap by providing a comprehensive assessment tool that includes 10,507 problems derived from data collected across 50 globally distributed cities. This benchmark categorizes capabilities into primary dimensions of perception and reasoning, along with secondary dimensions and leaf tasks, ensuring a thorough evaluation. The evaluation of 3 proprietary and 21 open-source VLMs revealed critical limitations, emphasizing the need for further development in this area. By offering a structured approach to assess VLMs, CHOICE is positioned to serve as a valuable resource, providing insights into the challenges and potential of these models in the field …
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
Noise-Adaptive Regularization for Robust Multi-Label Remote Sensing Image Classification
NeutralArtificial Intelligence
A new method called Noise-Adaptive Regularization (NAR) has been proposed to improve multi-label classification in remote sensing, addressing the challenges posed by noisy annotations that can arise from cost-effective data collection methods. NAR distinguishes between additive and subtractive noise within a semi-supervised learning framework, enhancing the robustness of image classification.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about