Where Do Vision-Language Models Fail? World Scale Analysis for Image Geolocalization
This study systematically evaluates vision-language models on country-level image geolocalization, revealing their limitations in capturing fine-grained geographic cues.
Siddhant Bharadwaj, Ashish Vashist, Fahimul Aleem et al.