Object Counting with GPT-4o and GPT-5: A Comparative Study
PositiveArtificial Intelligence
- A comparative study has been conducted on the object counting capabilities of two multi-modal large language models, GPT-4o and GPT-5, focusing on their performance in zero-shot scenarios using only textual prompts. The evaluation was carried out on the FSC-147 and CARPK datasets, revealing that both models achieved results comparable to state-of-the-art methods, with some instances exceeding them.
- This development highlights the potential of leveraging advanced language models for complex tasks like object counting without the need for extensive annotated data or visual examples, marking a significant step forward in AI capabilities.
- The findings resonate with ongoing discussions in the AI community regarding the efficacy of large language models in various applications, including their role in enhancing vision-language synergy and addressing challenges in object recognition and counting, which are critical for advancing AI's practical applications.
— via World Pulse Now AI Editorial System

