Alibaba Technical Report: Qwen3-VL beats GPT-5 and Gemini 2.5 Pro on visual tasks and has 100% accuracy on "needle-in-a-haystack" tests for 30-minute videos (Jonathan Kemper/The Decoder)

TechmemeSunday, November 30, 2025 at 6:40:00 AM
Alibaba Technical Report: Qwen3-VL beats GPT-5 and Gemini 2.5 Pro on visual tasks and has 100% accuracy on "needle-in-a-haystack" tests for 30-minute videos (Jonathan Kemper/The Decoder)
  • Alibaba has released a technical report on its Qwen3-VL model, which outperforms competitors GPT-5 and Gemini 2.5 Pro in visual tasks and achieves 100% accuracy in 'needle-in-a-haystack' tests for 30-minute videos. This advancement highlights the model's capabilities in analyzing multimodal data, including video and images.
  • The success of Qwen3-VL is significant for Alibaba as it reinforces the company's position in the AI landscape, showcasing its technological advancements and commitment to innovation in multimodal AI applications. This could enhance its competitive edge in the rapidly evolving AI market.
  • This development reflects a broader trend in the AI industry, where companies are increasingly focusing on multimodal capabilities to improve performance across various tasks. As competition intensifies, particularly with the emergence of models like Gemini 3, the ability to excel in complex visual and analytical tasks will be crucial for maintaining leadership in AI technology.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
DeepSeek says its new DeepSeekMath-V2 model got gold-medal level status on the International Mathematical Olympiad 2025 and Chinese Mathematical Olympiad 2024 (Matthias Bastian/The Decoder)
PositiveArtificial Intelligence
Chinese startup DeepSeek has announced that its new AI model, DeepSeekMath-V2, has achieved gold medal status at both the International Mathematical Olympiad 2025 and the Chinese Mathematical Olympiad 2024, marking a significant milestone in its development.
Qwen3-VL can scan two-hour videos and pinpoint nearly every detail
PositiveArtificial Intelligence
Alibaba has released a technical report detailing its Qwen3-VL model, which demonstrates the ability to analyze two-hour video footage and excel in image-based math tasks. This advancement showcases the model's capabilities in processing multimodal data, integrating text, images, and video effectively.