Qwen3-VL can scan two-hour videos and pinpoint nearly every detail

THE DECODERFriday, November 28, 2025 at 6:47:03 PM
Qwen3-VL can scan two-hour videos and pinpoint nearly every detail
  • Alibaba has released a technical report detailing its Qwen3-VL model, which demonstrates the ability to analyze two-hour video footage and excel in image-based math tasks. This advancement showcases the model's capabilities in processing multimodal data, integrating text, images, and video effectively.
  • The introduction of Qwen3-VL is significant for Alibaba as it reinforces the company's position in the AI landscape, particularly in developing advanced models that can handle complex data analysis tasks. This positions Alibaba as a key player in the competitive AI market.
  • This development reflects a broader trend in the AI industry where companies are increasingly focusing on enhancing the reliability and performance of their models. As seen with other AI advancements, such as Google's Gemini 3 Pro and Meta's SAM 3, the push for improved capabilities and user applications continues to drive innovation and competition in the sector.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
The ARC benchmark's fall marks another casualty of relentless AI optimization
NegativeArtificial Intelligence
The ARC benchmark, once deemed a significant challenge for AI systems, has recently shown signs of decline as modern AI optimization techniques continue to advance. This benchmark was previously a reliable measure of fluid intelligence, distinguishing it from mere memorization tasks.
Programmers using AI ask fewer questions and may learn less deeply than with peers
NegativeArtificial Intelligence
Programmers utilizing AI assistants like GitHub Copilot are reportedly asking fewer questions and accepting code suggestions with less critical evaluation, potentially hindering their depth of learning. This trend raises concerns about the implications of relying heavily on AI for coding tasks.
General Agentic Memory tackles context rot and outperforms RAG in memory benchmarks
PositiveArtificial Intelligence
A Chinese research team has introduced a new memory architecture for AI agents called General Agentic Memory (GAM), which aims to reduce information loss during prolonged interactions by integrating compression techniques with deep research methodologies.
Chatbots are now rivaling social networks as a core layer of internet infrastructure
PositiveArtificial Intelligence
New data from Similarweb indicates that chatbots are experiencing significant growth, rivaling social networks as a fundamental component of internet infrastructure, with increased traffic and app downloads, particularly among older demographics.
Pinokio 5.0 turns local machines into personal AI clouds
PositiveArtificial Intelligence
Pinokio 5.0 has been launched to simplify the process of running open-source AI models on personal hardware, aiming to make it as user-friendly as a web application. This development represents a significant step towards democratizing AI technology by allowing users to leverage their own machines as personal AI clouds.
Alibaba Technical Report: Qwen3-VL beats GPT-5 and Gemini 2.5 Pro on visual tasks and has 100% accuracy on "needle-in-a-haystack" tests for 30-minute videos (Jonathan Kemper/The Decoder)
PositiveArtificial Intelligence
Alibaba has released a technical report on its Qwen3-VL model, which outperforms competitors GPT-5 and Gemini 2.5 Pro in visual tasks and achieves 100% accuracy in 'needle-in-a-haystack' tests for 30-minute videos. This advancement highlights the model's capabilities in analyzing multimodal data, including video and images.
Microsoft unveils Fara-7B, a compact model for running AI-driven computer control locally
PositiveArtificial Intelligence
Microsoft has introduced the Fara-7B, a compact AI model designed to operate user interfaces through visual input directly on personal computers. This model, featuring 7 billion parameters, aims to perform complex tasks locally, enhancing user experience and privacy by reducing reliance on cloud services.
DeepseekMath-V2 is Deepseek's latest attempt to pop the US AI bubble
PositiveArtificial Intelligence
Chinese startup Deepseek has announced that its new AI model, DeepseekMath-V2, has achieved gold medal status at the Math Olympiad, positioning the company as a strong competitor against Western AI labs.