Qwen3-VL can scan two-hour videos and pinpoint nearly every detail

THE DECODER•Friday, November 28, 2025 at 6:47:03 PM

Positive•Artificial Intelligence

Alibaba has released a technical report on its Qwen3-VL model, which can analyze two-hour videos and performs strongly on image-based math tasks. The report presents the model as a multimodal system that processes text, images, and video together.
The release reinforces Alibaba's position in the competitive AI market, particularly in models built for complex data analysis tasks.
It also reflects a broader industry trend: companies are focusing on making their models more reliable and capable. Other recent releases, such as Google's Gemini 3 Pro and Meta's SAM 3, show the same push for stronger capabilities and user applications, which continues to drive competition in the sector.

— via World Pulse Now AI Editorial System

Continue Reading

THE DECODER•9 hours ago

The ARC benchmark's fall marks another casualty of relentless AI optimization

Negative•Artificial Intelligence

The ARC benchmark, once deemed a significant challenge for AI systems, is losing ground as modern AI optimization techniques continue to advance. It was previously regarded as a reliable measure of fluid intelligence rather than of memorization.

THE DECODER•9 hours ago

Programmers using AI ask fewer questions and may learn less deeply than with peers

Negative•Artificial Intelligence

Programmers using AI assistants such as GitHub Copilot reportedly ask fewer questions and accept code suggestions less critically, which may limit how deeply they learn. The trend raises concerns about heavy reliance on AI for coding tasks.

THE DECODER•10 hours ago

General Agentic Memory tackles context rot and outperforms RAG in memory benchmarks

Positive•Artificial Intelligence

A Chinese research team has introduced a new memory architecture for AI agents called General Agentic Memory (GAM), which aims to reduce information loss during prolonged interactions by integrating compression techniques with deep research methodologies.

THE DECODER•11 hours ago

Chatbots are now rivaling social networks as a core layer of internet infrastructure

Positive•Artificial Intelligence

New data from Similarweb indicates that chatbots are growing significantly and now rival social networks as a fundamental component of internet infrastructure. Traffic and app downloads are both rising, particularly among older demographics.

THE DECODER•12 hours ago

Pinokio 5.0 turns local machines into personal AI clouds

Positive•Artificial Intelligence

Pinokio 5.0 has launched with the aim of making open-source AI models as easy to run on personal hardware as a web application. The release is a step toward democratizing AI technology by letting users turn their own machines into personal AI clouds.

Techmeme•15 hours ago

Alibaba Technical Report: Qwen3-VL beats GPT-5 and Gemini 2.5 Pro on visual tasks and has 100% accuracy on "needle-in-a-haystack" tests for 30-minute videos (Jonathan Kemper/The Decoder)

Positive•Artificial Intelligence

Alibaba has released a technical report on its Qwen3-VL model, which outperforms competitors GPT-5 and Gemini 2.5 Pro on visual tasks and achieves 100% accuracy in "needle-in-a-haystack" tests for 30-minute videos. The results highlight the model's capabilities in analyzing multimodal data, including video and images.

THE DECODER•2 days ago

Microsoft unveils Fara-7B, a compact model for running AI-driven computer control locally

Positive•Artificial Intelligence

Microsoft has introduced Fara-7B, a compact AI model designed to operate user interfaces through visual input directly on personal computers. With 7 billion parameters, the model aims to perform complex tasks locally, improving user experience and privacy by reducing reliance on cloud services.

THE DECODER•2 days ago

DeepseekMath-V2 is Deepseek's latest attempt to pop the US AI bubble

Positive•Artificial Intelligence

Chinese startup Deepseek has announced that its new AI model, DeepseekMath-V2, has reached gold-medal level at the Math Olympiad, positioning the company as a strong competitor to Western AI labs.
