CORA: Consistency-Guided Semi-Supervised Framework for Reasoning Segmentation

arXiv — cs.CV•Tuesday, November 25, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

CORA has been introduced as a semi-supervised reasoning segmentation framework that enhances pixel-accurate masks for targets based on complex instructions, addressing limitations in generalization and the high costs of annotation. The framework leverages both limited labeled data and a large set of unlabeled images, incorporating conditional visual instructions and a pseudo-label filter for improved consistency in outputs.
This development is significant as it aims to improve the performance of segmentation tasks in AI, particularly in scenarios where high-quality annotations are scarce. By utilizing a semi-supervised approach, CORA could potentially reduce the reliance on extensive labeled datasets while enhancing the robustness of segmentation models.
The introduction of CORA aligns with ongoing advancements in multimodal language models and their application in various domains, including autonomous driving. As seen in related datasets like CARScenes, the integration of vision-language models is crucial for enhancing scene understanding, indicating a broader trend towards improving AI's interpretative capabilities in complex environments.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

LucidQuery AI

Combines diffusion reasoning with autoregressive LLM for advanced AI analysis.

AI & DataView app details

Attentive AI

Extract digital maps from satellite, aerial, and drone imagery using deep learning.

AI & DataView app details

Augmeta

AI peers for collaborative problem-solving and enhanced team productivity.

AI & DataView app details

Continue Readings

arXiv — cs.LG2 days ago

The Missing Point in Vision Transformers for Universal Image Segmentation

PositiveArtificial Intelligence

A novel two-stage segmentation framework named ViT-P has been introduced to enhance image segmentation tasks in computer vision. This framework decouples mask generation from classification, utilizing a proposal generator for class-agnostic mask proposals and a point-based classification model based on Vision Transformers to refine predictions. The approach aims to address challenges such as ambiguous boundaries and imbalanced class distributions in mask classification.

Read full article

via arXiv — cs.LG

arXiv — cs.CV3 days ago

Fast and Flexible Robustness Certificates for Semantic Segmentation

PositiveArtificial Intelligence

A new class of certifiably robust Semantic Segmentation networks has been introduced, featuring built-in Lipschitz constraints that enhance their efficiency and pixel accuracy on challenging datasets like Cityscapes. This advancement addresses the vulnerability of Deep Neural Networks to small perturbations that can significantly alter predictions.

Read full article

via arXiv — cs.CV

arXiv — cs.LG3 days ago

Selective Masking based Self-Supervised Learning for Image Semantic Segmentation

PositiveArtificial Intelligence

A novel self-supervised learning method for semantic segmentation has been proposed, utilizing selective masking for image reconstruction as a pretraining task. This method improves upon traditional random masking techniques by focusing on image patches with the highest reconstruction loss, demonstrating superior performance on datasets such as Pascal VOC and Cityscapes.

Read full article

via arXiv — cs.LG