UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity

arXiv — cs.LG · Tuesday, November 18, 2025 at 5:00:00 AM
  • UnSAMv2 extends the Segment Anything Model (SAM) by enabling segmentation at any granularity without human annotations. This addresses a key limitation of SAM, which often requires manual adjustment to reach the desired level of detail, and marks a significant improvement in the field of computer vision.
  • The introduction of UnSAMv2 is important because it reduces the dependency on dense annotations, which are costly and time-consuming to produce.
— via World Pulse Now AI Editorial System


Recommended Readings
New augmented reality tech can turn any surface into keyboard
Negative · Artificial Intelligence
Virtual keyboards in augmented reality (AR) often frustrate users due to their slow response and high error rates. Users experience discomfort, commonly referred to as 'gorilla arm,' from raising their arms to type on these virtual surfaces.
Weight Variance Amplifier Improves Accuracy in High-Sparsity One-Shot Pruning
Positive · Artificial Intelligence
Deep neural networks excel at visual recognition tasks, but their large parameter counts hinder practical deployment. One-shot pruning has emerged as a way to shrink models without retraining; however, aggressive pruning often causes significant accuracy drops. Existing optimizers such as SAM and CrAM mitigate this issue but require additional computation. The proposed Variance Amplifying Regularizer (VAR) instead increases parameter variance during training, improving pruning robustness while maintaining accuracy.
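The summary does not give VAR's exact formulation, but the idea of pairing a variance-amplifying penalty with one-shot magnitude pruning can be sketched as follows. The `-λ·Var(W)` form of the penalty and both function names are illustrative assumptions, not the paper's actual method:

```python
import numpy as np

def variance_regularizer(weights, lam=1e-3):
    """Hypothetical variance-amplifying penalty (sign makes it a reward):
    added to the task loss so training spreads weights out, separating
    'keep' from 'drop' magnitudes more cleanly before pruning."""
    return -lam * np.var(weights)

def one_shot_magnitude_prune(weights, sparsity=0.9):
    """Zero out the smallest-magnitude fraction of weights, no retraining."""
    k = int(sparsity * weights.size)
    threshold = np.sort(np.abs(weights), axis=None)[k]
    return np.where(np.abs(weights) >= threshold, weights, 0.0)

rng = np.random.default_rng(0)
w = rng.normal(0.0, 1.0, size=(64, 64))
pruned = one_shot_magnitude_prune(w, sparsity=0.9)
print(np.mean(pruned == 0.0))  # fraction zeroed, close to 0.9
```

Higher weight variance means fewer parameters sit near the pruning threshold, so the one-shot cut removes less information at a given sparsity level.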
SMOL-MapSeg: Show Me One Label as prompt
Positive · Artificial Intelligence
SMOL-MapSeg, a new segmentation model, addresses the challenges posed by historical maps in modern segmentation tasks. Traditional deep learning models like UNet and Segment Anything Model (SAM) struggle with the variability in visual styles and symbols found in these maps. The proposed On-Need Declarative (OND) prompting method allows users to provide explicit image-label pair prompts, facilitating flexible and concept-aware segmentation tailored to historical map analysis.
LENS: Learning to Segment Anything with Unified Reinforced Reasoning
Positive · Artificial Intelligence
LENS is a new reinforcement-learning framework for text-prompted image segmentation, a capability central to human-computer interaction and robotics. Unlike traditional supervised fine-tuning, LENS performs explicit chain-of-thought reasoning at test time, improving generalization to unseen prompts. Built on a 3-billion-parameter vision-language model, LENS achieves an average cIoU of 81.2% on benchmark datasets, surpassing existing fine-tuning methods.
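For context on the reported metric: in referring-segmentation benchmarks, cIoU (cumulative IoU) is commonly computed by summing intersections and unions over the whole dataset before dividing, rather than averaging per-image IoUs. A minimal sketch, assuming that common definition (the summary does not specify which variant LENS uses):

```python
import numpy as np

def cumulative_iou(preds, gts):
    """Cumulative IoU: pool all pixel intersections and unions across the
    dataset, then divide once. Larger masks therefore weigh more than in
    a per-image mean IoU."""
    inter = sum(np.logical_and(p, g).sum() for p, g in zip(preds, gts))
    union = sum(np.logical_or(p, g).sum() for p, g in zip(preds, gts))
    return inter / union

# Two toy binary masks
p1 = np.array([[1, 1], [0, 0]], dtype=bool)
g1 = np.array([[1, 0], [0, 0]], dtype=bool)
p2 = np.array([[0, 1], [1, 1]], dtype=bool)
g2 = np.array([[0, 1], [1, 1]], dtype=bool)
print(cumulative_iou([p1, p2], [g1, g2]))  # (1 + 3) / (2 + 3) = 0.8
```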
MixAR: Mixture Autoregressive Image Generation
Positive · Artificial Intelligence
MixAR is a new framework introduced to enhance image generation through autoregressive (AR) modeling. Traditional AR approaches, which utilize discrete tokens from a limited codebook, often lose fine-grained details due to quantization. Recent advancements have shifted towards continuous latent spaces for improved quality, but these spaces present challenges for efficient modeling. MixAR addresses these issues by integrating discrete tokens as prior guidance, facilitating better continuous AR modeling and potentially leading to higher fidelity in generated images.
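The quantization loss the summary alludes to comes from rounding continuous latents to a finite codebook. A minimal nearest-neighbor quantization sketch (standard vector quantization for illustration, not MixAR's actual architecture) shows why fine-grained detail is discarded:

```python
import numpy as np

def quantize(latents, codebook):
    """Map each continuous latent to its nearest codebook entry: the
    discrete-token step whose rounding discards fine-grained detail."""
    # Squared distances between latents (N, D) and codebook entries (K, D)
    d = ((latents[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    idx = d.argmin(axis=1)             # discrete token ids, shape (N,)
    return idx, codebook[idx]          # tokens + dequantized latents

rng = np.random.default_rng(0)
codebook = rng.normal(size=(16, 8))    # tiny 16-entry codebook
latents = rng.normal(size=(32, 8))     # continuous latents to encode
tokens, recon = quantize(latents, codebook)
err = np.mean((latents - recon) ** 2)  # quantization error: detail lost
print(err > 0)  # rounding to a finite codebook is lossy
```

MixAR's reported approach keeps the discrete tokens only as prior guidance while modeling the residual detail in the continuous latent space, rather than committing to the rounded values alone.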