Empowering DINO Representations for Underwater Instance Segmentation via Aligner and Prompter

arXiv — cs.CVWednesday, November 12, 2025 at 5:00:00 AM
The introduction of DiveSeg marks a significant advancement in underwater instance segmentation (UIS), a critical technology for marine resource exploration and ecological protection. By leveraging the DINO model as a feature learner, DiveSeg enhances the UIS process through two innovative components: the AquaStyle Aligner, which embeds underwater color style features into DINO's fine-tuning, and the ObjectPrior Prompter, which provides essential object-level guidance through binary segmentation-based prompts. Thorough experiments conducted on the UIIS and USIS10K datasets reveal that DiveSeg achieves state-of-the-art performance, underscoring its potential impact on the field. This development not only showcases the capabilities of large-scale pretrained visual models but also emphasizes the importance of adapting these technologies to specific domains, such as underwater environments, where traditional methods may fall short.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended Readings
NP-LoRA: Null Space Projection Unifies Subject and Style in LoRA Fusion
PositiveArtificial Intelligence
The article introduces NP-LoRA, a novel framework for Low-Rank Adaptation (LoRA) fusion that addresses the issue of interference in existing methods. Traditional weight-based merging often leads to one LoRA dominating another, resulting in degraded fidelity. NP-LoRA utilizes a projection-based approach to maintain subspace separation, thereby enhancing the quality of fusion by preventing structural interference among principal directions.