Comparative Study of UNet-based Architectures for Liver Tumor Segmentation in Multi-Phase Contrast-Enhanced Computed Tomography

arXiv (cs.CV) • Tuesday, November 25, 2025 at 5:00:00 AM

Positive • Artificial Intelligence

A comparative study of UNet-based architectures for liver tumor segmentation in multi-phase contrast-enhanced computed tomography (CECT) finds that ResNet-based models consistently outperform Transformer- and Mamba-based alternatives. It also finds that adding attention mechanisms, in particular the Convolutional Block Attention Module (CBAM), improves segmentation quality.
The result matters because more accurate liver tumor segmentation supports diagnosis and treatment planning for liver disease. The findings suggest that the choice of backbone and attention module has a measurable effect on segmentation quality in medical imaging tasks.
The research reflects a growing trend in medical image segmentation toward hybrid architectures that combine the strengths of different neural network families. Attention mechanisms and newer designs such as Mamba and HyM-UNet are part of ongoing efforts to improve diagnostic capability and address open challenges in medical imaging.
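
The summary names CBAM but does not say how the study wires it into the UNet variants. As a rough illustration only, the sketch below follows the commonly described CBAM recipe: a channel-attention step (a shared two-layer MLP applied to average-pooled and max-pooled channel descriptors) followed by a spatial-attention step (a small convolution over the channel-wise average and max maps). The function names, the reduction ratio `r = 4`, the 7x7 kernel, and the random weights are all assumptions made for the example, not details taken from the paper.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(x, w1, w2):
    """x: (C, H, W); w1: (C // r, C); w2: (C, C // r). Returns (C, 1, 1) weights."""
    avg = x.mean(axis=(1, 2))            # average-pooled channel descriptor
    mx = x.max(axis=(1, 2))              # max-pooled channel descriptor

    def mlp(v):
        # Shared two-layer MLP with a ReLU in between.
        return w2 @ np.maximum(w1 @ v, 0.0)

    return sigmoid(mlp(avg) + mlp(mx))[:, None, None]

def spatial_attention(x, kernel):
    """x: (C, H, W); kernel: (2, k, k) with odd k. Returns (1, H, W) weights."""
    pooled = np.stack([x.mean(axis=0), x.max(axis=0)])   # (2, H, W)
    k = kernel.shape[-1]
    pad = k // 2
    padded = np.pad(pooled, ((0, 0), (pad, pad), (pad, pad)))  # zero padding
    h, w = x.shape[1:]
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            # Cross-correlation of the 2-channel map with the kernel.
            out[i, j] = np.sum(padded[:, i:i + k, j:j + k] * kernel)
    return sigmoid(out)[None]

def cbam(x, w1, w2, kernel):
    """Apply channel attention, then spatial attention, to a feature map."""
    x = x * channel_attention(x, w1, w2)
    return x * spatial_attention(x, kernel)

# Hypothetical feature map and randomly initialised weights (illustration only).
rng = np.random.default_rng(0)
C, H, W, r = 8, 16, 16, 4
features = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1
w2 = rng.standard_normal((C, C // r)) * 0.1
kernel = rng.standard_normal((2, 7, 7)) * 0.1

refined = cbam(features, w1, w2, kernel)
print(refined.shape)
```

Both attention maps lie strictly between 0 and 1, so the module only rescales features and leaves the tensor shape unchanged. That is why a block like this can be dropped into an existing encoder or decoder stage without altering the surrounding layers.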

— via World Pulse Now AI Editorial System



