Comparative Study of UNet-based Architectures for Liver Tumor Segmentation in Multi-Phase Contrast-Enhanced Computed Tomography

arXiv — cs.CVTuesday, November 25, 2025 at 5:00:00 AM
  • A comparative study has been conducted on UNet-based architectures for liver tumor segmentation in multi-phase contrast-enhanced computed tomography (CECT), revealing that ResNet-based models consistently outperform Transformer and Mamba-based alternatives. The study also highlights the effectiveness of integrating attention mechanisms, particularly the Convolutional Block Attention Module (CBAM), in enhancing segmentation quality.
  • This development is significant as it improves the accuracy of liver tumor detection, which is crucial for effective diagnosis and treatment planning in liver diseases. The findings suggest that leveraging advanced architectures can lead to better outcomes in medical imaging tasks.
  • The research underscores a growing trend in medical image segmentation towards hybrid architectures that combine the strengths of various neural network models. As the field evolves, the integration of attention mechanisms and the exploration of new architectures like Mamba and HyM-UNet reflect ongoing efforts to enhance diagnostic capabilities and address challenges in medical imaging.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
SigMA: Path Signatures and Multi-head Attention for Learning Parameters in fBm-driven SDEs
PositiveArtificial Intelligence
A new neural architecture named SigMA has been introduced, integrating path signatures with multi-head self-attention for parameter learning in stochastic differential equations (SDEs) driven by fractional Brownian motion (fBm). This approach addresses the challenges posed by non-Markovian processes, which complicate traditional parameter estimation techniques.
Characterizing Mamba's Selective Memory using Auto-Encoders
NeutralArtificial Intelligence
A recent study has characterized the selective memory of Mamba's state space models (SSMs) using auto-encoders, revealing the types of tokens and sequences that are frequently forgotten during long sequence processing. This research addresses a critical knowledge gap in understanding the information loss associated with SSMs in language modeling.
Model Agnostic Preference Optimization for Medical Image Segmentation
PositiveArtificial Intelligence
A new training framework called Model Agnostic Preference Optimization (MAPO) has been introduced for medical image segmentation, which utilizes Dropout-driven stochastic segmentation hypotheses to create preference-consistent gradients without relying on direct ground-truth supervision. This model-agnostic approach supports various architectures, including 2D/3D CNNs and Transformers.
MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos
PositiveArtificial Intelligence
The introduction of MS-Temba, a Multi-Scale Temporal Mamba model, addresses significant challenges in Temporal Action Detection (TAD) for untrimmed videos, particularly in Activities of Daily Living (ADL). This model enhances the ability to process long-duration videos, capture temporal variations, and detect overlapping actions effectively through the use of dilated State-space Models (SSMs).
Weakly Supervised Pneumonia Localization from Chest X-Rays Using Deep Neural Network and Grad-CAM Explanations
PositiveArtificial Intelligence
A recent study has introduced a weakly supervised deep learning framework for pneumonia classification and localization using Gradient-weighted Class Activation Mapping (Grad-CAM). This approach utilizes image-level labels to generate heatmaps that highlight pneumonia-affected regions in chest X-rays, addressing the challenge of obtaining detailed pixel-level annotations. Experimental results indicate high classification accuracy across various pre-trained models, including ResNet-18 and EfficientNet-B0.
Improving Underwater Acoustic Classification Through Learnable Gabor Filter Convolution and Attention Mechanisms
PositiveArtificial Intelligence
A new study has introduced GSE ResNeXt, a deep learning architecture that enhances underwater acoustic target classification by integrating learnable Gabor convolutional layers with a ResNeXt backbone and squeeze-and-excitation attention mechanisms. This innovation addresses the challenges posed by complex underwater noise and limited datasets, improving the model's ability to extract discriminative features.
Empirical Investigation of the Impact of Phase Information on Fault Diagnosis of Rotating Machinery
PositiveArtificial Intelligence
An empirical investigation has revealed that incorporating phase information significantly enhances fault diagnosis in rotating machinery. The study introduces two innovative phase-aware preprocessing strategies that effectively address random phase variations in multi-axis vibration data, demonstrating improvements across various deep learning architectures.
A Conditioned UNet for Music Source Separation
PositiveArtificial Intelligence
A novel conditioned UNet architecture has been proposed for Music Source Separation (MSS), allowing for the extraction of specific audio stems based on an audio query, thus eliminating the need for a strict instrument vocabulary. This approach leverages the recently developed MoisesDb dataset to enhance the realism of MSS tasks.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about