Hybrid Transformer-Mamba Architecture for Weakly Supervised Volumetric Medical Segmentation

arXiv — cs.CVFriday, December 12, 2025 at 5:00:00 AM
  • A new hybrid architecture named TranSamba has been proposed for weakly supervised volumetric medical segmentation, integrating a Vision Transformer backbone with Cross-Plane Mamba blocks. This design aims to enhance the model's ability to capture 3D context, improving object localization in volumetric medical imaging while maintaining efficient memory usage and linear time complexity with respect to input volume depth.
  • The introduction of TranSamba represents a significant advancement in the field of medical imaging, as it addresses the limitations of existing 2D encoders that fail to fully utilize the volumetric nature of medical data. This innovation is expected to improve the accuracy and efficiency of segmentation tasks, which are critical for diagnostics and treatment planning in healthcare.
  • The development of TranSamba aligns with ongoing trends in artificial intelligence, particularly the integration of advanced architectures like Vision Transformers in medical applications. This reflects a broader shift towards leveraging sophisticated machine learning techniques to enhance diagnostic capabilities, as seen in various frameworks addressing challenges in segmentation, classification, and assessment across different medical conditions.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
ZK-APEX: Zero-Knowledge Approximate Personalized Unlearning with Executable Proofs
PositiveArtificial Intelligence
ZK APEX introduces a zero-shot personalized unlearning method that allows models to forget specific data points without retraining, addressing privacy and compliance challenges in machine learning. This method combines sparse masking and a compensation step to ensure that personalized models can effectively forget targeted samples while maintaining local utility.
Empirical evaluation of the Frank-Wolfe methods for constructing white-box adversarial attacks
NeutralArtificial Intelligence
The empirical evaluation of Frank-Wolfe methods for constructing white-box adversarial attacks highlights the need for efficient adversarial attack construction in neural networks, particularly focusing on numerical optimization techniques. The study emphasizes the application of modified Frank-Wolfe methods to enhance the robustness of neural networks against adversarial threats, utilizing datasets like MNIST and CIFAR-10 for testing.
Hierarchical Attention for Sparse Volumetric Anomaly Detection in Subclinical Keratoconus
PositiveArtificial Intelligence
A recent study has introduced a hierarchical attention model for detecting sparse volumetric anomalies in subclinical keratoconus using three-dimensional anterior segment optical coherence tomography (AS-OCT). This model outperformed traditional convolutional neural networks (CNNs) and global-attention Vision Transformers (ViTs) by achieving 21-23% higher sensitivity and specificity in identifying subtle abnormalities.
Towards Visual Re-Identification of Fish using Fine-Grained Classification for Electronic Monitoring in Fisheries
PositiveArtificial Intelligence
A new study has developed an optimized deep learning pipeline for automated fish re-identification using the AutoFish dataset, which simulates Electronic Monitoring systems with conveyor belts featuring six similar fish species. The research demonstrates significant improvements in key metrics, achieving a peak performance of 41.65% mAP@k and 90.43% Rank-1 accuracy through advanced techniques like hard triplet mining and a custom image transformation pipeline.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about