Hybrid Convolution and Frequency State Space Network for Image Compression

arXiv — cs.CV•Wednesday, November 26, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

A new architecture named HCFSSNet has been introduced, combining Convolutional Neural Networks (CNNs) with a Vision Frequency State Space block to enhance learned image compression (LIC). This hybrid approach captures local high-frequency details while effectively modeling long-range low-frequency information, addressing limitations seen in traditional methods.
The development of HCFSSNet is significant as it aims to improve the efficiency and quality of image compression, which is crucial for applications in various fields, including digital media and medical imaging, where high fidelity is essential.
This advancement reflects a broader trend in AI and image processing, where hybrid models are increasingly utilized to leverage the strengths of different architectures, such as CNNs and Transformers, to tackle complex challenges in image analysis and compression.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Continue Readings

arXiv — cs.CV16 hours ago

One Patch is All You Need: Joint Surface Material Reconstruction and Classification from Minimal Visual Cues

PositiveArtificial Intelligence

A new model named SMARC has been introduced, enabling surface material reconstruction and classification from minimal visual cues, specifically using just a 10% contiguous patch of an image. This approach addresses the limitations of existing methods that require dense observations, making it particularly useful in constrained environments.

Read full article

via arXiv — cs.CV

arXiv — cs.LG2 days ago

Cross-Domain Generalization of Multimodal LLMs for Global Photovoltaic Assessment

PositiveArtificial Intelligence

A study has demonstrated the cross-domain generalization capabilities of a multimodal large language model (LLM) for assessing global photovoltaic (PV) systems, addressing challenges posed by undocumented installations and the limitations of traditional computer vision models. The model integrates detection, localization, and quantification, achieving superior performance across unseen regions compared to conventional methods.

Read full article

via arXiv — cs.LG

arXiv — cs.CV3 days ago

HyM-UNet: Synergizing Local Texture and Global Context via Hybrid CNN-Mamba Architecture for Medical Image Segmentation

PositiveArtificial Intelligence

A novel hybrid architecture named HyM-UNet has been proposed to enhance medical image segmentation by combining the local feature extraction strengths of Convolutional Neural Networks (CNNs) with the global modeling capabilities of Mamba. This architecture employs a Hierarchical Encoder and a Mamba-Guided Fusion Skip Connection to effectively bridge local and global features, addressing the limitations of traditional CNNs in capturing complex anatomical structures.

Read full article

via arXiv — cs.CV

arXiv — cs.CV3 days ago

Stage-Specific Benchmarking of Deep Learning Models for Glioblastoma Follow-Up MRI

NeutralArtificial Intelligence

A recent study has benchmarked deep learning models for differentiating true tumor progression from treatment-related pseudoprogression in glioblastoma using follow-up MRI scans from the Burdenko GBM Progression cohort. The analysis involved various deep learning architectures, revealing comparable accuracies across stages, with improved discrimination at later follow-ups.

Read full article

via arXiv — cs.CV

arXiv — cs.CV3 days ago

CoD: A Diffusion Foundation Model for Image Compression

PositiveArtificial Intelligence

CoD, a new compression-oriented diffusion foundation model, has been introduced to enhance image compression efficiency, particularly at ultra-low bitrates. Unlike existing models that rely on text conditioning, CoD is designed for end-to-end optimization of both compression and generation, achieving state-of-the-art results when integrated with downstream codecs like DiffC.

Read full article

via arXiv — cs.CV

arXiv — cs.CV3 days ago

Peregrine: One-Shot Fine-Tuning for FHE Inference of General Deep CNNs

PositiveArtificial Intelligence

The recent paper titled 'Peregrine: One-Shot Fine-Tuning for FHE Inference of General Deep CNNs' addresses key challenges in adapting deep convolutional neural networks (CNNs) for fully homomorphic encryption (FHE) inference. It introduces a single-stage fine-tuning strategy and a generalized interleaved packing scheme to enhance the performance of CNNs while maintaining accuracy and supporting high-resolution image processing.

Read full article

via arXiv — cs.CV