World PulseNowPowered by AI

Trending:

Comba: Improving Bilinear RNNs with Closed-loop Control

arXiv — cs.LG•Thursday, December 4, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

The introduction of Comba, a novel variant of Bilinear RNNs, leverages closed-loop control theory to enhance recurrent memory management, presenting a scalar-plus-low-rank state transition model. This development builds on recent advancements in sequence modeling, including Gated DeltaNet and RWKV-7, which have improved performance through innovative memory supervision techniques.
Comba's design aims to address the limitations of existing state-space models and gated linear attentions, potentially offering superior performance in sequence modeling tasks. The implementation of a hardware-efficient chunk-wise parallel kernel in Triton further emphasizes its practical application in large-scale training scenarios.
This advancement reflects a broader trend in artificial intelligence towards integrating control theory with machine learning models, as seen in related innovations like Gated KalmaNet and DiffuApriel. These developments highlight ongoing efforts to enhance memory retention and inference efficiency in AI systems, addressing critical challenges in the field.

— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps

LucidQuery AI

Combines diffusion reasoning with autoregressive LLM for advanced AI analysis.

AI & DataTry the app

Chattermate

Build and deploy AI support agents without writing any code.

AI & DataTry the app

Agentcloud

Build and deploy custom AI agents with this open-source GPT platform.

AI & DataTry the app

Continue Readings

SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving

arXiv — cs.LGa day ago

SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving

PositiveArtificial Intelligence

A new framework named throttLL'eM has been introduced to optimize energy consumption during Large Language Model (LLM) inference by utilizing GPU frequency scaling while adhering to Service-Level Objectives (SLOs). This approach addresses the growing energy demands associated with LLMs, which are heavily reliant on GPUs for processing. The framework incorporates machine learning to predict future cache usage and batch sizes, allowing for efficient performance management.

Read full article

via arXiv — cs.LG

Traffic Image Restoration under Adverse Weather via Frequency-Aware Mamba

arXiv — cs.CVa day ago

Traffic Image Restoration under Adverse Weather via Frequency-Aware Mamba

PositiveArtificial Intelligence

A novel framework named Frequency-Aware Mamba (FAMamba) has been introduced to enhance traffic image restoration under adverse weather conditions, addressing a significant challenge in intelligent transportation systems. This architecture leverages frequency guidance alongside sequence modeling, featuring components like the Dual-Branch Feature Extraction Block and the Prior-Guided Block for improved texture detail recovery.

Read full article

via arXiv — cs.CV

PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer

arXiv — cs.CVa day ago

PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer

PositiveArtificial Intelligence

PanFoMa has been introduced as a lightweight hybrid neural network model designed to enhance pan-cancer research by addressing challenges in learning efficient single-cell representations and establishing a comprehensive evaluation benchmark. This model integrates the capabilities of Transformers and state-space models, enabling effective transcriptome modeling and capturing complex gene interactions.

Read full article

via arXiv — cs.CV

See Through Walls: AI's New Eye on Occluded Motion by Arvind Sundararajan

DEV Communitya day ago

See Through Walls: AI's New Eye on Occluded Motion by Arvind Sundararajan

PositiveArtificial Intelligence

A novel approach to motion capture using a deformable state space model has been developed, allowing AI to accurately track occluded motion, such as hands hidden behind objects. This advancement addresses the limitations of traditional computer vision systems that struggle with occlusions, leading to improved animation and robotic control.

Read full article

via DEV Community

Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation

arXiv — cs.CV2 days ago

Toward Content-based Indexing and Retrieval of Head and Neck CT with Abscess Segmentation

PositiveArtificial Intelligence

A new study has introduced AbscessHeNe, a dataset of 4,926 contrast-enhanced CT slices specifically focused on head and neck abscesses, which are critical for timely diagnosis and treatment. This dataset aims to enhance the development of semantic segmentation models that can accurately identify abscess boundaries and assess deep neck space involvement.

Read full article

via arXiv — cs.CV

MasHeNe: A Benchmark for Head and Neck CT Mass Segmentation using Window-Enhanced Mamba with Frequency-Domain Integration

arXiv — cs.CV2 days ago

MasHeNe: A Benchmark for Head and Neck CT Mass Segmentation using Window-Enhanced Mamba with Frequency-Domain Integration

PositiveArtificial Intelligence

A new dataset named MasHeNe has been introduced, comprising 3,779 contrast-enhanced CT slices that include both tumors and cysts, complete with pixel-level annotations. This initiative aims to fill the gap in existing public datasets that primarily focus on malignant lesions in head and neck imaging. The Windowing-Enhanced Mamba with Frequency integration (WEMF) model has been proposed, achieving a Dice score of 70.4, marking it as the top performer among evaluated methods.

Read full article

via arXiv — cs.CV

DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions

arXiv — cs.LG2 days ago

DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions

PositiveArtificial Intelligence

DF-Mamba, a new framework for 3D hand pose estimation, addresses the challenges of severe occlusions in hand interactions by leveraging deformable state space modeling to enhance visual feature extraction beyond traditional convolutional methods. This innovation aims to improve the accuracy of hand pose estimation in complex scenarios where hands overlap or are partially obscured.

Read full article

via arXiv — cs.LG