Federated Style-Aware Transformer Aggregation of Representations

arXiv — cs.LG · Tuesday, November 25, 2025 at 5:00:00 AM
  • The introduction of FedSTAR, a style-aware federated learning framework, addresses key challenges in Personalized Federated Learning (PFL) such as domain heterogeneity and data imbalance. By utilizing a Transformer-based attention mechanism, FedSTAR effectively disentangles client-specific style factors from shared content representations, enhancing personalization in model predictions.
  • This development is significant because it allows adaptive weighting of client contributions while minimizing communication overhead. By exchanging compact prototypes and style vectors instead of full model parameters, FedSTAR improves both efficiency and personalization in federated learning environments; a rough sketch of such style-weighted aggregation follows this summary.
  • The emergence of FedSTAR reflects a broader trend in artificial intelligence towards enhancing model personalization and efficiency. Similar advancements in Transformer architectures, such as Algebraformer and DeepCoT, indicate a growing focus on optimizing computational resources while addressing complex data challenges across various applications.
— via World Pulse Now AI Editorial System
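The following is a minimal, hypothetical sketch of the style-weighted aggregation idea described above: each client uploads compact class prototypes plus a style vector, and the server attends over the style vectors to weight client contributions for a given target client. Function and variable names are illustrative and not taken from the FedSTAR paper.

    # Hypothetical sketch of style-aware prototype aggregation (not FedSTAR's actual code).
    # Each client uploads class prototypes (content) plus a compact style vector;
    # the server attends over style vectors to weight client contributions.
    import numpy as np

    def aggregate(prototypes, styles, target_style, temperature=1.0):
        """prototypes: (n_clients, n_classes, d); styles: (n_clients, s);
        target_style: (s,) style of the client being personalized for."""
        # Attention scores: similarity between the target client's style and every
        # client's style vector, normalized with a softmax.
        scores = styles @ target_style / temperature
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()
        # Adaptive weighting of client prototypes instead of plain averaging.
        return np.tensordot(weights, prototypes, axes=1)  # (n_classes, d)

    # Toy usage: 3 clients, 5 classes, 16-dim prototypes, 4-dim style vectors.
    rng = np.random.default_rng(0)
    protos = rng.normal(size=(3, 5, 16))
    styles = rng.normal(size=(3, 4))
    personalized = aggregate(protos, styles, target_style=styles[0])
    print(personalized.shape)  # (5, 16)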


Continue Reading
BCWildfire: A Long-term Multi-factor Dataset and Deep Learning Benchmark for Boreal Wildfire Risk Prediction
Positive · Artificial Intelligence
A new dataset titled 'BCWildfire' has been introduced, providing a comprehensive 25-year daily-resolution record of wildfire risk across 240 million hectares in British Columbia. This dataset includes 38 covariates such as active fire detections, weather variables, fuel conditions, terrain features, and human activity, addressing the scarcity of publicly available benchmark datasets for wildfire risk prediction.
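As a rough illustration of how daily covariates of this kind can be framed as a tabular risk-prediction task, here is a small sketch with synthetic data; the column names are invented for the example and do not reflect the BCWildfire schema.

    # Hypothetical framing of daily wildfire-risk prediction from tabular covariates
    # (column names are illustrative, not the BCWildfire schema).
    import numpy as np
    import pandas as pd
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(42)
    n = 1000
    df = pd.DataFrame({
        "temp_c": rng.normal(18, 8, n),          # weather covariate
        "rel_humidity": rng.uniform(10, 90, n),  # weather covariate
        "fuel_moisture": rng.uniform(5, 40, n),  # fuel-condition covariate
        "slope_deg": rng.uniform(0, 45, n),      # terrain covariate
        "fire_next_day": rng.integers(0, 2, n),  # label: active fire detected next day
    })

    X = df.drop(columns="fire_next_day").to_numpy()
    y = df["fire_next_day"].to_numpy()
    # Any tabular classifier could serve as a simple baseline on such a benchmark.
    model = LogisticRegression(max_iter=1000).fit(X[:800], y[:800])
    print("held-out accuracy:", model.score(X[800:], y[800:]))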
MapFormer: Self-Supervised Learning of Cognitive Maps with Input-Dependent Positional Embeddings
Positive · Artificial Intelligence
A new architecture called MapFormer has been introduced, which utilizes self-supervised learning to create cognitive maps from observational data. This model, based on Transformer technology, aims to enhance AI's ability to generalize across different situations, a capability that has been lacking in existing systems.
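A minimal sketch of the input-dependent positional embedding idea, assuming it means deriving the position code from the observation itself rather than from a fixed index table; the module name and dimensions are illustrative, not MapFormer's actual design.

    # Minimal sketch: position codes computed from the input content itself.
    import torch
    import torch.nn as nn

    class InputDependentPositionalEmbedding(nn.Module):
        def __init__(self, d_model):
            super().__init__()
            # Maps each token's content to a position code of the same width.
            self.pos_net = nn.Sequential(nn.Linear(d_model, d_model), nn.GELU(),
                                         nn.Linear(d_model, d_model))

        def forward(self, x):            # x: (batch, seq_len, d_model)
            return x + self.pos_net(x)   # add content-derived position code

    # Usage: feed embedded observations, then any standard Transformer encoder.
    x = torch.randn(2, 10, 64)
    emb = InputDependentPositionalEmbedding(64)(x)
    layer = nn.TransformerEncoderLayer(64, nhead=4, batch_first=True)
    encoder = nn.TransformerEncoder(layer, num_layers=2)
    print(encoder(emb).shape)  # torch.Size([2, 10, 64])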
DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search
Positive · Artificial Intelligence
A novel framework called Divide-and-Conquer Incremental Search (DCIS) has been proposed to enhance the fine-tuning of large language models (LLMs) by optimizing the scaling factors of Rotary Position Embedding (RoPE). This approach aims to extend the context length of LLMs while mitigating performance decay during fine-tuning, addressing the limitations of traditional methods that often lead to increased costs and reduced efficiency.
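To make the mechanism concrete, here is a hedged toy sketch of how a RoPE scaling factor stretches rotary frequencies and how a divide-and-conquer search could narrow in on a good factor; the evaluate function is a stand-in for whatever validation metric DCIS actually optimizes.

    # Toy sketch: RoPE scaling plus a divide-and-conquer search over scaling factors.
    import numpy as np

    def rope_frequencies(dim, scale=1.0, base=10000.0):
        # Standard RoPE inverse frequencies; dividing by `scale` (position
        # interpolation) is one common way to extend the usable context length.
        inv_freq = 1.0 / (base ** (np.arange(0, dim, 2) / dim))
        return inv_freq / scale

    def evaluate(scale):
        # Hypothetical placeholder: in practice this would be long-context
        # perplexity after brief fine-tuning with the given scaling factor.
        return -(scale - 3.2) ** 2

    def dc_search(lo, hi, depth=4):
        """Recursively split [lo, hi] and keep the half with the better midpoint."""
        if depth == 0:
            return (lo + hi) / 2
        mid = (lo + hi) / 2
        left, right = (lo + mid) / 2, (mid + hi) / 2
        return dc_search(lo, mid, depth - 1) if evaluate(left) >= evaluate(right) \
            else dc_search(mid, hi, depth - 1)

    best = dc_search(1.0, 8.0)
    print("chosen scaling factor:", round(best, 2))
    print("scaled frequencies:", rope_frequencies(8, scale=best)[:2])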
DualGazeNet: A Biologically Inspired Dual-Gaze Query Network for Salient Object Detection
Positive · Artificial Intelligence
DualGazeNet has been introduced as a biologically inspired dual-gaze query network aimed at enhancing salient object detection (SOD) while minimizing architectural complexity. This framework seeks to overcome challenges faced by existing SOD methods, which often suffer from feature redundancy and performance bottlenecks due to their intricate designs. By simplifying the architecture, DualGazeNet aims to achieve state-of-the-art accuracy and computational efficiency.
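The sketch below loosely mirrors the dual-gaze query idea with two learned query sets cross-attending to image features; it is an illustration only, and the module names, query counts, and fusion step are assumptions rather than the actual DualGazeNet architecture.

    # Illustrative sketch: two learned query sets ("gazes") reading image features.
    import torch
    import torch.nn as nn

    class DualGazeQuery(nn.Module):
        def __init__(self, d=64, n_queries=8):
            super().__init__()
            self.gaze_a = nn.Parameter(torch.randn(n_queries, d))  # e.g. coarse gaze
            self.gaze_b = nn.Parameter(torch.randn(n_queries, d))  # e.g. fine gaze
            self.attn = nn.MultiheadAttention(d, num_heads=4, batch_first=True)
            self.fuse = nn.Linear(2 * d, d)

        def forward(self, feats):                       # feats: (B, HW, d)
            b = feats.size(0)
            qa = self.gaze_a.unsqueeze(0).expand(b, -1, -1)
            qb = self.gaze_b.unsqueeze(0).expand(b, -1, -1)
            out_a, _ = self.attn(qa, feats, feats)      # each gaze attends to the features
            out_b, _ = self.attn(qb, feats, feats)
            return self.fuse(torch.cat([out_a, out_b], dim=-1))  # fused gaze tokens

    feats = torch.randn(2, 196, 64)                     # e.g. a flattened 14x14 feature map
    print(DualGazeQuery()(feats).shape)                 # torch.Size([2, 8, 64])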
DeepCoT: Deep Continual Transformers for Real-Time Inference on Data Streams
Positive · Artificial Intelligence
The introduction of DeepCoT, or Deep Continual Transformers, represents a significant advancement in real-time inference on data streams, addressing the challenges of high computational costs and redundancy in existing models. This encoder-only model is designed to work with deep architectures while maintaining performance across audio, video, and text streams.
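A rough illustration of continual (streaming) attention is sketched below: projections for past frames are cached so each new frame adds only one query/key/value instead of re-running attention over the whole window. This is a generic sketch of that caching idea, not DeepCoT's implementation.

    # Hedged sketch of continual attention over a sliding window of cached keys/values.
    import numpy as np

    class ContinualAttention:
        def __init__(self, d, window):
            rng = np.random.default_rng(0)
            self.Wq, self.Wk, self.Wv = (rng.normal(size=(d, d)) / np.sqrt(d) for _ in range(3))
            self.window, self.keys, self.values = window, [], []

        def step(self, x):                       # x: (d,) newest frame embedding
            q = x @ self.Wq
            self.keys.append(x @ self.Wk)        # compute projections once per frame
            self.values.append(x @ self.Wv)
            if len(self.keys) > self.window:     # slide the window forward
                self.keys.pop(0); self.values.pop(0)
            K, V = np.stack(self.keys), np.stack(self.values)
            w = np.exp(K @ q / np.sqrt(len(q)))
            return (w / w.sum()) @ V             # attention output for the newest frame

    attn = ContinualAttention(d=32, window=16)
    stream = np.random.default_rng(1).normal(size=(100, 32))
    outputs = [attn.step(frame) for frame in stream]   # roughly constant work per new frame
    print(len(outputs), outputs[-1].shape)             # 100 (32,)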
Accelerating Time Series Foundation Models with Speculative Decoding
Positive · Artificial Intelligence
A new framework has been proposed to accelerate time-series forecasting using speculative decoding, which leverages a smaller draft model to suggest future time-series patches that are verified by a larger target model. This approach aims to reduce computational costs associated with large-scale Transformer-based models, which are essential for real-time applications like content recommendation and dynamic pricing.
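The sketch below shows the speculative-decoding loop in miniature for time-series patches: a cheap draft model proposes the next patches and a larger target model accepts the longest prefix it agrees with. The draft and target models here are trivial stand-ins, and the acceptance rule is an assumption for illustration.

    # Toy speculative decoding for time-series patches (stand-in models, not the paper's).
    import numpy as np

    def draft_model(history, k=4):
        # Cheap proposal: repeat the last observed patch k times.
        return [history[-1].copy() for _ in range(k)]

    def target_model(history):
        # "Expensive" reference prediction: mean of the last 3 patches.
        return np.mean(history[-3:], axis=0)

    def speculative_forecast(history, horizon, k=4, tol=0.5):
        history = list(history)
        while len(history) < horizon:
            proposals = draft_model(history, k)
            for p in proposals:                        # verify proposals in order
                target = target_model(history)
                if np.max(np.abs(p - target)) <= tol:  # accept if close to the target model
                    history.append(p)
                else:                                  # reject: fall back to the target output
                    history.append(target)
                    break                              # re-draft from the corrected history
        return np.stack(history[:horizon])

    series = [np.array([float(i)]) for i in range(5)]
    print(speculative_forecast(series, horizon=12).ravel())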
SAMBA: Toward a Long-Context EEG Foundation Model via Spatial Embedding and Differential Mamba
Positive · Artificial Intelligence
A new framework named SAMBA has been introduced to enhance long-sequence electroencephalogram (EEG) modeling, addressing the challenges posed by high sampling rates and extended recording durations. This self-supervised learning model utilizes a Mamba-based U-shaped encoder-decoder architecture to effectively capture long-range temporal dependencies and spatial variability in EEG data.
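As a rough shape-only sketch of a U-shaped encoder-decoder over long EEG sequences: a GRU stands in below for the paper's Mamba blocks purely so the example runs without extra dependencies, and the channel counts, depths, and skip connection are illustrative assumptions.

    # Rough U-shaped sketch over long EEG sequences (GRU as a stand-in for Mamba).
    import torch
    import torch.nn as nn

    class TinyUSeq(nn.Module):
        def __init__(self, c_in=19, d=32):
            super().__init__()
            self.down = nn.Conv1d(c_in, d, kernel_size=4, stride=4)   # temporal downsampling
            self.seq = nn.GRU(d, d, batch_first=True)                 # stand-in for a Mamba block
            self.up = nn.ConvTranspose1d(d, c_in, kernel_size=4, stride=4)  # restore length

        def forward(self, x):                    # x: (batch, channels, time)
            z = self.down(x)                     # (B, d, T/4)
            z, _ = self.seq(z.transpose(1, 2))   # long-range temporal mixing
            z = z.transpose(1, 2)
            return self.up(z) + x                # skip connection, as in a U-shape

    eeg = torch.randn(2, 19, 4096)               # 19-channel, long recording
    recon = TinyUSeq()(eeg)                       # a self-supervised target could be reconstruction
    print(recon.shape)                            # torch.Size([2, 19, 4096])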
Gate-Level Boolean Evolutionary Geometric Attention Neural Networks
Positive · Artificial Intelligence
A new paper introduces a gate-level Boolean evolutionary geometric attention neural network that models images as Boolean fields using logic gates. This innovative approach allows each pixel to function as a Boolean variable on a geometric manifold, facilitating information propagation and state updates through a Boolean reaction-diffusion mechanism.
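To make the Boolean reaction-diffusion idea concrete, here is a hedged illustration of one update step on a binary image, where an OR over the 4-neighbourhood plays the role of diffusion and an XOR/AND term plays the role of reaction; the specific gate choices are assumptions for the example, not the paper's design.

    # Illustrative Boolean reaction-diffusion step on a 2-D Boolean field.
    import numpy as np

    def boolean_rd_step(img):
        """img: 2-D bool array; returns the next Boolean field state."""
        up    = np.roll(img,  1, axis=0)
        down  = np.roll(img, -1, axis=0)
        left  = np.roll(img,  1, axis=1)
        right = np.roll(img, -1, axis=1)
        diffused = img | up | down | left | right    # OR-gate neighbourhood spread
        return diffused ^ (img & up & down)          # XOR/AND "reaction" term

    rng = np.random.default_rng(0)
    field = rng.random((8, 8)) > 0.8                 # sparse initial Boolean field
    for _ in range(3):
        field = boolean_rd_step(field)
    print(field.astype(int))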