SigMA: Path Signatures and Multi-head Attention for Learning Parameters in fBm-driven SDEs

arXiv — cs.LG•Thursday, December 18, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

A new neural architecture named SigMA has been introduced, integrating path signatures with multi-head self-attention for parameter learning in stochastic differential equations (SDEs) driven by fractional Brownian motion (fBm). This approach addresses the challenges posed by non-Markovian processes, which complicate traditional parameter estimation techniques.
The development of SigMA is significant as it aims to enhance the accuracy and efficiency of parameter estimation in complex systems, particularly in fields like quantitative finance and reliability engineering, where understanding rough dynamics is crucial.
This advancement reflects a broader trend in artificial intelligence where deep learning models are increasingly being optimized for complex data structures. The integration of path signatures and attention mechanisms highlights a growing interest in improving model interpretability and performance, paralleling efforts in other domains such as market behavior prediction and time series forecasting.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

finlight.me

Realtime financial and market news API with sentiment analysis and full articles.

Business & ProductivityView app details

LucidQuery AI

Combines diffusion reasoning with autoregressive LLM for advanced AI analysis.

AI & DataView app details

Deltabadger

Automate dollar-cost averaging and portfolio rebalancing for early retirement planning.

Tech & Developer ToolsView app details

FastML

Build and deploy machine learning pipelines with speed and efficiency.

Business & ProductivityView app details

Portfolio Backtest

AI-powered portfolio backtesting for data-driven investment strategies.

AI & DataView app details

Hummingbot

Automate crypto trading and market making across multiple exchanges efficiently.

Tech & Developer ToolsView app details

Continue Readings

arXiv — cs.LGa day ago

Generalization and Feature Attribution in Machine Learning Models for Crop Yield and Anomaly Prediction in Germany

NeutralArtificial Intelligence

A recent study has analyzed the generalization performance and interpretability of machine learning models for predicting crop yield and anomalies in Germany's NUTS-3 regions. The research compares ensemble tree-based models like XGBoost and Random Forest with deep learning approaches such as LSTM and TCN, revealing significant performance degradation on temporally independent validation years despite strong accuracy on conventional test sets.

Read full article

via arXiv — cs.LG

arXiv — cs.CVa day ago

Model Agnostic Preference Optimization for Medical Image Segmentation

PositiveArtificial Intelligence

A new training framework called Model Agnostic Preference Optimization (MAPO) has been introduced for medical image segmentation, which utilizes Dropout-driven stochastic segmentation hypotheses to create preference-consistent gradients without relying on direct ground-truth supervision. This model-agnostic approach supports various architectures, including 2D/3D CNNs and Transformers.

Read full article

via arXiv — cs.CV

arXiv — cs.CVa day ago

MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos

PositiveArtificial Intelligence

The introduction of MS-Temba, a Multi-Scale Temporal Mamba model, addresses significant challenges in Temporal Action Detection (TAD) for untrimmed videos, particularly in Activities of Daily Living (ADL). This model enhances the ability to process long-duration videos, capture temporal variations, and detect overlapping actions effectively through the use of dilated State-space Models (SSMs).

Read full article

via arXiv — cs.CV

arXiv — cs.LGa day ago

Empirical Investigation of the Impact of Phase Information on Fault Diagnosis of Rotating Machinery

PositiveArtificial Intelligence

An empirical investigation has revealed that incorporating phase information significantly enhances fault diagnosis in rotating machinery. The study introduces two innovative phase-aware preprocessing strategies that effectively address random phase variations in multi-axis vibration data, demonstrating improvements across various deep learning architectures.

Read full article

via arXiv — cs.LG

THE DECODER2 days ago

Nvidia's Nemotron 3 swaps pure Transformers for a Mamba hybrid to run AI agents efficiently

PositiveArtificial Intelligence

Nvidia has introduced the Nemotron 3 family, which integrates Mamba and Transformer architectures to efficiently manage long context windows for AI agents. This hybrid approach aims to optimize resource usage while enhancing performance in AI applications.

Read full article

via THE DECODER

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about