QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-Term Memory

arXiv — cs.LG · Friday, December 5, 2025 at 5:00:00 AM
  • The Quantum-inspired Kolmogorov-Arnold Long Short-Term Memory (QKAN-LSTM) model advances sequential modeling by integrating Data Re-Uploading Activation (DARUAN) modules into conventional LSTMs, improving frequency adaptability and spectral representation without requiring quantum entanglement (a minimal sketch of the idea follows this summary).
  • The QKAN-LSTM's ability to maintain quantum-level expressivity while being executable on classical hardware positions it as a promising tool for various applications, including urban telecommunication forecasting, where understanding temporal correlations is crucial.
  • This development aligns with ongoing research efforts to enhance LSTM networks, which are widely used in fields such as finance, energy forecasting, and real-time translation. Integrating techniques like DARUAN may address long-standing limitations of LSTMs, improving predictive accuracy and efficiency across these domains.
— via World Pulse Now AI Editorial System
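The paper's exact DARUAN construction is not reproduced in the summary above, but one plausible classical reading is a trainable Fourier-style activation: each "upload" contributes a sinusoid with a learnable frequency and phase, which is how data re-uploading circuits enrich the spectrum a unit can represent. The sketch below is illustrative only; the module name and layer design are assumptions, not the authors' code.

```python
import torch
import torch.nn as nn

class DataReuploadActivation(nn.Module):
    """Trainable Fourier-style activation: each 'upload' adds a sinusoid
    with a learnable frequency and phase, enriching the representable
    spectrum (an illustrative reading of DARUAN, not the paper's code)."""
    def __init__(self, n_uploads: int = 4):
        super().__init__()
        self.freq = nn.Parameter(torch.randn(n_uploads))   # learnable frequencies
        self.phase = nn.Parameter(torch.zeros(n_uploads))  # learnable phases
        self.amp = nn.Parameter(torch.ones(n_uploads) / n_uploads)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (..., features); broadcast every upload over the last axis
        x = x.unsqueeze(-1)                                # (..., features, 1)
        terms = self.amp * torch.cos(self.freq * x + self.phase)
        return terms.sum(dim=-1)                           # (..., features)
```

Swapping such an activation in for tanh in an LSTM cell's candidate update is one way frequency adaptability could be added while staying entirely on classical hardware.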


Continue Reading
Beyond Wave Variables: A Data-Driven Ensemble Approach for Enhanced Teleoperation Transparency and Stability
Positive · Artificial Intelligence
A new study introduces a data-driven ensemble approach to enhance transparency and stability in bilateral teleoperation systems, addressing challenges posed by communication delays. The framework replaces traditional wave-variable methods with sequence models, including LSTM and CNN-LSTM, tuned with the Optuna hyperparameter-optimization framework. Experimental validation, implemented in Python, demonstrates the effectiveness of the approach.
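Since the summary names Optuna explicitly, a minimal sketch of how an LSTM's hyperparameters might be searched with it may help; the data, search space, and training loop here are stand-ins, not the study's setup.

```python
import optuna
import torch
import torch.nn as nn

# Toy stand-in for the teleoperation signals.
X = torch.randn(256, 50, 8)   # (batch, time, features)
y = torch.randn(256, 1)

def objective(trial: optuna.Trial) -> float:
    hidden = trial.suggest_int("hidden_size", 16, 128)
    lr = trial.suggest_float("lr", 1e-4, 1e-2, log=True)
    lstm = nn.LSTM(8, hidden, batch_first=True)
    head = nn.Linear(hidden, 1)
    opt = torch.optim.Adam(list(lstm.parameters()) + list(head.parameters()), lr=lr)
    for _ in range(5):                       # a few quick epochs per trial
        out, _ = lstm(X)
        loss = nn.functional.mse_loss(head(out[:, -1]), y)
        opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=20)
print(study.best_params)
```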
Using Text-Based Life Trajectories from Swedish Register Data to Predict Residential Mobility with Pretrained Transformers
Positive · Artificial Intelligence
A recent study has transformed extensive Swedish register data into textual life trajectories to predict residential mobility, drawing on records for 6.9 million individuals between 2001 and 2013. By converting demographic attributes and life-course events into semantically rich texts, the research employs NLP architectures including LSTM and BERT to improve prediction accuracy for residential moves from 2013 to 2017.
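The register schema is not public here, so the record layout below is hypothetical, but it shows the basic move: flattening dated events into a chronological sentence sequence that a text model such as an LSTM or BERT can consume.

```python
# Hypothetical record layout; the Swedish register schema is an assumption.
record = {
    "birth_year": 1984, "sex": "F",
    "events": [(2003, "moved to Stockholm"), (2007, "completed university degree"),
               (2009, "changed employer"), (2012, "first child born")],
}

def to_trajectory(rec: dict) -> str:
    """Render one person's register entries as a chronological sentence
    sequence, the kind of 'life trajectory' text a language model can read."""
    header = f"Person born {rec['birth_year']}, sex {rec['sex']}."
    events = " ".join(f"In {year}, {event}." for year, event in sorted(rec["events"]))
    return f"{header} {events}"

print(to_trajectory(record))
```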
A Hybrid Model for Stock Market Forecasting: Integrating News Sentiment and Time Series Data with Graph Neural Networks
Positive · Artificial Intelligence
A recent study introduces a hybrid model for stock market forecasting that integrates news sentiment and time series data using Graph Neural Networks (GNNs). This approach contrasts with traditional models that primarily rely on historical price data, aiming to enhance prediction accuracy by incorporating external signals from financial news articles. The GNN model was evaluated against a baseline Long Short-Term Memory (LSTM) model, demonstrating superior performance in predicting stock price movements.
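The paper's GNN design is not detailed in the summary, so the following is only one plausible shape for such a hybrid: an LSTM encodes each stock's price window, a news-sentiment score is appended, and one mean-aggregation message-passing step mixes information across related stocks.

```python
import torch
import torch.nn as nn

class SentimentGraphForecaster(nn.Module):
    """Minimal sketch: LSTM-encode each stock's price window, append a
    sentiment score, then mix neighbours with one mean-aggregation
    message-passing step. The paper's actual architecture may differ."""
    def __init__(self, hidden: int = 32):
        super().__init__()
        self.encoder = nn.LSTM(1, hidden, batch_first=True)
        self.mix = nn.Linear(2 * (hidden + 1), 1)   # self + neighbour messages

    def forward(self, prices, sentiment, adj):
        # prices: (stocks, time, 1); sentiment: (stocks, 1); adj: (stocks, stocks)
        _, (h, _) = self.encoder(prices)
        x = torch.cat([h[-1], sentiment], dim=-1)        # (stocks, hidden+1)
        deg = adj.sum(dim=1, keepdim=True).clamp(min=1)
        neigh = adj @ x / deg                            # mean over neighbours
        return self.mix(torch.cat([x, neigh], dim=-1))   # next-step score per stock

model = SentimentGraphForecaster()
pred = model(torch.randn(5, 30, 1), torch.randn(5, 1), torch.ones(5, 5))
```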
In-Context and Few-Shots Learning for Forecasting Time Series Data based on Large Language Models
Positive · Artificial Intelligence
A recent study has explored the application of Large Language Models (LLMs) for forecasting time series data, particularly focusing on Google's TimesFM model. The research highlights the potential of LLMs to surpass traditional methods like LSTM and TCN in predictive accuracy, utilizing in-context learning techniques to enhance model performance.
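In-context forecasting typically amounts to serializing history windows as few-shot examples and letting the model continue the series; the prompt format and window sizes below are assumptions, not the paper's protocol.

```python
# Sketch of the in-context idea: past windows become few-shot examples
# and the model is asked to continue the series.
series = [112, 118, 121, 119, 125, 131, 128, 134, 140, 137, 143, 149]

def build_prompt(values, context=4, horizon=2, shots=2):
    lines = []
    for i in range(shots):
        past = values[i * context:(i + 1) * context]
        future = values[(i + 1) * context:(i + 1) * context + horizon]
        lines.append(f"History: {past} -> Next: {future}")
    lines.append(f"History: {values[-context:]} -> Next:")
    return "\n".join(lines)

print(build_prompt(series))
# The completed prompt would then go to an LLM (or a forecasting foundation
# model such as TimesFM via its own API) and the continuation is parsed
# back into numbers.
```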
Hidden Leaks in Time Series Forecasting: How Data Leakage Affects LSTM Evaluation Across Configurations and Validation Strategies
Neutral · Artificial Intelligence
A recent study highlights the issue of data leakage in Long Short-Term Memory (LSTM) networks used for time series forecasting, revealing that improper sequence construction before dataset partitioning can lead to misleading evaluation results. The research evaluates three validation techniques under both leaky and clean conditions, demonstrating how validation design can influence leakage sensitivity and performance metrics such as RMSE Gain.
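The study's exact configurations are not reproduced here, but the core pitfall is easy to demonstrate: building sliding windows before the train/test split lets future values leak into training windows, while splitting the raw series first keeps the evaluation clean.

```python
import numpy as np

series = np.arange(100, dtype=float)
WIN = 10

def windows(x):
    return np.stack([x[i:i + WIN] for i in range(len(x) - WIN)])

# Leaky: window first, then split. Windows straddling index 80 put
# post-split values into the training set, inflating evaluation scores.
W = windows(series)
train_leaky, test_leaky = W[:80], W[80:]

# Clean: split the raw series first, then window each part separately,
# so no training window contains values from the test period.
train_clean = windows(series[:80])
test_clean = windows(series[80:])
```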
LAD-BNet: Lag-Aware Dual-Branch Networks for Real-Time Energy Forecasting on Edge Devices
Positive · Artificial Intelligence
LAD-BNet, a Lag-Aware Dual-Branch Network, has been introduced as a novel neural architecture aimed at enhancing real-time energy forecasting on edge devices, specifically optimized for Google Coral TPU. This model effectively combines temporal lag exploitation with a Temporal Convolutional Network (TCN) to capture both short and long-term dependencies, achieving a mean absolute percentage error (MAPE) of 14.49% at a one-hour forecasting horizon.
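LAD-BNet's internals are not given in the summary; the sketch below shows the general dual-branch idea under assumed sizes: one branch reads hand-picked lags (same hour yesterday and last week), the other runs dilated causal convolutions in TCN style, and a shared head combines both.

```python
import torch
import torch.nn as nn

class DualBranchForecaster(nn.Module):
    """Illustrative lag-aware dual-branch net; layer sizes and lag choices
    are assumptions, not LAD-BNet's published configuration."""
    def __init__(self, lags=(24, 168), channels=16):
        super().__init__()
        self.lags = lags
        self.lag_branch = nn.Linear(len(lags), channels)
        self.conv1 = nn.Conv1d(1, channels, 3, dilation=1, padding=2)
        self.conv2 = nn.Conv1d(channels, channels, 3, dilation=2, padding=4)
        self.head = nn.Linear(2 * channels, 1)

    def forward(self, x):                                   # x: (batch, time)
        lag_feats = torch.stack([x[:, -l] for l in self.lags], dim=-1)
        a = self.lag_branch(lag_feats)                      # explicit-lag branch
        h = torch.relu(self.conv1(x.unsqueeze(1))[..., : x.size(1)])  # causal trim
        h = torch.relu(self.conv2(h)[..., : x.size(1)])     # TCN-style branch
        b = h[..., -1]                                      # last-step features
        return self.head(torch.cat([a, b], dim=-1))

model = DualBranchForecaster()
yhat = model(torch.randn(8, 336))                           # two weeks of hourly load
```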
Emergent Granger Causality in Neural Networks: Can Prediction Alone Reveal Structure?
Neutral · Artificial Intelligence
A novel approach to Granger Causality (GC) using deep neural networks (DNNs) has been proposed, focusing on the joint modeling of multivariate time series data. This method aims to enhance the understanding of complex associations that traditional vector autoregressive models struggle to capture, particularly in non-linear contexts.
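A common neural rendering of Granger causality, which may or may not match the paper's formulation, compares a restricted predictor (the target's own past) against an unrestricted one that also sees the candidate driver's past; a substantial error drop is evidence of causal influence.

```python
import torch
import torch.nn as nn

def fit_mse(X, y, epochs=300):
    """Fit a small MLP and return its final training MSE."""
    net = nn.Sequential(nn.Linear(X.shape[1], 16), nn.Tanh(), nn.Linear(16, 1))
    opt = torch.optim.Adam(net.parameters(), lr=1e-2)
    for _ in range(epochs):
        loss = nn.functional.mse_loss(net(X), y)
        opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

# Synthetic pair where x drives y with one step of lag.
T = 500
x = torch.randn(T)
y = torch.zeros(T)
y[1:] = 0.8 * x[:-1] + 0.1 * torch.randn(T - 1)

lag_y = y[:-1].unsqueeze(1)                  # restricted: y's own past only
lag_xy = torch.stack([y[:-1], x[:-1]], 1)    # unrestricted: add x's past
target = y[1:].unsqueeze(1)

print(f"restricted {fit_mse(lag_y, target):.3f} "
      f"vs unrestricted {fit_mse(lag_xy, target):.3f}")
# A large error drop when x's past is added suggests x Granger-causes y;
# nonlinear nets extend this test beyond the linear VAR setting.
```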
QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling
Neutral · Artificial Intelligence
The introduction of the Quantum-Leap LSTM (QL-LSTM) addresses significant limitations in traditional recurrent neural architectures like LSTM and GRU, particularly in managing long sequences and reducing redundant parameters. This new architecture employs a Parameter-Shared Unified Gating mechanism and a Hierarchical Gated Recurrence with Additive Skip Connections to enhance performance while decreasing the number of parameters by approximately 48 percent.
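The summary names the mechanisms but not their equations, so the cell below is only a guess at what "parameter-shared unified gating" with an additive skip could look like: one shared projection feeds all gates, cheap per-gate scale/shift vectors differentiate them, and the hidden state carries a residual term across steps.

```python
import torch
import torch.nn as nn

class SharedGateCell(nn.Module):
    """Guess at a parameter-shared gated cell: one projection computes a
    single pre-activation, and per-gate scale/shift vectors differentiate
    the input/forget/output gates, cutting gate parameters roughly
    fourfold versus a standard LSTM cell. Illustrative only."""
    def __init__(self, input_size: int, hidden: int):
        super().__init__()
        self.proj = nn.Linear(input_size + hidden, hidden)   # shared weights
        self.scale = nn.Parameter(torch.ones(3, hidden))     # per-gate scales
        self.shift = nn.Parameter(torch.zeros(3, hidden))    # per-gate shifts

    def forward(self, x, state):
        h, c = state
        z = self.proj(torch.cat([x, h], dim=-1))             # one matmul for all gates
        i, f, o = (torch.sigmoid(self.scale[k] * z + self.shift[k]) for k in range(3))
        c = f * c + i * torch.tanh(z)
        h = o * torch.tanh(c) + h                            # additive skip connection
        return h, (h, c)

cell = SharedGateCell(8, 32)
h = c = torch.zeros(4, 32)
out, (h, c) = cell(torch.randn(4, 8), (h, c))
```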