Q-Learning-Based Time-Critical Data Aggregation Scheduling in IoT

arXiv — cs.LG · Tuesday, November 25, 2025 at 5:00:00 AM
  • A novel Q-learning framework has been proposed for time-critical data aggregation scheduling in Internet of Things (IoT) networks, aiming to reduce latency in applications such as smart cities and industrial automation. The approach integrates aggregation tree construction and scheduling into a unified model, improving efficiency and scalability (a generic sketch of the underlying Q-learning update appears after this summary).
  • The significance of this development lies in its potential to optimize data transmission in IoT environments, addressing the limitations of traditional heuristic methods that often lead to high computational overhead and delays.
  • This advancement reflects a broader trend of applying reinforcement learning techniques such as Q-learning to improve operational efficiency in resource-constrained environments, part of the ongoing evolution of smart technologies across sectors.
— via World Pulse Now AI Editorial System
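The digest does not give the paper's exact state and action encoding, but the core mechanism named is standard tabular Q-learning. Below is a minimal sketch in which a scheduler epsilon-greedily assigns a transmission slot to an aggregation-tree node; the state/action encoding, reward convention, and hyperparameters are illustrative assumptions, not the authors' design.

```python
import random
from collections import defaultdict

# Minimal tabular Q-learning sketch for slot scheduling. The state/action
# encoding and the reward (e.g., a penalty per slot of added latency) are
# illustrative assumptions, not the paper's formulation.
ALPHA, GAMMA, EPS = 0.1, 0.95, 0.1
Q = defaultdict(float)  # Q[(state, slot)]

def choose_slot(state, slots):
    """Epsilon-greedy choice of a transmission slot for the current node."""
    if random.random() < EPS:
        return random.choice(slots)
    return max(slots, key=lambda slot: Q[(state, slot)])

def q_update(state, slot, reward, next_state, slots):
    """Standard one-step Q-learning backup."""
    best_next = max(Q[(next_state, s)] for s in slots)
    Q[(state, slot)] += ALPHA * (reward + GAMMA * best_next - Q[(state, slot)])
```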


Continue Reading
Physical Reinforcement Learning
Neutral · Artificial Intelligence
Recent advancements in Contrastive Local Learning Networks (CLLNs) have demonstrated their potential for reinforcement learning (RL) applications, particularly in energy-limited environments. This study successfully applied Q-learning techniques to simulated CLLNs, showcasing their robustness and low power consumption compared to traditional digital systems.
Reinforcement Learning for Self-Healing Material Systems
Positive · Artificial Intelligence
A recent study has framed the self-healing process of material systems as a Reinforcement Learning (RL) problem within a Markov Decision Process (MDP), demonstrating that RL agents can autonomously derive optimal policies for maintaining structural integrity while managing resource consumption. The research highlighted the superior performance of continuous-action agents, particularly the TD3 agent, in achieving near-complete material recovery compared to traditional heuristic methods.
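The blurb describes the MDP framing rather than any concrete interface, so the following is a schematic, gym-style environment assuming a scalar integrity state, a finite healing-agent budget, and a continuous repair action; the dynamics and reward trade-off are illustrative stand-ins for the paper's formulation. A continuous-action agent such as TD3 could then be trained against it.

```python
import numpy as np

# Hypothetical self-healing MDP: state = (integrity, resources),
# action = continuous repair effort in [0, 1]. All dynamics are assumptions.
class SelfHealingEnv:
    def __init__(self, degradation=0.05, resource_cost=0.1):
        self.degradation = degradation      # damage accrued each step
        self.resource_cost = resource_cost  # price of spending healing agent
        self.reset()

    def reset(self):
        self.integrity = 1.0                # 1.0 = intact, 0.0 = failed
        self.resources = 1.0                # remaining healing-agent budget
        return np.array([self.integrity, self.resources], dtype=np.float32)

    def step(self, action):
        # Repair effort is capped by the remaining budget per step.
        repair = float(np.clip(action, 0.0, 1.0)) * min(self.resources, 0.2)
        self.resources -= repair
        self.integrity = min(1.0, self.integrity - self.degradation + repair)
        # Reward trades structural integrity against resource consumption.
        reward = self.integrity - self.resource_cost * repair
        done = self.integrity <= 0.0 or self.resources <= 0.0
        obs = np.array([self.integrity, self.resources], dtype=np.float32)
        return obs, reward, done, {}
```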
Non-stationary and Varying-discounting Markov Decision Processes for Reinforcement Learning
Positive · Artificial Intelligence
The introduction of the Non-stationary and Varying-discounting Markov Decision Processes (NVMDP) framework addresses the limitations that traditional stationary Markov Decision Processes (MDPs) face in non-stationary environments. The framework allows discount rates to vary over time and across transitions, making it applicable to both finite- and infinite-horizon tasks.
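To make the varying-discount idea concrete, here is a hedged finite-horizon value-iteration sketch in which the discount depends on the time step and on the transition (s, a, s'); the tensor shapes, the random model, and the particular gamma schedule are assumptions for illustration only.

```python
import numpy as np

# Backward induction with a time- and transition-dependent discount:
# Q_t(s,a) = r_t(s,a) + sum_{s'} P_t(s'|s,a) * gamma_t(s,a,s') * V_{t+1}(s').
H, S, A = 10, 4, 2
rng = np.random.default_rng(0)
P = rng.dirichlet(np.ones(S), size=(H, S, A))    # P[t, s, a, s']
R = rng.normal(size=(H, S, A))                   # r_t(s, a)
gamma = 0.9 + 0.05 * rng.random((H, S, A, S))    # gamma_t(s, a, s')

V = np.zeros((H + 1, S))                         # terminal value V_H = 0
for t in reversed(range(H)):
    Q = R[t] + np.einsum("sap,sap,p->sa", P[t], gamma[t], V[t + 1])
    V[t] = Q.max(axis=1)
print(V[0])  # optimal time-0 values under the non-stationary model
```

A stationary MDP is the special case where P, R, and gamma are constant in t, so the same backup recovers ordinary value iteration.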
Hi-SAFE: Hierarchical Secure Aggregation for Lightweight Federated Learning
Positive · Artificial Intelligence
Hi-SAFE, a new framework for Hierarchical Secure Aggregation in Federated Learning (FL), addresses privacy and communication efficiency challenges in resource-constrained environments like IoT and edge networks. It enhances the security of sign-based methods, such as SIGNSGD-MV, by utilizing efficient majority vote polynomials derived from Fermat's Little Theorem.
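The summary names majority-vote polynomials built from Fermat's Little Theorem. The toy snippet below reconstructs that idea over a prime field: since x^(p-1) ≡ 1 (mod p) for x ≠ 0, the expression 1 - (x - c)^(p-1) acts as an equality indicator, and summing indicators over all "positive-majority" sums expresses the vote as a polynomial that could in principle be evaluated under secret sharing. The prime, the ±1 encoding, and the vote sizes are assumptions; Hi-SAFE's actual hierarchical construction differs.

```python
# Toy majority-vote polynomial over F_p for SIGNSGD-MV-style sign votes.
# Illustrative reconstruction only, not the Hi-SAFE protocol itself.
p = 8191  # small prime; must satisfy p > 2 * n to avoid wraparound

def eq_indicator(x, c):
    # Fermat: z^(p-1) = 1 (mod p) for z != 0, so this is 1 iff x == c.
    return (1 - pow((x - c) % p, p - 1, p)) % p

def majority_sign(signs):
    n = len(signs)       # n odd, so the integer sum is odd and never zero
    S = sum(signs) % p   # -1 encoded as p - 1; true sum lies in [-n, n]
    # Majority is +1 exactly when the integer sum is one of 1, 3, ..., n.
    return sum(eq_indicator(S, c) for c in range(1, n + 1, 2)) % p

votes = [1, 1, -1, 1, -1]
print(majority_sign(votes))  # 1 -> majority is +1, 0 -> majority is -1
```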
First-order Sobolev Reinforcement Learning
Positive · Artificial Intelligence
A new refinement in temporal-difference learning has been proposed, emphasizing first-order Bellman consistency. This approach trains the learned value function to align with both the Bellman targets and their derivatives, enhancing the stability and convergence of reinforcement learning algorithms like Q-learning and actor-critic methods.
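The key idea, regressing the value function onto both the Bellman target and its derivative with respect to the state, can be sketched directly with automatic differentiation. In the hedged PyTorch sketch below, the linear dynamics, quadratic reward, and network architecture are assumptions; the point is the two-term loss, a standard TD regression plus a penalty aligning dV/ds with d(target)/ds through a differentiable model.

```python
import torch
import torch.nn as nn

# First-order (Sobolev) TD sketch: match the Bellman target and its
# state derivative. Dynamics, reward, and sizes are illustrative.
torch.manual_seed(0)
V = nn.Sequential(nn.Linear(2, 64), nn.Tanh(), nn.Linear(64, 1))
opt = torch.optim.Adam(V.parameters(), lr=1e-3)
GAMMA = 0.99
A = 0.9 * torch.eye(2)                    # toy known dynamics: s' = A s

def sobolev_td_loss(s):
    s = s.clone().requires_grad_(True)
    v = V(s).squeeze(-1)
    dv_ds, = torch.autograd.grad(v.sum(), s, create_graph=True)

    s_next = s @ A.T                      # differentiable model step
    r = -(s ** 2).sum(dim=1)              # toy quadratic cost as reward
    target = r + GAMMA * V(s_next).squeeze(-1)
    dt_ds, = torch.autograd.grad(target.sum(), s)  # fixed derivative target

    value_loss = ((v - target.detach()) ** 2).mean()      # zeroth order
    grad_loss = ((dv_ds - dt_ds) ** 2).sum(dim=1).mean()  # first order
    return value_loss + grad_loss

for _ in range(200):                      # a few illustrative updates
    opt.zero_grad()
    sobolev_td_loss(torch.randn(32, 2)).backward()
    opt.step()
```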