MalRAG: A Retrieval-Augmented LLM Framework for Open-set Malicious Traffic Identification

arXiv — cs.LG · Wednesday, November 19, 2025 at 5:00:00 AM
  • MalRAG introduces a retrieval-augmented large language model (LLM) framework for identifying malicious network traffic in open-set settings, where novel attack classes appear alongside known ones.
  • The approach is significant because it makes detection more adaptable: grounding the LLM in a constructed body of traffic knowledge lets organizations respond to evolving threats rather than relying on a fixed label set, and that comprehensive traffic knowledge construction is central to the framework.
  • MalRAG reflects a broader trend in cybersecurity toward advanced machine learning techniques, such as heterogeneous graph neural networks and automated penetration testing, that aim to improve anomaly detection and threat identification. As organizations increasingly adopt AI-driven defenses, retrieval-grounded identification of this kind is likely to see wider use; a minimal sketch of the retrieval step follows below.
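The core loop is easy to picture even without the paper's implementation details. Below is a minimal, self-contained sketch of retrieval-augmented identification: a toy traffic knowledge base, a bag-of-words stand-in for a real traffic embedding, and a prompt that keeps "unknown" as an answer so the decision stays open-set. The knowledge-base entries, similarity measure, and prompt wording are all illustrative assumptions, not MalRAG's actual components.

```python
# Minimal retrieval-augmented identification sketch (hypothetical; the paper's
# knowledge construction and retrieval pipeline are not reproduced here).
from collections import Counter
from math import sqrt

# Hypothetical "traffic knowledge base": textual summaries of known behaviors.
KNOWLEDGE_BASE = [
    ("Cobalt Strike beacon", "periodic small HTTPS POSTs to a single IP at fixed intervals"),
    ("SSH brute force", "many short TCP connections to port 22 with rapid failures"),
    ("benign web browsing", "bursty HTTPS GETs to many domains with varied sizes"),
]

def bow(text: str) -> Counter:
    """Bag-of-words vector; a stand-in for a learned traffic embedding."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, k: int = 2):
    """Return the k knowledge entries most similar to the observed flow."""
    q = bow(query)
    return sorted(KNOWLEDGE_BASE, key=lambda kb: cosine(q, bow(kb[1])), reverse=True)[:k]

def build_prompt(flow_summary: str) -> str:
    """Assemble retrieved evidence into an LLM prompt; 'unknown' stays an
    option so the decision remains open-set rather than forced into known classes."""
    evidence = "\n".join(f"- {label}: {desc}" for label, desc in retrieve(flow_summary))
    return (f"Known traffic patterns:\n{evidence}\n\n"
            f"Observed flow: {flow_summary}\n"
            "Answer with the best-matching label, or 'unknown' if none fit.")

print(build_prompt("small periodic HTTPS POSTs to one IP at fixed intervals"))
```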
— via World Pulse Now AI Editorial System


Recommended Readings
Can Machines Think Like Humans? A Behavioral Evaluation of LLM Agents in Dictator Games
Neutral · Artificial Intelligence
The study titled 'Can Machines Think Like Humans? A Behavioral Evaluation of LLM Agents in Dictator Games' investigates the prosocial behaviors of Large Language Model (LLM) agents. It examines how different personas influence these behaviors and benchmarks them against human actions. The findings indicate that assigning human-like identities to LLMs does not guarantee human-like decision-making, revealing significant variability in alignment with human behavior across different model architectures.
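To make the evaluation setup concrete, here is a hypothetical harness for a dictator-game probe: a persona-conditioned prompt, a parser for the model's allocation, and repeated sampling per persona. The llm() call, personas, and endowment are assumptions for illustration; the study's actual protocol and prompts are not reproduced here.

```python
# Hypothetical dictator-game harness; llm() stands in for any chat-completion
# call and is not part of the paper's released code.
import re

ENDOWMENT = 100

def make_prompt(persona: str) -> str:
    return (f"You are {persona}. You have {ENDOWMENT} dollars and may give any "
            "amount to an anonymous stranger. Reply with just the number you give.")

def parse_gift(reply: str) -> int | None:
    """Extract the allocated amount; reject replies outside the valid range."""
    match = re.search(r"\d+", reply)
    if match is None:
        return None
    gift = int(match.group())
    return gift if 0 <= gift <= ENDOWMENT else None

print(parse_gift("I would give 40 dollars."))  # -> 40

# Compare personas by mean gift over repeated samples, then benchmark against
# human baselines (humans typically allocate a nonzero share).
for persona in ["a retired teacher", "an investment banker", "a college student"]:
    prompt = make_prompt(persona)
    # gifts = [parse_gift(llm(prompt)) for _ in range(30)]  # sample repeatedly
    print(prompt)
```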
Node-Level Uncertainty Estimation in LLM-Generated SQL
Positive · Artificial Intelligence
A new framework for detecting errors in SQL generated by large language models (LLMs) has been introduced, focusing on estimating uncertainty at the node level within the query's abstract syntax tree (AST). The method employs a semantically aware labeling algorithm to assess node correctness and utilizes a classifier to predict error probabilities for each node. This approach allows for precise diagnostics, significantly improving error detection compared to traditional token log-probabilities across various databases and datasets.
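A small sketch helps show what "node-level" means here. The hand-rolled AST and heuristic scorer below stand in for a real SQL parser and the paper's trained classifier; only the shape of the idea is the point, namely walking the tree and ranking nodes by error probability so repairs can target specific spans instead of rejecting the whole query.

```python
# Sketch of node-level uncertainty over a SQL AST. The labeling algorithm and
# trained classifier are replaced by a hypothetical per-node scorer.
from dataclasses import dataclass, field

@dataclass
class Node:
    kind: str                                   # e.g. SELECT, COLUMN, TABLE, PREDICATE
    text: str
    children: list["Node"] = field(default_factory=list)

def walk(node: Node):
    """Yield every node in the tree, depth-first."""
    yield node
    for child in node.children:
        yield from walk(child)

def error_probability(node: Node) -> float:
    """Stand-in for the learned per-node classifier: a toy heuristic that
    treats predicates and columns as riskier than structural nodes."""
    return {"PREDICATE": 0.40, "COLUMN": 0.25}.get(node.kind, 0.05)

# SELECT name FROM users WHERE age > '30'  (type mismatch in the predicate)
tree = Node("SELECT", "SELECT name FROM users WHERE age > '30'", [
    Node("COLUMN", "name"),
    Node("TABLE", "users"),
    Node("PREDICATE", "age > '30'"),
])

# Surface the riskiest nodes first for targeted repair.
for n in sorted(walk(tree), key=error_probability, reverse=True):
    print(f"{n.kind:10s} p_err={error_probability(n):.2f}  {n.text}")
```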
Scaling Textual Gradients via Sampling-Based Momentum
Positive · Artificial Intelligence
The article discusses the challenges and potential of scaling prompt optimization using LLM-provided textual gradients. While this method has proven effective for automatic prompt engineering, issues arise when increasing training data due to context-length limits and diminishing returns from long-context degradation. The authors propose a new approach called Textual Stochastic Gradient Descent with Momentum (TSGD-M), which utilizes momentum sampling to enhance training stability and scalability.
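One way to picture momentum sampling over textual gradients is a recency-weighted buffer of LLM critiques, sketched below. The decay schedule, buffer API, and the step of feeding sampled critiques to a prompt-rewriting LLM call are assumptions for illustration, not the paper's released implementation.

```python
# Sketch of momentum-style sampling over textual gradients (LLM critiques of
# the current prompt). Decay value and API are illustrative assumptions.
import random

class TextualMomentumBuffer:
    def __init__(self, decay: float = 0.9):
        self.decay = decay
        self.gradients: list[str] = []

    def push(self, critique: str) -> None:
        self.gradients.append(critique)

    def sample(self, k: int = 3) -> list[str]:
        """Recency-weighted sampling: newer critiques get exponentially more
        weight, so updates stay stable without rereading the full history."""
        if not self.gradients:
            return []
        n = len(self.gradients)
        weights = [self.decay ** (n - 1 - i) for i in range(n)]
        return random.choices(self.gradients, weights=weights, k=min(k, n))

buf = TextualMomentumBuffer()
for critique in ["be concise", "cite the schema", "avoid SELECT *", "quote strings"]:
    buf.push(critique)
print(buf.sample())  # feed the sampled critiques to an LLM that rewrites the prompt
```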
LogPurge: Log Data Purification for Anomaly Detection via Rule-Enhanced Filtering
Positive · Artificial Intelligence
Log anomaly detection is essential for identifying system failures and preventing security breaches by recognizing irregular patterns in large volumes of log data. Traditional methods depend on training deep learning models with clean log sequences, which are difficult and costly to obtain because they require human labeling, and existing automatic cleaning methods do not adequately account for the specific characteristics of logs. The proposed LogPurge framework offers a cost-effective alternative: a rule-enhanced purification process that selects normal log sequences from contaminated data.
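A minimal picture of rule-enhanced purification: score each log sequence against a few anomaly rules and keep only sequences that pass, so the downstream detector trains on cleaner data. The regex rules and threshold below are invented for illustration; LogPurge's actual rule set and selection procedure are more sophisticated.

```python
# Sketch of rule-enhanced log purification; rules and threshold are assumptions.
import re

ANOMALY_RULES = [
    re.compile(r"\b(error|fatal|panic|exception)\b", re.IGNORECASE),
    re.compile(r"\bfailed to\b", re.IGNORECASE),
    re.compile(r"connection (refused|reset)", re.IGNORECASE),
]

def is_normal(sequence: list[str], max_hits: int = 0) -> bool:
    """A sequence passes purification if at most max_hits lines match a rule."""
    hits = sum(any(rule.search(line) for rule in ANOMALY_RULES) for line in sequence)
    return hits <= max_hits

contaminated = [
    ["user login ok", "session started", "page served in 12ms"],
    ["user login ok", "ERROR: failed to open datafile", "retrying"],
]
clean = [seq for seq in contaminated if is_normal(seq)]
print(len(clean), "of", len(contaminated), "sequences kept for training")
```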
ReflexGrad: Three-Way Synergistic Architecture for Zero-Shot Generalization in LLM Agents
Positive · Artificial Intelligence
ReflexGrad is a new architecture designed to enhance zero-shot generalization in large language model (LLM) agents. It integrates three mechanisms: hierarchical TODO decomposition for strategic planning, history-aware causal reflection for identifying failure causes, and gradient-based optimization for systematic improvement. This approach allows agents to learn from experiences without needing task-specific training, marking a significant advancement in reinforcement learning and decision-making.
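The three mechanisms compose naturally into one loop, sketched below with stub functions. decompose(), reflect(), and refine() stand in for LLM calls; the stubs only show how the plan, the outcome history, and the critique flow between the components, not how the paper implements them.

```python
# Skeleton of the three-way loop; all three functions are hypothetical stand-ins
# for LLM calls, not ReflexGrad's released code.
def decompose(goal: str) -> list[str]:
    """Hierarchical TODO decomposition: break the goal into ordered subtasks."""
    return [f"{goal}: step {i}" for i in range(1, 4)]

def reflect(history: list[tuple[str, bool]]) -> str:
    """History-aware causal reflection: name the likely cause of failures."""
    failures = [step for step, ok in history if not ok]
    return f"failures likely caused by: {failures}" if failures else "no failures"

def refine(plan: list[str], critique: str) -> list[str]:
    """Textual-gradient-style refinement: revise the plan using the critique."""
    return [f"{step} (revised: {critique})" for step in plan]

plan = decompose("book a flight")
history = [(step, step.endswith("2")) for step in plan]   # toy task outcomes
plan = refine(plan, reflect(history))
print(plan)
```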
Contextual Learning for Anomaly Detection in Tabular Data
Positive · Artificial Intelligence
Anomaly detection is essential in fields like cybersecurity and finance, particularly with large-scale tabular data. Traditional unsupervised methods struggle due to their reliance on a single global distribution, which does not account for the diverse contexts present in real-world data. This paper introduces a contextual learning framework that models normal behavior variations across different contexts, focusing on conditional data distributions instead of a global joint distribution, enhancing anomaly detection effectiveness.
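The conditional-versus-global distinction is easy to demonstrate. In the sketch below, per-context statistics are fit on presumed-normal history and used to score new rows; the context column, values, and threshold are invented for illustration, and the paper's framework is far more general than per-group z-scores.

```python
# Context-conditional scoring: model each context's own distribution instead
# of one global fit. Data and threshold are illustrative assumptions.
from collections import defaultdict
from statistics import mean, stdev

# Presumed-normal history, grouped by context (service name -> latency, ms).
train = [("checkout", 120), ("checkout", 130), ("checkout", 125),
         ("search", 20), ("search", 22), ("search", 21)]
test = [("search", 120), ("checkout", 122)]

by_context = defaultdict(list)
for ctx, value in train:
    by_context[ctx].append(value)

def z_score(ctx: str, value: float) -> float:
    """Deviation from the context's own conditional distribution."""
    vals = by_context[ctx]
    return abs(value - mean(vals)) / (stdev(vals) or 1e-9)

# A single global fit (mean ~73ms, sd ~57ms) would score 120ms as unremarkable;
# conditioning on context flags it immediately for 'search'.
for ctx, value in test:
    flag = "ANOMALY" if z_score(ctx, value) > 3.0 else "ok"
    print(f"{ctx:9s} {value:4d}ms  z={z_score(ctx, value):6.2f}  {flag}")
```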
Encoding and Understanding Astrophysical Information in Large Language Model-Generated Summaries
Neutral · Artificial Intelligence
Large Language Models (LLMs) have shown remarkable capabilities in generalizing across various domains and modalities. This study explores their potential to encode astrophysical information typically derived from scientific measurements. The research focuses on two primary questions: the impact of prompting on the codification of physical quantities by LLMs and the linguistic aspects crucial for encoding the physics represented by these measurements. Sparse autoencoders are utilized to extract interpretable features from the text.
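For readers unfamiliar with the tool, here is a minimal sparse autoencoder of the kind used to extract interpretable features, written in PyTorch. The dimensions, L1 weight, and random stand-in embeddings are placeholder assumptions; the study's actual architecture and training data differ.

```python
# Minimal sparse autoencoder for probing text embeddings; all hyperparameters
# and the random input are placeholders, not the study's setup.
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    def __init__(self, d_model: int = 256, d_hidden: int = 1024):
        super().__init__()
        self.encoder = nn.Linear(d_model, d_hidden)
        self.decoder = nn.Linear(d_hidden, d_model)

    def forward(self, x):
        z = torch.relu(self.encoder(x))   # non-negative sparse code
        return self.decoder(z), z

sae = SparseAutoencoder()
opt = torch.optim.Adam(sae.parameters(), lr=1e-3)
x = torch.randn(64, 256)                  # stand-in for LLM summary embeddings

for _ in range(100):
    x_hat, z = sae(x)
    # Reconstruction loss plus an L1 penalty that drives most units to zero.
    loss = nn.functional.mse_loss(x_hat, x) + 1e-3 * z.abs().mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

# Sparse activations make per-feature inspection tractable, e.g. asking
# whether feature k fires on mentions of a particular physical quantity.
print("active units per sample:", (z > 0).float().sum(dim=1).mean().item())
```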
SpiderGen: Towards Procedure Generation For Carbon Life Cycle Assessments with Generative AI
Positive · Artificial Intelligence
SpiderGen is a new workflow that utilizes large language models (LLMs) to enhance the process of conducting Life Cycle Assessments (LCAs) for consumer products. These assessments are crucial for understanding the environmental impact of goods, particularly in the context of greenhouse gas (GHG) emissions. SpiderGen integrates traditional LCA methodologies with the advanced reasoning capabilities of LLMs to produce graphical representations known as Product Category Rules Process Flow Graphs (PCR PFGs). The effectiveness of SpiderGen was evaluated against 65 real-world LCA documents.
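A toy rendering of what a process-flow-graph output might look like as a data structure, assuming nothing about SpiderGen's real schema: nodes carry a life-cycle stage and their upstream inputs, and a traversal prints the flow. All node names and edges here are invented for illustration.

```python
# Hypothetical process-flow-graph structure; node names, stages, and edges
# are invented and do not come from a real Product Category Rule.
from dataclasses import dataclass, field

@dataclass
class ProcessNode:
    name: str
    stage: str                     # e.g. "raw material", "manufacturing", "transport"
    inputs: list["ProcessNode"] = field(default_factory=list)

def print_flow(node: ProcessNode, depth: int = 0) -> None:
    """Print the graph from the final product back through its upstream inputs."""
    print("  " * depth + f"{node.name} [{node.stage}]")
    for upstream in node.inputs:
        print_flow(upstream, depth + 1)

cotton = ProcessNode("cotton farming", "raw material")
spinning = ProcessNode("yarn spinning", "manufacturing", [cotton])
tshirt = ProcessNode("t-shirt assembly", "manufacturing", [spinning])
shipping = ProcessNode("retail distribution", "transport", [tshirt])
print_flow(shipping)
```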