Anomaly Detection with Adaptive and Aggressive Rejection for Contaminated Training Data

arXiv — cs.LG•Thursday, November 27, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

A new method called Adaptive and Aggressive Rejection (AAR) has been proposed to improve anomaly detection in contaminated training data, addressing the limitations of traditional models that rely on fixed contamination ratios. AAR utilizes a modified z-score and Gaussian mixture model-based thresholds to dynamically exclude anomalies while preserving normal data. Extensive experiments show that AAR outperforms existing methods by a notable margin.
This development is significant as it enhances the reliability of anomaly detection systems, particularly in sectors like healthcare and security, where contaminated data can lead to critical errors. By effectively balancing the exclusion of anomalies with the retention of normal data, AAR provides a scalable solution that can adapt to varying levels of data contamination.
The introduction of AAR aligns with ongoing efforts to improve data quality and model reliability in machine learning, especially in healthcare and cybersecurity. As the reliance on data-driven decision-making grows, addressing issues of data contamination and privacy becomes increasingly vital. This method complements other advancements in the field, such as frameworks for bias mitigation in synthetic medical data and techniques for securing IoT devices against cyber threats.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Subtle

Acquire SaaS customers from Reddit with automated outreach and engagement.

AI & DataTry the app

Research AI

Find untapped prospects with AI-powered research and outreach.

AI & DataTry the app

Kadag Security

Test your app with AI security agents in a sandboxed, instrumented environment.

Business & ProductivityTry the app

Continue Readings

International Business Timesa day ago

MIT: AI Can Do 12% of US Work; Where Human Soft Power Is Irreplaceable

NeutralArtificial Intelligence

A recent report from MIT indicates that artificial intelligence (AI) has the potential to automate approximately 12% of jobs in the United States, which translates to over $1.2 trillion in wages, particularly affecting sectors such as finance and healthcare.

Read full article

via International Business Times

arXiv — cs.LGa day ago

Exploring Time-Step Size in Reinforcement Learning for Sepsis Treatment

PositiveArtificial Intelligence

Recent research has explored the impact of varying time-step sizes in reinforcement learning (RL) for sepsis treatment, examining four distinct intervals (1, 2, 4, and 8 hours) to assess their effects on patient data aggregation and treatment policies. The study highlights concerns regarding the traditional 4-hour time-step, which may lead to suboptimal treatment outcomes due to its coarse nature.

Read full article

via arXiv — cs.LG

arXiv — cs.LGa day ago

TAB-DRW: A DFT-based Robust Watermark for Generative Tabular Data

PositiveArtificial Intelligence

A new watermarking scheme named TAB-DRW has been proposed to enhance the traceability of generative tabular data, addressing concerns over data provenance and misuse in sectors like healthcare and finance. This method utilizes a discrete Fourier transform to embed watermark signals efficiently, overcoming limitations of existing techniques that are often computationally expensive or lack robustness.

Read full article

via arXiv — cs.LG

arXiv — cs.LG2 days ago

Jailbreaking and Mitigation of Vulnerabilities in Large Language Models

PositiveArtificial Intelligence

Recent research has highlighted significant vulnerabilities in Large Language Models (LLMs), particularly concerning prompt injection and jailbreaking attacks. This review categorizes various attack methods and evaluates defense strategies, including prompt filtering and self-regulation, to mitigate these risks.

Read full article

via arXiv — cs.LG

arXiv — cs.LG2 days ago

A Survey on Diffusion Models for Time Series and Spatio-Temporal Data

PositiveArtificial Intelligence

A recent survey on diffusion models for time series and spatio-temporal data highlights their growing application across various fields, including healthcare, climate, and traffic. The study emphasizes the separation of applications for time series and spatio-temporal data, providing a structured perspective on model categories and practical applications.

Read full article

via arXiv — cs.LG