STONE: Pioneering the One-to-N Universal Backdoor Threat in 3D Point Cloud

arXiv — cs.CVTuesday, December 9, 2025 at 5:00:00 AM
  • A new method named STONE has been introduced to address the critical threat of one-to-N universal backdoor attacks in 3D point clouds, particularly relevant in safety-sensitive areas like autonomous driving and robotics. This method utilizes a configurable spherical trigger design, allowing a single trigger to map to multiple target labels, thereby enhancing the flexibility of backdoor attacks beyond the traditional one-to-one paradigms.
  • The development of STONE is significant as it provides a theoretical foundation through Neural Tangent Kernel analysis, marking a pioneering step in understanding and potentially mitigating the risks associated with multi-target backdoor threats in deep learning systems used in autonomous technologies.
  • This advancement highlights a growing concern in the field of AI regarding adversarial attacks, as researchers increasingly focus on vulnerabilities in systems that rely on 3D data. The exploration of various attack vectors, including physical adversarial examples and optimized textures, underscores the urgent need for robust defenses in autonomous driving and robotics, where safety is paramount.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
SoC: Semantic Orthogonal Calibration for Test-Time Prompt Tuning
PositiveArtificial Intelligence
A new study introduces Semantic Orthogonal Calibration (SoC), a method aimed at improving the calibration of uncertainty estimates in vision-language models (VLMs) during test-time prompt tuning. This approach addresses the challenge of overconfidence in models by enforcing smooth prototype separation while maintaining semantic proximity.
Learning-based Multi-View Stereo: A Survey
NeutralArtificial Intelligence
A recent survey on learning-based Multi-View Stereo (MVS) techniques highlights the advancements in 3D reconstruction, which is crucial for applications such as Augmented and Virtual Reality, autonomous driving, and robotics. The study categorizes these methods into depth map-based, voxel-based, NeRF-based, and others, emphasizing the effectiveness of depth map-based approaches.
Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization
PositiveArtificial Intelligence
A recent study has introduced a framework aimed at mitigating hallucination issues in Multimodal Large Language Models (MLLMs) during Reinforcement Learning (RL) optimization. The research identifies key factors contributing to hallucinations, including over-reliance on visual reasoning and insufficient exploration diversity. The proposed framework incorporates modules for caption feedback, diversity-aware sampling, and conflict regularization to enhance model reliability.
SPARK: Scalable Real-Time Point Cloud Aggregation with Multi-View Self-Calibration
PositiveArtificial Intelligence
A new framework named SPARK has been introduced for scalable real-time multi-camera point cloud aggregation, addressing challenges in 3D reconstruction, particularly in handling extrinsic uncertainty and multi-view fusion. This innovative approach combines geometry-aware online extrinsic estimation with a confidence-driven point cloud fusion strategy, enabling stable point cloud generation in dynamic environments.
On the Sample Complexity of Differentially Private Policy Optimization
NeutralArtificial Intelligence
A recent study on differentially private policy optimization (DPPO) has been published, focusing on the sample complexity of policy optimization (PO) in reinforcement learning (RL). This research addresses privacy concerns in sensitive applications such as robotics and healthcare by formalizing a definition of differential privacy tailored to PO and analyzing the sample complexity of various PO algorithms under DP constraints.
Simulating the Visual World with Artificial Intelligence: A Roadmap
NeutralArtificial Intelligence
The landscape of video generation is evolving, transitioning from merely creating visually appealing clips to constructing interactive virtual environments that adhere to physical plausibility. This shift is highlighted in a recent survey that conceptualizes modern video foundation models as a combination of implicit world models and video renderers, enabling coherent visual reasoning and task planning.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about