Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds?

arXiv (cs.LG) · Thursday, November 20, 2025
  • The study introduces Class-Aware PillarMix (CAPMix), a mixed sample data augmentation (MSDA) technique tailored to 3D object detection on radar point clouds.
  • The development of CAPMix is significant as it could improve the accuracy and efficiency of 3D perception tasks, which are crucial for applications in autonomous driving and robotics. Enhanced detection capabilities can lead to safer and more reliable systems.
  • The challenges faced in adapting MSDA for radar point clouds reflect broader issues in the field of 3D perception, where advancements in LiDAR technology often overshadow radar applications. This highlights a need for innovative solutions that can leverage the strengths of both technologies in diverse environments.
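To make the idea of class-aware mixing concrete, the sketch below pastes ground-plane pillars from one radar scene into another with a probability that depends on each pillar's majority class. The function name, the pillar size, and the per-class probabilities are illustrative assumptions, not the paper's actual algorithm or hyper-parameters.

```python
import numpy as np

def capmix(pts_a, cls_a, pts_b, cls_b, swap_prob, pillar=2.0, seed=0):
    """Hypothetical sketch of class-aware pillar mixing: pillars from
    scene B are pasted into scene A with a probability that depends on
    the majority class of the pillar."""
    rng = np.random.default_rng(seed)
    out_pts, out_cls = [pts_a], [cls_a]
    # Bucket scene B's points into ground-plane pillars.
    keys = np.floor(pts_b[:, :2] / pillar).astype(int)
    for key in np.unique(keys, axis=0):
        mask = np.all(keys == key, axis=1)
        # The pillar's majority class selects its swap probability.
        maj = np.bincount(cls_b[mask]).argmax()
        if rng.random() < swap_prob.get(maj, 0.5):
            out_pts.append(pts_b[mask])
            out_cls.append(cls_b[mask])
    return np.concatenate(out_pts), np.concatenate(out_cls)
```

Making `swap_prob` class-dependent is the key difference from class-agnostic MSDA: rare or safety-critical classes can be mixed in more aggressively than background-heavy ones.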
— via World Pulse Now AI Editorial System


Recommended Readings
FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection
Positive · Artificial Intelligence
The paper presents FQ-PETR, a fully quantized framework for multi-view 3D object detection, addressing challenges in deploying PETR models due to high computational costs and memory requirements. The proposed method introduces innovations such as Quantization-Friendly LiDAR-ray Position Embedding to enhance performance without significant accuracy loss, despite the inherent difficulties in quantizing non-linear operators.
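The basic building block behind fully quantized inference is uniform fake-quantization: values are rounded to a low-bit integer grid during training so the network learns to tolerate the rounding error. The snippet below is a generic symmetric 8-bit sketch, not FQ-PETR's actual quantization scheme.

```python
import numpy as np

def fake_quant(x, n_bits=8):
    """Uniform symmetric fake-quantization: round to an integer grid,
    then map back to floats (a generic sketch, not FQ-PETR's scheme)."""
    qmax = 2 ** (n_bits - 1) - 1
    scale = np.abs(x).max() / qmax          # one scale per tensor
    q = np.clip(np.round(x / scale), -qmax - 1, qmax)
    return q * scale, scale
```

Non-linear operators (softmax, GELU, the trigonometric position encodings PETR uses) are the hard part, because a single per-tensor scale like the one above loses precision exactly where those functions are most curved.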
Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition
Positive · Artificial Intelligence
LiDAR place recognition is essential for SLAM, robot navigation, and autonomous driving. Current methods often face catastrophic forgetting when adapting to new environments. To combat this, a new framework called KDF+ has been proposed, which incorporates a loss-aware sampling strategy and a rehearsal enhancement mechanism to improve continual learning in LiDAR place recognition.
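A loss-aware rehearsal strategy can be sketched in a few lines: replay examples are drawn with probability proportional to their last recorded loss, so hard (high-loss) samples are rehearsed more often. This mirrors the idea described above but is an illustrative simplification, not KDF+'s exact formulation.

```python
import numpy as np

def loss_aware_sample(memory, losses, k, seed=0):
    """Draw k rehearsal examples from memory, weighting each by its
    last recorded loss (illustrative sketch of loss-aware sampling)."""
    rng = np.random.default_rng(seed)
    p = np.asarray(losses, dtype=float)
    p = p / p.sum()                      # losses -> sampling distribution
    idx = rng.choice(len(memory), size=k, replace=False, p=p)
    return [memory[i] for i in idx]
```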
MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation
Positive · Artificial Intelligence
MambaTrack3D is a new framework designed for LiDAR-based object tracking in dynamic outdoor environments characterized by high temporal variation (HTV). It addresses challenges faced by existing memory-based trackers, such as computational complexity and temporal redundancy, by introducing an Inter-frame Propagation module and a Grouped Feature Enhancement Module. These innovations allow for efficient tracking while effectively modeling spatial relations across historical frames.
CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking
Positive · Artificial Intelligence
CompTrack is a novel framework designed for 3D single object tracking in LiDAR point clouds, addressing challenges posed by spatial and informational redundancy. By utilizing a Spatial Foreground Predictor to filter background noise and an Information Bottleneck-guided Dynamic Token Compression module to enhance efficiency, CompTrack aims to improve the accuracy and performance of existing tracking systems in autonomous driving applications.
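Dynamic token compression reduces, in essence, to keeping only the top-scoring fraction of tokens, where the scores stand in for the foreground/relevance predictions a module like CompTrack's would produce. The sketch below assumes such scores are already available; it is not CompTrack's information-bottleneck objective.

```python
import numpy as np

def compress_tokens(tokens, scores, keep_ratio=0.25):
    """Minimal sketch of dynamic token compression: keep only the
    top-scoring fraction of tokens, preserving their original order."""
    k = max(1, int(len(tokens) * keep_ratio))
    keep = np.argsort(scores)[-k:]       # indices of the k best tokens
    return tokens[np.sort(keep)]         # restore sequence order
```

Downstream attention then runs over `k` tokens instead of the full set, which is where the quadratic-cost savings come from.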
V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization
Positive · Artificial Intelligence
The article presents a new framework for GNSS-free collaborative perception using LiDAR localization, addressing the challenges faced in GNSS-denied environments. Traditional localization methods often struggle in these settings, hindering effective collaboration among multi-agent systems. The proposed solution includes a lightweight Pose Generator with Confidence (PGC) for estimating poses and confidence, alongside the Pose-Aware Spatio-Temporal Alignment Transformer (PASTAT) for spatial alignment. A new simulation dataset, V2VLoc, is introduced, which supports LiDAR localization and collabor…
LED: Light Enhanced Depth Estimation at Night
Positive · Artificial Intelligence
Nighttime depth estimation using camera systems poses significant challenges, particularly for autonomous driving where accurate depth perception is crucial. Traditional models trained on daytime data often struggle without expensive LiDAR systems. This study introduces Light Enhanced Depth (LED), a novel approach that utilizes high-definition headlights to improve depth estimation in low-light conditions. LED demonstrates substantial performance improvements across various depth-estimation architectures on both synthetic and real datasets.
Availability-aware Sensor Fusion via Unified Canonical Space
Positive · Artificial Intelligence
The paper presents a novel method for sensor fusion in autonomous driving, termed availability-aware sensor fusion (ASF). This approach addresses the limitations of existing methods that assume continuous sensor availability, which can lead to performance degradation during sensor failure. By employing unified canonical projection (UCP) and cross-attention across sensors along patches (CASAP), ASF enhances object detection performance under various conditions, including adverse weather and sensor degradation.
DepthVision: Enabling Robust Vision-Language Models with GAN-Based LiDAR-to-RGB Synthesis for Autonomous Driving
Positive · Artificial Intelligence
DepthVision is a multimodal framework designed to enhance Vision-Language Models (VLMs) by utilizing LiDAR data without requiring architectural modifications or retraining. It synthesizes RGB-like images from sparse LiDAR point clouds using a conditional GAN and integrates a Luminance-Aware Modality Adaptation (LAMA) module to dynamically adjust image quality based on ambient lighting. This innovation aims to improve the reliability of autonomous vehicles in challenging visual conditions, such as darkness or motion blur.