CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking

arXiv — cs.CVThursday, November 20, 2025 at 5:00:00 AM

Was this article worth reading? Share it

Recommended Readings
PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation
PositiveArtificial Intelligence
The PAVE dataset represents a significant advancement in the evaluation of production autonomous vehicles (AVs). Unlike existing datasets that rely on human-driven data, PAVE is the first end-to-end benchmark collected entirely through autonomous driving in real-world conditions. It includes over 100 hours of data segmented into 32,727 key frames, featuring synchronized camera images and high-precision GNSS/IMU data, aimed at enhancing the safety evaluation of AVs.
FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection
PositiveArtificial Intelligence
The paper presents FQ-PETR, a fully quantized framework for multi-view 3D object detection, addressing challenges in deploying PETR models due to high computational costs and memory requirements. The proposed method introduces innovations such as Quantization-Friendly LiDAR-ray Position Embedding to enhance performance without significant accuracy loss, despite the inherent difficulties in quantizing non-linear operators.
Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds?
PositiveArtificial Intelligence
The paper discusses the application of mixed sample data augmentation (MSDA) techniques to enhance 3D object detection using radar point clouds. While MSDA has been effective for LiDAR data, its adaptation for radar point clouds presents unique challenges, including irregular angular distribution and point sparsity. The authors propose a new method called Class-Aware PillarMix (CAPMix) that utilizes MixUp at the pillar level, guided by class labels, to address these challenges.
Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition
PositiveArtificial Intelligence
LiDAR place recognition is essential for SLAM, robot navigation, and autonomous driving. Current methods often face catastrophic forgetting when adapting to new environments. To combat this, a new framework called KDF+ has been proposed, which incorporates a loss-aware sampling strategy and a rehearsal enhancement mechanism to improve continual learning in LiDAR place recognition.
Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space
PositiveArtificial Intelligence
The paper introduces EOT-WM, a driving world model that integrates ego and other vehicle trajectories in video latent space for autonomous driving systems. Current models primarily focus on the ego vehicle's trajectory, limiting their ability to simulate realistic interactions with other vehicles. EOT-WM aims to enhance the evaluation of autonomous driving systems by addressing this gap, allowing for better trajectory predictions and planning.
MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation
PositiveArtificial Intelligence
MambaTrack3D is a new framework designed for LiDAR-based object tracking in dynamic outdoor environments characterized by high temporal variation (HTV). It addresses challenges faced by existing memory-based trackers, such as computational complexity and temporal redundancy, by introducing an Inter-frame Propagation module and a Grouped Feature Enhancement Module. These innovations allow for efficient tracking while effectively modeling spatial relations across historical frames.
What We Can Learn From Waymo’s California AV Data
PositiveArtificial Intelligence
Waymo is leading the robotaxi sector in California, achieving over 83% of driverless miles. This significant milestone highlights the company's advancements in autonomous vehicle technology and its growing presence in the market.
Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous Driving
PositiveArtificial Intelligence
The article discusses the limitations of current end-to-end autonomous driving systems, which overly depend on ego status, affecting their ability to generalize and understand scenes robustly. It introduces AdaptiveAD, a new architectural solution that employs a dual-branch structure to decouple scene perception from ego status. This approach aims to enhance the performance of autonomous driving systems by allowing for more effective scene-driven reasoning without the influence of ego status.