KD360-VoxelBEV: LiDAR and 360-degree Camera Cross Modality Knowledge Distillation for Bird's-Eye-View Segmentation
Positive · Artificial Intelligence
- A new framework, KD360-VoxelBEV, has been introduced for cross-modality knowledge distillation in Bird's-Eye-View (BEV) segmentation using LiDAR and a 360-degree camera. The approach pairs a voxel-aligned view transformer with a high-capacity Teacher network to improve a lightweight Student network trained solely on panoramic camera images, yielding notable gains in segmentation accuracy on the Dur360BEV dataset.
- This development is significant because it improves both the efficiency and accuracy of BEV segmentation, a core capability for autonomous driving and robotics. By distilling the Teacher network's knowledge into a more compact Student model, the framework enables real-time inference, making accurate spatial and semantic understanding feasible from limited sensor data.
- The introduction of KD360-VoxelBEV aligns with ongoing trends in the field of autonomous systems, where the integration of multiple sensor modalities is becoming increasingly important. This framework not only enhances segmentation performance but also reflects a broader shift towards data-efficient learning methods, as seen in other recent advancements that leverage LiDAR and camera data for improved object detection and scene understanding.
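The teacher-student distillation described above can be illustrated with a minimal sketch. The article does not specify KD360-VoxelBEV's actual loss formulation, so the combination below — a temperature-scaled soft-label loss on BEV class maps plus a feature-imitation loss between the LiDAR Teacher's and camera Student's BEV features — is a common, hypothetical choice for cross-modality distillation, not the paper's confirmed method. All function and parameter names are illustrative.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the class axis.
    z = x - x.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def distillation_loss(student_logits, teacher_logits,
                      student_feat, teacher_feat,
                      temperature=2.0, alpha=0.5):
    """Hypothetical cross-modality KD objective (not the paper's exact loss).

    student_logits / teacher_logits: (cells, classes) BEV segmentation logits.
    student_feat / teacher_feat:     (cells, channels) BEV feature maps,
                                     assumed already spatially aligned
                                     (e.g. by a voxel-aligned view transform).
    """
    # Soft-label term: KL divergence between temperature-softened class maps.
    t = softmax(teacher_logits / temperature)
    s = softmax(student_logits / temperature)
    kl = np.sum(t * (np.log(t + 1e-8) - np.log(s + 1e-8))) / t.shape[0]
    # Feature-imitation term: MSE between Teacher and Student BEV features.
    mse = np.mean((student_feat - teacher_feat) ** 2)
    # Temperature^2 rescaling keeps gradient magnitudes comparable.
    return alpha * kl * temperature ** 2 + (1 - alpha) * mse
```

In such a setup the Teacher (LiDAR branch) is frozen during Student training, so only the camera-only Student pays the distillation cost at train time and runs alone at inference — which is what allows the compact model to approach the Teacher's accuracy.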
— via World Pulse Now AI Editorial System
