Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition

arXiv — cs.CV•Thursday, November 20, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

A novel framework named KDF+ has been introduced to enhance continual learning for LiDAR place recognition, addressing the issue of catastrophic forgetting when adapting to new environments.
This development is significant as it enables more effective learning and retention of previously acquired knowledge, which is crucial for applications in SLAM and autonomous navigation.
The advancement in LiDAR technologies, including frameworks for object tracking and depth estimation, highlights the ongoing efforts to improve autonomous systems' capabilities in dynamic and challenging environments.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Recommended Readings

arXiv — cs.CV10 hours ago

CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking

PositiveArtificial Intelligence

CompTrack is a novel framework designed for 3D single object tracking in LiDAR point clouds, addressing challenges posed by spatial and informational redundancy. By utilizing a Spatial Foreground Predictor to filter background noise and an Information Bottleneck-guided Dynamic Token Compression module to enhance efficiency, CompTrack aims to improve the accuracy and performance of existing tracking systems in autonomous driving applications.

Read full article

via arXiv — cs.CV

arXiv — cs.LG10 hours ago

Controlling False Positives in Image Segmentation via Conformal Prediction

PositiveArtificial Intelligence

A new framework for controlling false positives in image segmentation has been introduced, enhancing the reliability of semantic segmentation in clinical decision-making. This model-agnostic approach utilizes conformal prediction to create confidence masks that maintain a user-defined tolerance for false positives, without requiring retraining. The method demonstrates high probability guarantees for new images, making it a significant advancement in medical imaging.

Read full article

via arXiv — cs.LG

arXiv — cs.CV10 hours ago

Evaluating Multimodal Large Language Models on Vertically Written Japanese Text

NeutralArtificial Intelligence

This study evaluates the performance of Multimodal Large Language Models (MLLMs) on vertically written Japanese text, an area that has seen limited research. The authors generated a synthetic Japanese OCR dataset that includes both horizontal and vertical writing for model fine-tuning and evaluation. The findings aim to enhance the understanding of document images in Japanese, particularly those with vertical text formats.

Read full article

via arXiv — cs.CV

arXiv — cs.CV10 hours ago

MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation

PositiveArtificial Intelligence

MambaTrack3D is a new framework designed for LiDAR-based object tracking in dynamic outdoor environments characterized by high temporal variation (HTV). It addresses challenges faced by existing memory-based trackers, such as computational complexity and temporal redundancy, by introducing an Inter-frame Propagation module and a Grouped Feature Enhancement Module. These innovations allow for efficient tracking while effectively modeling spatial relations across historical frames.

Read full article

via arXiv — cs.CV

arXiv — cs.CV10 hours ago

FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection

PositiveArtificial Intelligence

The paper presents FQ-PETR, a fully quantized framework for multi-view 3D object detection, addressing challenges in deploying PETR models due to high computational costs and memory requirements. The proposed method introduces innovations such as Quantization-Friendly LiDAR-ray Position Embedding to enhance performance without significant accuracy loss, despite the inherent difficulties in quantizing non-linear operators.

Read full article

via arXiv — cs.CV

arXiv — cs.CV10 hours ago

Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image

PositiveArtificial Intelligence

Wonder3D++ is a new method designed to generate high-fidelity textured meshes from single-view images. It addresses limitations in existing techniques that either require extensive optimization or yield low-quality results. By employing a cross-domain diffusion model and a multi-view attention mechanism, Wonder3D++ enhances the quality and consistency of 3D reconstructions, making it a significant advancement in the field of 3D generation.

Read full article

via arXiv — cs.CV

arXiv — cs.LG10 hours ago

Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds?

PositiveArtificial Intelligence

The paper discusses the application of mixed sample data augmentation (MSDA) techniques to enhance 3D object detection using radar point clouds. While MSDA has been effective for LiDAR data, its adaptation for radar point clouds presents unique challenges, including irregular angular distribution and point sparsity. The authors propose a new method called Class-Aware PillarMix (CAPMix) that utilizes MixUp at the pillar level, guided by class labels, to address these challenges.

Read full article

via arXiv — cs.LG

arXiv — cs.CV10 hours ago

H-CNN-ViT: A Hierarchical Gated Attention Multi-Branch Model for Bladder Cancer Recurrence Prediction

PositiveArtificial Intelligence

Bladder cancer, with a recurrence rate of up to 78%, poses significant challenges for post-operative monitoring. Traditional multi-sequence contrast-enhanced MRI scans are often difficult to interpret due to changes from surgery. This study introduces H-CNN-ViT, a new AI model designed to enhance bladder cancer recurrence prediction by utilizing a curated multi-sequence MRI dataset, which aims to improve diagnostic accuracy and patient management.

Read full article

via arXiv — cs.CV