EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision

arXiv — cs.CVThursday, November 13, 2025 at 5:00:00 AM
EAGLE, a new framework for egocentric visual query localization, has been developed to tackle the inherent challenges of camera motion, viewpoint changes, and appearance variations that hinder performance in embodied AI and VR/AR. By leveraging episodic appearance- and geometry-aware memory, EAGLE enhances retrieval accuracy and enables a seamless integration of 2D and 3D tasks. This innovative approach draws inspiration from avian memory consolidation, integrating segmentation guided by an appearance-aware meta-learning memory with tracking driven by a geometry-aware localization memory. The memory consolidation mechanism effectively stores high-confidence retrieval samples, allowing for precise contour delineation and robust spatial discrimination. EAGLE has achieved state-of-the-art performance on the Ego4D-VQ benchmark, marking a significant step forward in the field of visual localization and underscoring its importance for future advancements in AI and immersive technologies.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it