Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space

arXiv — cs.CV · Thursday, November 20, 2025 at 5:00:00 AM
  • The introduction of EOT, a driving world model that unifies ego and other vehicle trajectories in a shared video latent space (a conceptual sketch follows below).
  • This matters because jointly modeling the motion of the ego vehicle and surrounding vehicles strengthens the predictive capabilities of autonomous systems, potentially leading to safer and more efficient driving.
  • The ongoing evolution of autonomous driving technologies highlights the need for comprehensive models that can adapt to complex driving scenarios, reflecting a broader trend towards more integrated and realistic simulations in the field.
— via World Pulse Now AI Editorial System
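The digest does not describe the paper's architecture, so the following is only a minimal conceptual sketch of the idea named in the title: conditioning a video latent world model on ego and other-vehicle trajectories in one shared token sequence. All module names, shapes, and the fusion strategy below are assumptions for illustration, not the authors' implementation.

```python
# Illustrative sketch only: fuses frame latents with ego and other-vehicle
# trajectory tokens in a single sequence. Names and dimensions are assumptions.
import torch
import torch.nn as nn

class JointTrajectoryConditioner(nn.Module):
    def __init__(self, latent_dim=256, traj_dim=4):
        super().__init__()
        # Embed waypoints (x, y, heading, speed) for ego and other agents alike.
        self.traj_embed = nn.Linear(traj_dim, latent_dim)
        self.agent_type = nn.Embedding(2, latent_dim)  # 0 = ego, 1 = other
        self.fuse = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model=latent_dim, nhead=8, batch_first=True),
            num_layers=2,
        )

    def forward(self, video_latents, ego_traj, other_traj):
        # video_latents: (B, T, latent_dim) frame latents from a video encoder
        # ego_traj:      (B, T, 4) ego waypoints
        # other_traj:    (B, N, T, 4) waypoints of N surrounding vehicles
        B, N, T, _ = other_traj.shape
        device = ego_traj.device
        ego_tok = self.traj_embed(ego_traj) + self.agent_type(
            torch.zeros(B, T, dtype=torch.long, device=device))
        other_tok = self.traj_embed(other_traj.reshape(B, N * T, -1)) + self.agent_type(
            torch.ones(B, N * T, dtype=torch.long, device=device))
        # Unify frame latents, ego tokens, and other-vehicle tokens in one sequence.
        tokens = torch.cat([video_latents, ego_tok, other_tok], dim=1)
        return self.fuse(tokens)
```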


Recommended Readings
PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation
Positive · Artificial Intelligence
The PAVE dataset represents a significant advancement in the evaluation of production autonomous vehicles (AVs). Unlike existing datasets that rely on human-driven data, PAVE is the first end-to-end benchmark collected entirely through autonomous driving in real-world conditions. It includes over 100 hours of data segmented into 32,727 key frames, featuring synchronized camera images and high-precision GNSS/IMU data, aimed at enhancing the safety evaluation of AVs.
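The summary does not give PAVE's actual schema, so the record below is only a guess at what a synchronized key frame could contain (camera images plus high-precision GNSS/IMU readings); every field name is an assumption.

```python
# Hypothetical key-frame record for a PAVE-style dataset; field names are
# illustrative, not taken from the paper.
from dataclasses import dataclass
from typing import Dict
import numpy as np

@dataclass
class PaveKeyFrame:
    timestamp_us: int                  # capture time in microseconds
    images: Dict[str, np.ndarray]      # camera name -> HxWx3 image
    gnss_lat_lon_alt: np.ndarray       # (3,) WGS84 position
    imu_accel: np.ndarray              # (3,) accelerometer, m/s^2
    imu_gyro: np.ndarray               # (3,) gyroscope, rad/s

# For scale: 100 hours segmented into 32,727 key frames works out to roughly
# one key frame every 11 seconds (100 * 3600 / 32727 ≈ 11.0 s).
```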
CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking
Positive · Artificial Intelligence
CompTrack is a novel framework designed for 3D single object tracking in LiDAR point clouds, addressing challenges posed by spatial and informational redundancy. By utilizing a Spatial Foreground Predictor to filter background noise and an Information Bottleneck-guided Dynamic Token Compression module to enhance efficiency, CompTrack aims to improve the accuracy and performance of existing tracking systems in autonomous driving applications.
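CompTrack's concrete modules are not specified in this summary, so the sketch below only illustrates the two stages it names: filtering background points with a foreground predictor, then compressing the surviving tokens before tracking. The keep ratio, the linear stand-in for low-rank compression, and all names are assumptions.

```python
# Conceptual two-stage sketch: foreground filtering followed by token
# compression. Not CompTrack's actual architecture.
import torch
import torch.nn as nn

class ForegroundThenCompress(nn.Module):
    def __init__(self, feat_dim=128, keep_ratio=0.5):
        super().__init__()
        self.foreground_score = nn.Linear(feat_dim, 1)       # spatial foreground predictor
        self.keep_ratio = keep_ratio
        self.low_rank = nn.Linear(feat_dim, feat_dim // 4)   # stand-in for low-rank compression

    def forward(self, point_tokens):
        # point_tokens: (B, N, feat_dim) features of LiDAR points in the search region
        scores = self.foreground_score(point_tokens).squeeze(-1)   # (B, N)
        k = max(1, int(point_tokens.shape[1] * self.keep_ratio))
        top_idx = scores.topk(k, dim=1).indices                    # keep likely foreground points
        kept = torch.gather(
            point_tokens, 1,
            top_idx.unsqueeze(-1).expand(-1, -1, point_tokens.shape[-1]))
        return self.low_rank(kept)                                 # compressed tokens for tracking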
nuCarla: A nuScenes-Style Bird's-Eye View Perception Dataset for CARLA Simulation
Positive · Artificial Intelligence
The nuCarla dataset has been introduced as a large-scale, nuScenes-style bird's-eye view perception dataset designed for the CARLA simulation environment. This dataset addresses the limitations of existing datasets that primarily support open-loop learning by providing a closed-loop simulation framework. nuCarla is fully compatible with the nuScenes format, allowing for the transfer of real-world perception models, and offers a scale comparable to nuScenes, enhancing the training of end-to-end autonomous driving models.
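Because the dataset is described as fully nuScenes-compatible, it should in principle load with the standard nuscenes-devkit; the snippet below assumes exactly that, and the version string and data root are placeholders rather than values from the paper.

```python
# Hypothetical usage of the nuscenes-devkit on a nuScenes-format dataset;
# version string and dataroot are placeholders.
from nuscenes.nuscenes import NuScenes

nusc = NuScenes(version="v1.0-trainval", dataroot="/data/nucarla", verbose=True)

sample = nusc.sample[0]                                   # first annotated keyframe
cam_front = nusc.get("sample_data", sample["data"]["CAM_FRONT"])
print(cam_front["filename"])                              # path to the rendered CARLA image
```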
Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous Driving
Positive · Artificial Intelligence
The article discusses the limitations of current end-to-end autonomous driving systems, which overly depend on ego status, affecting their ability to generalize and understand scenes robustly. It introduces AdaptiveAD, a new architectural solution that employs a dual-branch structure to decouple scene perception from ego status. This approach aims to enhance the performance of autonomous driving systems by allowing for more effective scene-driven reasoning without the influence of ego status.
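AdaptiveAD's dual-branch structure is only named here, not specified, so the following sketch shows the general pattern of keeping a scene-only branch separate from an ego-status branch and fusing them late. The fusion point, dimensions, and module names are assumptions.

```python
# Conceptual dual-branch sketch: scene features and ego status are processed
# separately and fused only at the planning head. Not the paper's architecture.
import torch
import torch.nn as nn

class DualBranchPlanner(nn.Module):
    def __init__(self, scene_dim=256, ego_dim=16, hidden=256, horizon=6):
        super().__init__()
        self.horizon = horizon
        self.scene_branch = nn.Sequential(nn.Linear(scene_dim, hidden), nn.ReLU())
        self.ego_branch = nn.Sequential(nn.Linear(ego_dim, hidden), nn.ReLU())
        # Late fusion so scene reasoning is not dominated by ego status
        # during feature extraction.
        self.head = nn.Linear(2 * hidden, horizon * 2)    # (x, y) waypoints

    def forward(self, scene_feat, ego_status):
        fused = torch.cat([self.scene_branch(scene_feat),
                           self.ego_branch(ego_status)], dim=-1)
        return self.head(fused).view(-1, self.horizon, 2)
```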
Divide and Merge: Motion and Semantic Learning in End-to-End Autonomous Driving
Positive · Artificial Intelligence
The article discusses a novel approach to end-to-end autonomous driving that separates semantic and motion learning to improve detection and tracking performance. The proposed method, Neural-Bayes motion decoding, utilizes learned motion queries in parallel with detection and tracking queries, enhancing information exchange through interactive semantic decoding. This addresses the negative transfer issue seen in multi-task learning, which can hinder performance in autonomous driving tasks.
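The summary describes motion queries decoded in parallel with detection/tracking queries, with information exchanged between the two sets. The sketch below illustrates that pattern with a single cross-attention exchange; the query counts, module names, and exchange design are assumptions, not the Neural-Bayes decoding itself.

```python
# Conceptual sketch of parallel semantic and motion query decoding with one
# interactive exchange step. Not the paper's implementation.
import torch
import torch.nn as nn

class ParallelQueryDecoder(nn.Module):
    def __init__(self, d_model=256, n_det=100, n_motion=100):
        super().__init__()
        self.det_queries = nn.Parameter(torch.randn(n_det, d_model))
        self.motion_queries = nn.Parameter(torch.randn(n_motion, d_model))
        self.det_decoder = nn.MultiheadAttention(d_model, 8, batch_first=True)
        self.motion_decoder = nn.MultiheadAttention(d_model, 8, batch_first=True)
        self.exchange = nn.MultiheadAttention(d_model, 8, batch_first=True)

    def forward(self, scene_tokens):
        # scene_tokens: (B, S, d_model) encoded scene features
        B = scene_tokens.shape[0]
        det_q = self.det_queries.unsqueeze(0).expand(B, -1, -1)
        mot_q = self.motion_queries.unsqueeze(0).expand(B, -1, -1)
        # Decode semantics (detection/tracking) and motion from the scene in parallel.
        det_feat, _ = self.det_decoder(det_q, scene_tokens, scene_tokens)
        mot_feat, _ = self.motion_decoder(mot_q, scene_tokens, scene_tokens)
        # Interactive exchange: motion queries attend to decoded semantic features.
        mot_feat, _ = self.exchange(mot_feat, det_feat, det_feat)
        return det_feat, mot_feat
```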
CAR-Scenes: Semantic VLM Dataset for Safe Autonomous Driving
Positive · Artificial Intelligence
CAR-Scenes is a frame-level dataset designed for autonomous driving, facilitating the training and evaluation of vision-language models (VLMs) for scene-level understanding. The dataset comprises 5,192 annotated images from sources like Argoverse, Cityscapes, KITTI, and nuScenes, utilizing a comprehensive 28-key category/sub-category knowledge base. The annotations are generated through a GPT-4o-assisted pipeline with human verification, providing detailed attributes and supporting semantic retrieval and risk-aware scenario mining.
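The actual 28-key knowledge base is not listed in this summary, so the attribute keys and values below are made-up examples of frame-level annotations, shown only to suggest how semantic retrieval over such records could work.

```python
# Illustrative frame-level annotation records and a simple attribute filter.
# Keys, values, and file names are hypothetical, not from CAR-Scenes.
from typing import Dict, List

frame_annotations: List[Dict[str, str]] = [
    {"image": "argoverse_000123.jpg", "weather": "rain", "time_of_day": "night",
     "road_type": "intersection", "risk": "vulnerable_road_user_present"},
    {"image": "kitti_000456.jpg", "weather": "clear", "time_of_day": "day",
     "road_type": "highway", "risk": "none"},
]

def retrieve(frames: List[Dict[str, str]], **conditions: str) -> List[str]:
    """Return images whose annotations match every requested attribute."""
    return [f["image"] for f in frames
            if all(f.get(k) == v for k, v in conditions.items())]

# Example: find rainy night-time frames for risk-aware scenario mining.
print(retrieve(frame_annotations, weather="rain", time_of_day="night"))
```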