Understanding World or Predicting Future? A Comprehensive Survey of World Models

arXiv — cs.LG•Tuesday, November 18, 2025 at 5:00:00 AM

NeutralArtificial Intelligence

The concept of world models is gaining traction due to advancements in AI technologies, particularly with models like GPT
The development of world models is crucial as they enhance decision
The ongoing evolution of world models reflects broader trends in AI, where the integration of large language models and simulation technologies is reshaping industries. This shift raises discussions about the implications for artificial general intelligence and the ethical considerations surrounding autonomous systems.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Recommended Readings

arXiv — cs.CL18 hours ago

A Reasoning Paradigm for Named Entity Recognition

PositiveArtificial Intelligence

A new framework for Named Entity Recognition (NER) has been proposed to enhance the performance of generative large language models (LLMs) like GPT-4. While these models excel at generating entities through semantic pattern matching, they often lack a robust reasoning mechanism, leading to suboptimal outcomes, particularly in low-resource scenarios. The proposed framework shifts the paradigm from implicit pattern matching to explicit reasoning, involving three stages: Chain of Thought (CoT) generation, CoT tuning, and reasoning enhancement, ultimately aiming to improve NER accuracy.

Read full article

via arXiv — cs.CL

arXiv — cs.LG18 hours ago

Learning with Preserving for Continual Multitask Learning

PositiveArtificial Intelligence

The article discusses a novel framework called Learning with Preserving (LwP) designed for Continual Multitask Learning (CMTL) in artificial intelligence systems. CMTL involves models that learn new tasks sequentially without forgetting previously acquired skills, which is crucial in fields like autonomous driving and medical imaging. Traditional methods often struggle due to task-specific feature fragmentation. LwP focuses on maintaining the geometric structure of shared representation spaces, enhancing the model's ability to learn continuously.

Read full article

via arXiv — cs.LG

arXiv — cs.LG18 hours ago

MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding

PositiveArtificial Intelligence

MMEdge is a new framework designed for real-time multimodal inference on resource-constrained edge devices, crucial for applications like autonomous driving and mobile health. The framework addresses the challenges of sensing dynamics and model execution by decomposing the inference process into fine-grained sensing and encoding units. This allows for incremental computation as data arrives, while a lightweight temporal aggregation module captures rich temporal dynamics to maintain accuracy.

Read full article

via arXiv — cs.LG

arXiv — cs.CV2 days ago

CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios

PositiveArtificial Intelligence

The CATS-V2V dataset introduces a pioneering real-world collection for Vehicle-to-Vehicle (V2V) cooperative perception, aimed at enhancing autonomous driving in complex adverse traffic scenarios. Collected using two time-synchronized vehicles, the dataset encompasses 100 clips featuring 60,000 frames of LiDAR point clouds and 1.26 million multi-view camera images across various weather and lighting conditions. This dataset is expected to significantly benefit the autonomous driving community by providing high-quality data for improved perception capabilities.

Read full article

via arXiv — cs.CV

arXiv — cs.CV2 days ago

One-to-N Backdoor Attack in 3D Point Cloud via Spherical Trigger

PositiveArtificial Intelligence

Backdoor attacks pose a significant risk to deep learning systems, especially in critical 3D applications like autonomous driving and robotics. This study introduces a novel one-to-N backdoor framework for 3D vision, utilizing a configurable spherical trigger. The research demonstrates that a single trigger can effectively encode multiple target classes, achieving high attack success rates of up to 100% while preserving accuracy on clean data.

Read full article

via arXiv — cs.CV

arXiv — cs.LG2 days ago

SURFACEBENCH: Can Self-Evolving LLMs Find the Equations of 3D Scientific Surfaces?

NeutralArtificial Intelligence

The article discusses the introduction of SurfaceBench, a new benchmark for symbolic surface discovery in machine learning. This benchmark addresses the challenge of equation discovery from data, which is crucial for understanding complex physical and geometric phenomena. SurfaceBench includes 183 tasks across 15 categories of symbolic complexity, featuring various equation representation forms and synthetic three-dimensional data. It aims to improve upon existing benchmarks that often focus on scalar functions and rely on inadequate metrics.

Read full article

via arXiv — cs.LG

arXiv — cs.CV2 days ago

FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection

PositiveArtificial Intelligence

The paper titled 'FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection' addresses the challenges of deploying PETR models in autonomous driving due to their high computational costs and memory requirements. It introduces FQ-PETR, a fully quantized framework that aims to enhance efficiency without sacrificing accuracy. Key innovations include a Quantization-Friendly LiDAR-ray Position Embedding and techniques to mitigate accuracy degradation typically associated with quantization methods.

Read full article

via arXiv — cs.CV

arXiv — cs.CV2 days ago

Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving

NeutralArtificial Intelligence

The article discusses advancements in autonomous driving systems that utilize 3D object detection through RGB cameras, which are more cost-effective than LiDAR. Despite their promising detection accuracy, these systems are vulnerable to adversarial attacks. The study introduces AdvRoad, a method to create realistic road-style adversarial posters that can deceive detection systems without being easily noticed. This approach aims to enhance the safety and reliability of autonomous driving technologies.

Read full article

via arXiv — cs.CV