FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object Detection

arXiv — cs.CVMonday, November 17, 2025 at 5:00:00 AM
  • The introduction of FQ
  • The development of FQ
  • While no directly related articles were identified, the innovations presented in FQ
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended Readings
Learning with Preserving for Continual Multitask Learning
PositiveArtificial Intelligence
The article discusses a novel framework called Learning with Preserving (LwP) designed for Continual Multitask Learning (CMTL) in artificial intelligence systems. CMTL involves models that learn new tasks sequentially without forgetting previously acquired skills, which is crucial in fields like autonomous driving and medical imaging. Traditional methods often struggle due to task-specific feature fragmentation. LwP focuses on maintaining the geometric structure of shared representation spaces, enhancing the model's ability to learn continuously.
MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding
PositiveArtificial Intelligence
MMEdge is a new framework designed for real-time multimodal inference on resource-constrained edge devices, crucial for applications like autonomous driving and mobile health. The framework addresses the challenges of sensing dynamics and model execution by decomposing the inference process into fine-grained sensing and encoding units. This allows for incremental computation as data arrives, while a lightweight temporal aggregation module captures rich temporal dynamics to maintain accuracy.
Understanding World or Predicting Future? A Comprehensive Survey of World Models
NeutralArtificial Intelligence
The article discusses the growing interest in world models, particularly in light of advancements in multimodal large language models like GPT-4 and video generation models such as Sora. World models serve two main purposes: understanding current world mechanisms and predicting future dynamics. The review categorizes these models and highlights their applications in various fields, including generative games, autonomous driving, robotics, and social simulacra, emphasizing their role in decision-making processes.
Adaptive LiDAR Scanning: Harnessing Temporal Cues for Efficient 3D Object Detection via Multi-Modal Fusion
PositiveArtificial Intelligence
The article discusses a novel adaptive LiDAR scanning framework that enhances 3D object detection by utilizing temporal cues from past observations. Traditional LiDAR sensors often perform redundant scans, leading to inefficiencies in data acquisition and power consumption. The proposed method employs a lightweight predictor network to identify regions of interest, significantly reducing unnecessary data collection and improving overall efficiency.
On the Relationship Between Adversarial Robustness and Decision Region in Deep Neural Networks
PositiveArtificial Intelligence
The article discusses the evaluation of Deep Neural Networks (DNNs) based on their generalization performance and robustness against adversarial attacks. It highlights the challenges in assessing DNNs solely through generalization metrics as their performance has reached state-of-the-art levels. The study introduces the concept of the Populated Region Set (PRS) to analyze the internal properties of DNNs that influence their robustness, revealing that a low PRS ratio correlates with improved adversarial robustness.
One-to-N Backdoor Attack in 3D Point Cloud via Spherical Trigger
PositiveArtificial Intelligence
Backdoor attacks pose a significant risk to deep learning systems, especially in critical 3D applications like autonomous driving and robotics. This study introduces a novel one-to-N backdoor framework for 3D vision, utilizing a configurable spherical trigger. The research demonstrates that a single trigger can effectively encode multiple target classes, achieving high attack success rates of up to 100% while preserving accuracy on clean data.
Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving
NeutralArtificial Intelligence
The article discusses advancements in autonomous driving systems that utilize 3D object detection through RGB cameras, which are more cost-effective than LiDAR. Despite their promising detection accuracy, these systems are vulnerable to adversarial attacks. The study introduces AdvRoad, a method to create realistic road-style adversarial posters that can deceive detection systems without being easily noticed. This approach aims to enhance the safety and reliability of autonomous driving technologies.
CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios
PositiveArtificial Intelligence
The CATS-V2V dataset introduces a pioneering real-world collection for Vehicle-to-Vehicle (V2V) cooperative perception, aimed at enhancing autonomous driving in complex adverse traffic scenarios. Collected using two time-synchronized vehicles, the dataset encompasses 100 clips featuring 60,000 frames of LiDAR point clouds and 1.26 million multi-view camera images across various weather and lighting conditions. This dataset is expected to significantly benefit the autonomous driving community by providing high-quality data for improved perception capabilities.