Monocular Person Localization under Camera Ego-motion

arXiv — cs.CVTuesday, November 25, 2025 at 5:00:00 AM
  • A new method for monocular person localization under camera ego-motion has been developed, addressing the challenges of accurately estimating a person's 3D position from 2D images captured by a moving camera. This approach utilizes a four-point model to jointly estimate the camera's 2D attitude and the person's 3D location, significantly improving localization accuracy compared to existing methods.
  • This advancement is crucial for enhancing Human-Robot Interaction (HRI), as accurate person localization is essential for applications such as person-following systems in robotics. The method has been validated through public datasets and real robot experiments, demonstrating its effectiveness in practical scenarios.
  • The development of this localization technique reflects ongoing efforts in the fields of robotics and computer vision to overcome limitations posed by camera motion. It aligns with broader trends in pose estimation and visual odometry, where improving accuracy and reliability remains a key focus, particularly in dynamic environments where traditional methods struggle.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models
NeutralArtificial Intelligence
A recent study has highlighted the potential of deep generative models for compressing sensor data in autonomous vehicles, particularly for scenarios requiring remote human assistance. This approach aims to enhance the efficiency of data transmission from sensors like cameras and lidar, which generate vast amounts of information in real-time.
Revisiting Pre-trained Language Models for Vulnerability Detection
NeutralArtificial Intelligence
The paper revisits the effectiveness of pre-trained language models (PLMs) in detecting real-world vulnerabilities, highlighting critical challenges such as data leakage and limited scope in existing studies. An extensive evaluation of 18 PLMs on high-quality datasets is conducted, focusing on their performance in vulnerability detection (VD) through fine-tuning and prompt engineering methods.
DeltaDeno: Zero-Shot Anomaly Generation via Delta-Denoising Attribution
PositiveArtificial Intelligence
DeltaDeno introduces a novel zero-shot anomaly generation method that operates without real anomaly samples or training, utilizing a technique called Delta-Denoising to localize and edit defects in images. This method contrasts two diffusion branches driven by minimal prompts, allowing for realistic local defect generation while preserving surrounding context.
RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond
PositiveArtificial Intelligence
A new algorithm named RapidPoseTriangulation has been introduced, enhancing multi-view multi-person whole-body human pose triangulation with remarkable speed and generalization capabilities. This advancement allows for detailed capture of human movements, including facial expressions and finger movements, across various individuals and viewpoints.
Performance of Conformal Prediction in Capturing Aleatoric Uncertainty
NeutralArtificial Intelligence
Conformal prediction, a model-agnostic approach, is being evaluated for its effectiveness in capturing aleatoric uncertainty, which refers to the inherent ambiguity in datasets due to overlapping classes. This investigation focuses on the correlation between prediction set sizes and the distinct labels assigned by human annotators, aiming to validate the expected performance of conformal predictors in uncertain environments.