Monocular Person Localization under Camera Ego-motion

arXiv — cs.CV•Tuesday, November 25, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

A new method for monocular person localization under camera ego-motion has been developed, addressing the challenges of accurately estimating a person's 3D position from 2D images captured by a moving camera. This approach utilizes a four-point model to jointly estimate the camera's 2D attitude and the person's 3D location, significantly improving localization accuracy compared to existing methods.
This advancement is crucial for enhancing Human-Robot Interaction (HRI), as accurate person localization is essential for applications such as person-following systems in robotics. The method has been validated through public datasets and real robot experiments, demonstrating its effectiveness in practical scenarios.
The development of this localization technique reflects ongoing efforts in the fields of robotics and computer vision to overcome limitations posed by camera motion. It aligns with broader trends in pose estimation and visual odometry, where improving accuracy and reliability remains a key focus, particularly in dynamic environments where traditional methods struggle.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

X Headshot

Transform your selfies into professional headshots with AI in minutes.

AI & DataTry the app

Novaheadshot

Transform selfies into professional headshots with AI—no photographer required.

AI & DataTry the app

Attentive AI

Extract digital maps from satellite, aerial, and drone imagery using deep learning.

AI & DataTry the app

Continue Readings

arXiv — cs.LGa day ago

Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models

NeutralArtificial Intelligence

A recent study has highlighted the potential of deep generative models for compressing sensor data in autonomous vehicles, particularly for scenarios requiring remote human assistance. This approach aims to enhance the efficiency of data transmission from sensors like cameras and lidar, which generate vast amounts of information in real-time.

Read full article

via arXiv — cs.LG

arXiv — cs.LGa day ago

Revisiting Pre-trained Language Models for Vulnerability Detection

NeutralArtificial Intelligence

The paper revisits the effectiveness of pre-trained language models (PLMs) in detecting real-world vulnerabilities, highlighting critical challenges such as data leakage and limited scope in existing studies. An extensive evaluation of 18 PLMs on high-quality datasets is conducted, focusing on their performance in vulnerability detection (VD) through fine-tuning and prompt engineering methods.

Read full article

via arXiv — cs.LG

arXiv — cs.CV2 days ago

DeltaDeno: Zero-Shot Anomaly Generation via Delta-Denoising Attribution

PositiveArtificial Intelligence

DeltaDeno introduces a novel zero-shot anomaly generation method that operates without real anomaly samples or training, utilizing a technique called Delta-Denoising to localize and edit defects in images. This method contrasts two diffusion branches driven by minimal prompts, allowing for realistic local defect generation while preserving surrounding context.

Read full article

via arXiv — cs.CV

arXiv — cs.CV2 days ago

RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond

PositiveArtificial Intelligence

A new algorithm named RapidPoseTriangulation has been introduced, enhancing multi-view multi-person whole-body human pose triangulation with remarkable speed and generalization capabilities. This advancement allows for detailed capture of human movements, including facial expressions and finger movements, across various individuals and viewpoints.

Read full article

via arXiv — cs.CV

arXiv — cs.LG2 days ago

Performance of Conformal Prediction in Capturing Aleatoric Uncertainty

NeutralArtificial Intelligence

Conformal prediction, a model-agnostic approach, is being evaluated for its effectiveness in capturing aleatoric uncertainty, which refers to the inherent ambiguity in datasets due to overlapping classes. This investigation focuses on the correlation between prediction set sizes and the distinct labels assigned by human annotators, aiming to validate the expected performance of conformal predictors in uncertain environments.

Read full article

via arXiv — cs.LG