Sharp Monocular View Synthesis in Less Than a Second

arXiv — cs.LG•Friday, December 12, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

SHARP is a newly introduced method for photorealistic view synthesis from a single image, achieving this in under a second using a standard GPU. The technique regresses the parameters of a 3D Gaussian representation, enabling real-time rendering of high-resolution images for nearby views. Experimental results indicate a significant reduction in synthesis time and improved performance metrics compared to previous models.
This development positions Apple at the forefront of advancements in artificial intelligence and computer vision, showcasing its commitment to innovation in creating tools that enhance visual content generation. By providing code and weights for SHARP, Apple also fosters collaboration and further research in the field.
The introduction of SHARP aligns with ongoing trends in AI, particularly in enhancing image synthesis and rendering techniques. It reflects a growing emphasis on efficiency and quality in visual technologies, paralleling other advancements such as neuromorphic eye tracking and multiview material appearance transfer, which aim to improve user experiences in augmented and virtual reality applications.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

MagicShot

Transform your screenshots into branded images in seconds.

AI & DataView app details

ScreenshotOne

Capture screenshots at scale with a fast, reliable API for any use case.

AI & DataView app details

Shakker-ai

Generate any image you imagine, streaming instantly from Shakker AI.

Creative & DesignView app details

Professional AI Headshot Generator

Transform your selfies into professional headshots in seconds with AI.

Creative & DesignView app details

Eyeware Beam

Turn your iPhone into a head tracker, eye tracker, and webcam for gaming and video creation.

Tech & Developer ToolsView app details

Novaheadshot

Transform selfies into professional headshots with AI—no photographer required.

AI & DataView app details

Continue Readings

TechCrunch2 days ago

Google and Apple roll out emergency security updates after zero-day attacks

PositiveArtificial Intelligence

Google and Apple have both released emergency security updates to address vulnerabilities that were actively exploited in zero-day attacks. Apple has patched its flagship devices, while Google has updated Chrome to fix a specific vulnerability that was part of these attacks.

Read full article

via TechCrunch

TechSpot2 days ago

2025's most-downloaded iPhone app is, you guessed it, ChatGPT

PositiveArtificial Intelligence

OpenAI's ChatGPT has been named the most downloaded free app on the U.S. iOS App Store for 2025, a notable rise from its previous position at number four in 2024. This achievement underscores the app's growing popularity and the increasing reliance on AI-driven applications among users.

Read full article

via TechSpot

Department of Product2 days ago

🔵 Claude Code comes to Slack and Cursor gets design capabilities

NeutralArtificial Intelligence

Anthropic has integrated its Claude Code programming agent into Slack, currently in beta, allowing developers to manage coding tasks directly within the messaging platform. This integration aims to streamline workflows by enabling task delegation through chat, enhancing productivity for software engineers.

Read full article

via Department of Product

arXiv — cs.LG3 days ago

Differential Smoothing Mitigates Sharpening and Improves LLM Reasoning

PositiveArtificial Intelligence

A recent study has introduced differential smoothing as a method to mitigate the diversity collapse often observed in large language models (LLMs) during reinforcement learning fine-tuning. This method aims to enhance both the correctness and diversity of model outputs, addressing a critical issue where outputs lack variety and can lead to diminished performance across tasks.

Read full article

via arXiv — cs.LG

$$\mathrm{D}^\mathrm{3}$-Predictor: Noise-Free Deterministic Diffusion for Dense Prediction$

arXiv — cs.CV3 days ago

$\mathrm{D}^\mathrm{3}$-Predictor: Noise-Free Deterministic Diffusion for Dense Prediction

PositiveArtificial Intelligence

The introduction of the D³-Predictor presents a significant advancement in dense prediction by addressing the limitations of existing diffusion models, which are hindered by stochastic noise that disrupts fine-grained spatial cues and geometric structure mappings. This new framework reformulates a pretrained diffusion model to eliminate stochasticity, allowing for a more deterministic mapping from images to geometry.

Read full article

via arXiv — cs.CV

arXiv — cs.CV3 days ago

Perception-Inspired Color Space Design for Photo White Balance Editing

PositiveArtificial Intelligence

A novel framework for white balance (WB) correction has been proposed, leveraging a perception-inspired Learnable HSI (LHSI) color space. This approach aims to address the limitations of traditional sRGB-based WB editing, which struggles with color constancy in complex lighting conditions due to fixed nonlinear transformations and entangled color channels.

Read full article

via arXiv — cs.CV

arXiv — cs.CV3 days ago

Exploring Automated Recognition of Instructional Activity and Discourse from Multimodal Classroom Data

PositiveArtificial Intelligence

A recent study explores the automated recognition of instructional activities and discourse from multimodal classroom data, utilizing AI-driven analysis of 164 hours of video and 68 lesson transcripts. This research aims to replace manual annotation methods, which are resource-intensive and difficult to scale, with more efficient AI techniques for actionable feedback to educators.

Read full article

via arXiv — cs.CV

arXiv — cs.LG3 days ago

Latent Action World Models for Control with Unlabeled Trajectories

PositiveArtificial Intelligence

A new study introduces latent-action world models that learn from both action-conditioned and action-free data, addressing the limitations of traditional models that rely heavily on labeled action trajectories. This approach allows for training on large-scale unlabeled trajectories while requiring only a small set of labeled actions.

Read full article

via arXiv — cs.LG

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about