Sharp Monocular View Synthesis in Less Than a Second

arXiv (cs.LG), Friday, December 12, 2025 at 5:00:00 AM
  • SHARP is a new method for photorealistic view synthesis from a single image. It regresses the parameters of a 3D Gaussian representation in under a second on a standard GPU, and that representation can then be rendered in real time to produce high-resolution images of nearby views. Experimental results indicate a significant reduction in synthesis time and improved performance metrics compared to previous models.
  • This development positions Apple at the forefront of advancements in artificial intelligence and computer vision, showcasing its commitment to innovation in creating tools that enhance visual content generation. By providing code and weights for SHARP, Apple also fosters collaboration and further research in the field.
  • The introduction of SHARP aligns with ongoing trends in AI, particularly in enhancing image synthesis and rendering techniques. It reflects a growing emphasis on efficiency and quality in visual technologies, paralleling other advancements such as neuromorphic eye tracking and multiview material appearance transfer, which aim to improve user experiences in augmented and virtual reality applications.
— via World Pulse Now AI Editorial System

