DT-NVS: Diffusion Transformers for Novel View Synthesis

arXiv — cs.CVThursday, November 13, 2025 at 5:00:00 AM
The recent submission of 'DT-NVS: Diffusion Transformers for Novel View Synthesis' to arXiv marks a significant advancement in the field of computer vision, particularly in generating novel views from a single image. Traditional methods have been constrained by focusing on limited camera movements or unnatural object-centric scenes. In contrast, DT-NVS employs a 3D diffusion model that leverages a transformer-based architecture, trained on a large-scale dataset of real-world, multi-category videos. This innovative approach not only enhances the model's ability to synthesize realistic views but also introduces novel camera conditioning strategies that allow for effective training on unaligned datasets. The evaluation results indicate that DT-NVS outperforms existing state-of-the-art 3D aware diffusion models, showcasing its potential for broader applications in real-world scenarios. This development is crucial as it opens new avenues for research and practical applications in novel view…
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
Beyond the Next Port: A Multi-Task Transformer for Forecasting Future Voyage Segment Durations
PositiveArtificial Intelligence
A new study introduces a transformer-based architecture designed for forecasting future voyage segment durations in maritime operations, addressing the limitations of traditional ETA models that focus on immediate ports and rely on real-time AIS data. This innovative approach reformulates the prediction as a segment-level time-series forecasting problem, integrating historical sailing durations and port congestion proxies.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about