PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion

arXiv — cs.CV · Tuesday, November 25, 2025
  • PartDiffuser is a semi-autoregressive diffusion framework for generating 3D meshes from point clouds. It balances global structural consistency against local detail fidelity through a part-wise approach: semantic segmentation splits the mesh into parts, and a discrete diffusion process reconstructs high-frequency geometric features within each part.
  • The development is significant because it addresses limitations of existing autoregressive methods, which accumulate errors and struggle to maintain detail across mesh parts. By building on the DiT architecture and a part-aware cross-attention mechanism, the framework aims to improve both the quality and the efficiency of 3D mesh generation.
  • This advancement aligns with ongoing efforts to improve texture generation and diffusion-model efficiency. Related frameworks such as NaTex and SLA likewise refine diffusion processes, pointing to a broader trend of tackling challenges like occlusion and attention efficiency that matter for computer graphics and virtual environments.
— via World Pulse Now AI Editorial System
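The part-aware cross-attention mentioned above can be illustrated with a toy mask: each mesh token attends only to point-cloud features that share its semantic part label. The function names and the masking rule are assumptions for illustration, not details from the paper.

```python
import math

def part_aware_mask(token_parts, feature_parts):
    """Boolean mask: mask[i][j] is True iff mesh token i may attend
    to point-cloud feature j (i.e. they share a semantic part label)."""
    return [[tp == fp for fp in feature_parts] for tp in token_parts]

def masked_attention_weights(scores, mask):
    """Row-wise softmax over attention scores, zeroing masked entries."""
    weights = []
    for row, keep_row in zip(scores, mask):
        exps = [math.exp(s) if keep else 0.0 for s, keep in zip(row, keep_row)]
        total = sum(exps) or 1.0
        weights.append([e / total for e in exps])
    return weights

# Two mesh tokens (parts 0 and 1) over three point features (parts 0, 1, 1):
# each token's attention mass stays inside its own part.
mask = part_aware_mask([0, 1], [0, 1, 1])
w = masked_attention_weights([[1.0, 2.0, 3.0], [1.0, 2.0, 3.0]], mask)
```

In a real DiT block the mask would be applied to the attention logits before the softmax; the effect shown here is the same.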

Continue Reading
Which Layer Causes Distribution Deviation? Entropy-Guided Adaptive Pruning for Diffusion and Flow Models
Positive · Artificial Intelligence
A new framework called EntPruner has been introduced to address parameter redundancy in large-scale vision generative models, specifically diffusion and flow models. This framework employs an entropy-guided automatic progressive pruning strategy, which assesses the importance of model blocks based on Conditional Entropy Deviation (CED) to optimize performance across various downstream tasks.
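The entropy-guided idea can be sketched with a toy importance score: a block matters more if skipping it shifts the entropy of the model's output distribution. The summary does not give EntPruner's exact CED formula, so this is an illustrative proxy with assumed names.

```python
import math

def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(x * math.log(x) for x in p if x > 0)

def ced_score(p_full, p_skipped):
    """Illustrative Conditional Entropy Deviation proxy: absolute entropy
    change between the full model's output and the output with one block
    skipped. High deviation -> keep the block; near zero -> prune candidate."""
    return abs(entropy(p_full) - entropy(p_skipped))

p_full = [0.7, 0.2, 0.1]
score_big = ced_score(p_full, [1/3, 1/3, 1/3])      # skipping flattens output
score_tiny = ced_score(p_full, [0.69, 0.21, 0.1])   # skipping barely matters
```

A progressive pruner would then drop the lowest-scoring blocks first, re-scoring after each removal.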
MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training
Positive · Artificial Intelligence
MoGAN has been introduced as a motion-centric post-training framework aimed at enhancing motion quality in video diffusion models, which often struggle with issues like jitter and ghosting. This framework utilizes a DiT-based optical-flow discriminator to improve motion realism without relying on reward models or human preference data.
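The adversarial objective such a discriminator might optimize can be shown with a standard hinge GAN loss; MoGAN's actual formulation is not reproduced here, and the scores stand in for the discriminator's outputs on optical-flow features of real versus generated video.

```python
def d_hinge_loss(real_scores, fake_scores):
    """Discriminator hinge loss: push real scores above +1
    and fake (generated-motion) scores below -1."""
    n_r, n_f = len(real_scores), len(fake_scores)
    return (sum(max(0.0, 1.0 - r) for r in real_scores) / n_r
            + sum(max(0.0, 1.0 + f) for f in fake_scores) / n_f)

def g_hinge_loss(fake_scores):
    """Generator loss: raise the discriminator's score on generated motion."""
    return -sum(fake_scores) / len(fake_scores)

# Well-separated scores incur no discriminator loss; ambiguous ones do.
d_good = d_hinge_loss([2.0], [-2.0])
d_bad = d_hinge_loss([0.0], [0.0])
g = g_hinge_loss([1.0, 3.0])
```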
Learning Plug-and-play Memory for Guiding Video Diffusion Models
Positive · Artificial Intelligence
A new study introduces a plug-and-play memory system for Diffusion Transformer-based video generation models, specifically the DiT, enhancing their ability to incorporate world knowledge and improve visual coherence. This development addresses the models' frequent violations of physical laws and commonsense dynamics, which have been a significant limitation in their application.
Training-Free Efficient Video Generation via Dynamic Token Carving
Positive · Artificial Intelligence
A new inference pipeline named Jenga has been introduced to enhance the efficiency of video generation using Video Diffusion Transformer (DiT) models. This approach addresses the computational challenges associated with self-attention and the multi-step nature of diffusion models by employing dynamic attention carving and progressive resolution generation.
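The "attention carving" idea can be sketched as a per-step token filter: attention runs only over the most salient tokens, cutting the quadratic cost. This top-k rule is an illustrative assumption, not Jenga's actual carving criterion, which the summary does not specify.

```python
def carve_tokens(saliency, keep_ratio):
    """Return the indices of the top-`keep_ratio` fraction of tokens,
    ranked by saliency, preserving their original order."""
    k = max(1, int(len(saliency) * keep_ratio))
    ranked = sorted(range(len(saliency)), key=lambda i: -saliency[i])
    return sorted(ranked[:k])

# Keep half of six tokens: the three most salient survive, in order.
kept = carve_tokens([0.9, 0.1, 0.5, 0.8, 0.2, 0.3], 0.5)
```

Progressive resolution generation would pair this with a coarse-to-fine schedule, so early denoising steps see fewer, lower-resolution tokens.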