Breaking the Likelihood-Quality Trade-off in Diffusion Models by Merging Pretrained Experts

arXiv — cs.LG•Tuesday, November 25, 2025 at 5:00:00 AM

PositiveArtificial Intelligence

A new method has been introduced to address the trade-off between perceptual sample quality and data likelihood in diffusion models for image generation. By merging two pretrained experts, one focused on image quality and the other on likelihood, the approach allows for improved image generation without the need for retraining, demonstrating effectiveness on datasets like CIFAR-10 and ImageNet32.
This development is significant as it enhances the capabilities of diffusion models, allowing for the generation of high-quality images while maintaining accurate likelihoods. The method's simplicity and effectiveness could lead to broader applications in various fields, including computer vision and machine learning.
The advancement reflects ongoing efforts in the AI community to optimize model performance and efficiency. It aligns with recent trends in machine learning that seek to balance quality and computational demands, as seen in related studies on unlearning representations and dataset pruning, indicating a growing focus on refining generative models and their applications.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Raphael

Generate unlimited AI images for free, no account required.

Creative & DesignTry the app

LexiStock AI

AI-powered photo enhancement for professional, high-quality image results.

AI & DataTry the app

The Influencer AI

Generate consistent AI personas for photo and video content creation.

Marketing & CommerceTry the app

Continue Readings

arXiv — cs.CV17 hours ago

Dynamic Epsilon Scheduling: A Multi-Factor Adaptive Perturbation Budget for Adversarial Training

PositiveArtificial Intelligence

A novel framework called Dynamic Epsilon Scheduling (DES) has been proposed to enhance adversarial training for deep neural networks. This approach adapts the adversarial perturbation budget based on instance-specific characteristics, integrating factors such as distance to decision boundaries, prediction confidence, and model uncertainty. This advancement addresses the limitations of fixed perturbation budgets in existing methods.

Read full article

via arXiv — cs.CV

arXiv — cs.CV17 hours ago

From Diffusion to One-Step Generation: A Comparative Study of Flow-Based Models with Application to Image Inpainting

PositiveArtificial Intelligence

A comprehensive study has been conducted comparing three generative modeling paradigms: Denoising Diffusion Probabilistic Models (DDPM), Conditional Flow Matching (CFM), and MeanFlow, focusing on their application in image inpainting. The study highlights that CFM significantly outperforms DDPM in terms of efficiency and quality, achieving a notable FID score of 24.15 with only 50 steps, while MeanFlow allows for single-step generation, reducing inference time by 50 times.

Read full article

via arXiv — cs.CV

arXiv — cs.CV17 hours ago

LTD: Low Temperature Distillation for Gradient Masking-free Adversarial Training

PositiveArtificial Intelligence

A novel approach called Low-Temperature Distillation (LTD) has been introduced to enhance adversarial training in neural networks, addressing the vulnerabilities associated with one-hot label representations in image classification. LTD utilizes a lower temperature in the teacher model while keeping the student model's temperature fixed, refining label representations and improving model robustness against adversarial attacks.

Read full article

via arXiv — cs.CV

arXiv — cs.LG2 days ago

SG-OIF: A Stability-Guided Online Influence Framework for Reliable Vision Data

PositiveArtificial Intelligence

The Stability-Guided Online Influence Framework (SG-OIF) has been introduced to enhance the reliability of vision data in deep learning models, addressing challenges such as the computational expense of influence function implementations and the instability of training dynamics. This framework aims to provide real-time control over algorithmic stability, facilitating more accurate identification of critical training examples.

Read full article

via arXiv — cs.LG

arXiv — cs.LG2 days ago

DP-MicroAdam: Private and Frugal Algorithm for Training and Fine-tuning

PositiveArtificial Intelligence

The introduction of DP-MicroAdam marks a significant advancement in the realm of adaptive optimizers for differentially private training, demonstrating superior performance and convergence rates compared to traditional methods like DP-SGD. This new algorithm is designed to be memory-efficient and sparsity-aware, addressing the challenges of extensive compute and hyperparameter tuning typically associated with differential privacy.

Read full article

via arXiv — cs.LG

arXiv — stat.ML2 days ago

ModHiFi: Identifying High Fidelity predictive components for Model Modification

PositiveArtificial Intelligence

A recent study titled 'ModHiFi: Identifying High Fidelity predictive components for Model Modification' explores methods to modify open weight models without access to training data or loss functions. The research focuses on identifying critical components that influence predictive performance using only distributional access, such as synthetic data.

Read full article

via arXiv — stat.ML

arXiv — cs.LG2 days ago

Latent Diffusion Inversion Requires Understanding the Latent Space

NeutralArtificial Intelligence

Recent research highlights the need for a deeper understanding of latent space in Latent Diffusion Models (LDMs), revealing that these models exhibit uneven memorization across latent codes and that different dimensions within a single latent code contribute variably to memorization. This study introduces a method to rank these dimensions based on their impact on the decoder pullback metric.

Read full article

via arXiv — cs.LG

arXiv — cs.LG2 days ago

MGAS: Multi-Granularity Architecture Search for Trade-Off Between Model Effectiveness and Efficiency

PositiveArtificial Intelligence

The introduction of Multi-Granularity Differentiable Architecture Search (MG-DARTS) marks a significant advancement in neural architecture search (NAS), focusing on optimizing both model effectiveness and efficiency. This framework addresses limitations in existing differentiable architecture search methods by incorporating finer-grained structures, enhancing the balance between model performance and size.

Read full article

via arXiv — cs.LG