Joint Discriminative-Generative Modeling via Dual Adversarial Training

arXiv — cs.LG · Friday, December 5, 2025 at 5:00:00 AM
  • A novel training framework has been proposed to enhance Joint Energy-Based Models (JEMs) by integrating adversarial training principles, addressing the instability and poor sample quality that affect their generative modeling. The method replaces Stochastic Gradient Langevin Dynamics (SGLD) sampling with a more stable approach that uses a Binary Cross-Entropy (BCE) loss to optimize the energy function, improving both classification robustness and the stability of generative learning (a minimal sketch of this objective appears after the summary).
  • This development is significant because it aims to bridge the gap between robust classification and high-fidelity generative modeling, a persistent challenge in artificial intelligence. By improving the performance of JEMs, the proposed framework could enable more reliable applications in fields such as computer vision and natural language processing.
  • The introduction of this framework aligns with ongoing efforts in the AI community to improve model robustness and generative capabilities. Similar advancements have been made in areas such as dataset distillation and adversarial training, highlighting a trend towards integrating multiple training methodologies to overcome limitations in existing models. This reflects a broader movement in AI research focused on enhancing model performance while addressing issues like class uncertainty and data quality.
— via World Pulse Now AI Editorial System
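A minimal sketch of the BCE-based energy objective described above, assuming a PyTorch-style setup. The names `energy_net` and `generator`, and the convention of treating -E(x) as a real-vs-fake logit, are assumptions for illustration rather than the paper's exact formulation:

```python
import torch
import torch.nn.functional as F

def energy_bce_step(energy_net, generator, x_real, z, opt):
    """One step optimizing the energy function with BCE instead of SGLD:
    real data should get low energy, generated samples high energy."""
    x_fake = generator(z).detach()        # negative samples from the generator
    logits_real = -energy_net(x_real)     # interpret -E(x) as a "real" logit
    logits_fake = -energy_net(x_fake)
    loss = F.binary_cross_entropy_with_logits(
        logits_real, torch.ones_like(logits_real)
    ) + F.binary_cross_entropy_with_logits(
        logits_fake, torch.zeros_like(logits_fake)
    )
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```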


Continue Reading
The Inductive Bottleneck: Data-Driven Emergence of Representational Sparsity in Vision Transformers
Neutral · Artificial Intelligence
Recent research has identified an 'Inductive Bottleneck' in Vision Transformers (ViTs), where these models exhibit a U-shaped entropy profile, compressing information in middle layers before expanding it for final classification. This phenomenon is linked to the semantic abstraction required by specific tasks and is not merely an architectural flaw but a data-dependent adaptation observed across various datasets such as UC Merced, Tiny ImageNet, and CIFAR-100.
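One way to make the "U-shaped entropy profile" concrete is to trace a per-layer entropy estimate over token features. The spectral-entropy proxy below is an assumption for illustration; the paper's actual estimator is not specified in this summary:

```python
import torch

@torch.no_grad()
def spectral_entropy(feats: torch.Tensor) -> float:
    """feats: (tokens, dim) activations from one ViT layer."""
    x = feats - feats.mean(dim=0, keepdim=True)
    s = torch.linalg.svdvals(x)             # singular values of centered features
    p = (s ** 2) / (s ** 2).sum()           # normalized spectrum
    return float(-(p * (p + 1e-12).log()).sum())

# profile = [spectral_entropy(h) for h in per_layer_hidden_states]
# A U-shape would show entropy dipping in middle layers, then rising.
```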
Distribution Matching Variational AutoEncoder
Neutral · Artificial Intelligence
The Distribution-Matching Variational AutoEncoder (DMVAE) has been introduced to address limitations in existing visual generative models, which often compress images into a latent space without explicitly shaping its distribution. DMVAE aligns the encoder's latent distribution with an arbitrary reference distribution, allowing for a more flexible modeling approach beyond the conventional Gaussian prior.
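A hedged sketch of distribution matching in latent space: an RBF-kernel MMD penalty is one standard way to align the encoder's latents with samples from an arbitrary reference distribution, though DMVAE's actual objective may differ:

```python
import torch

def rbf_mmd(z: torch.Tensor, z_ref: torch.Tensor, sigma: float = 1.0) -> torch.Tensor:
    """Biased MMD^2 estimate between encoder latents z and reference samples z_ref."""
    def k(a, b):
        d2 = torch.cdist(a, b).pow(2)
        return torch.exp(-d2 / (2 * sigma ** 2))
    return k(z, z).mean() + k(z_ref, z_ref).mean() - 2 * k(z, z_ref).mean()

# total_loss = recon_loss + lambda_match * rbf_mmd(encoder(x), reference_sampler(n))
```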
One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation
Positive · Artificial Intelligence
A new framework called Feature Auto-Encoder (FAE) has been introduced to adapt pre-trained visual representations for image generation, addressing challenges in aligning high-dimensional features with low-dimensional generative models. This approach aims to simplify the adaptation process, enhancing the efficiency and quality of generated images.
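A minimal sketch of the "one layer is enough" idea: a single linear layer maps frozen, high-dimensional encoder features into a low-dimensional latent, with a mirror layer for reconstruction. The dimensions and class name are illustrative, not the paper's specification:

```python
import torch.nn as nn

class FeatureAutoEncoder(nn.Module):
    """Adapts frozen encoder features (feat_dim) to a small latent (latent_dim)."""
    def __init__(self, feat_dim: int = 1024, latent_dim: int = 32):
        super().__init__()
        self.encode = nn.Linear(feat_dim, latent_dim)  # the single adapter layer
        self.decode = nn.Linear(latent_dim, feat_dim)  # reconstructs the features

    def forward(self, feats):
        z = self.encode(feats)
        return self.decode(z), z
```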
Exploring Adversarial Watermarking in Transformer-Based Models: Transferability and Robustness Against Defense Mechanism for Medical Images
Neutral · Artificial Intelligence
Recent research has explored the vulnerabilities of Vision Transformers (ViTs) in medical image analysis, particularly their susceptibility to adversarial watermarking, which embeds imperceptible perturbations in images. The study highlights the challenges this poses for deep learning models in dermatological image analysis, where ViTs are increasingly adopted because their self-attention mechanisms improve performance on computer vision tasks.
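For intuition, an FGSM-style perturbation is one common way to embed an imperceptible adversarial "watermark"; the study's exact watermarking scheme may differ:

```python
import torch

def fgsm_watermark(model, x, y, eps: float = 2 / 255):
    """Single-step gradient-sign perturbation of images x (in [0, 1])."""
    x = x.clone().requires_grad_(True)
    loss = torch.nn.functional.cross_entropy(model(x), y)
    loss.backward()
    return (x + eps * x.grad.sign()).clamp(0, 1).detach()
```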
PrunedCaps: A Case For Primary Capsules Discrimination
Positive · Artificial Intelligence
A recent study has introduced a pruned version of Capsule Networks (CapsNets), demonstrating that it can operate up to 9.90 times faster than traditional architectures by eliminating 95% of Primary Capsules while maintaining accuracy across various datasets, including MNIST and CIFAR-10.
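A hedged sketch of the pruning step: rank Primary Capsules by a discrimination score and keep only the top fraction. Scoring by mean activation norm is an assumption; the paper's actual criterion may differ:

```python
import torch

def prune_primary_capsules(caps: torch.Tensor, keep_ratio: float = 0.05):
    """caps: (batch, num_capsules, capsule_dim). Keeps top 5% by default."""
    scores = caps.norm(dim=-1).mean(dim=0)       # one score per capsule
    k = max(1, int(keep_ratio * caps.shape[1]))
    keep = scores.topk(k).indices
    return caps[:, keep, :], keep
```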
Adaptive Dataset Quantization: A New Direction for Dataset Pruning
Positive · Artificial Intelligence
A new paper introduces an innovative dataset quantization method aimed at reducing storage and communication costs for large-scale datasets on resource-constrained edge devices. This approach focuses on compressing individual samples by minimizing intra-sample redundancy while retaining essential features, marking a shift from traditional inter-sample redundancy methods.
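As a rough illustration of reducing intra-sample redundancy, the sketch below keeps only the largest-magnitude DCT coefficients of each image, a classic transform-coding baseline; the paper's adaptive quantizer is more sophisticated than this:

```python
import numpy as np
from scipy.fft import dctn, idctn

def compress_sample(img: np.ndarray, keep_ratio: float = 0.1) -> np.ndarray:
    """Zero out all but the top keep_ratio fraction of DCT coefficients."""
    coeffs = dctn(img, norm="ortho")
    thresh = np.quantile(np.abs(coeffs), 1 - keep_ratio)
    return idctn(np.where(np.abs(coeffs) >= thresh, coeffs, 0), norm="ortho")
```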
CLUENet: Cluster Attention Makes Neural Networks Have Eyes
Positive · Artificial Intelligence
The CLUster attEntion Network (CLUENet) has been introduced as a novel deep architecture aimed at enhancing visual semantic understanding by addressing the limitations of existing convolutional and attention-based models, particularly their rigid receptive fields and complex architectures. This innovation incorporates global soft aggregation, hard assignment, and improved cluster pooling strategies to enhance local modeling and interpretability.
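A hedged sketch of cluster attention: tokens are softly aggregated into learned cluster centers, with a hard assignment available for pooling. This paraphrases the summary's description rather than CLUENet's exact modules:

```python
import torch

def cluster_attention(tokens: torch.Tensor, centers: torch.Tensor, tau: float = 1.0):
    """tokens: (n, d) patch features; centers: (k, d) learned cluster centers."""
    logits = tokens @ centers.t() / tau            # token-to-cluster similarity
    soft = logits.softmax(dim=-1)                  # global soft assignment (n, k)
    mass = soft.sum(dim=0).clamp_min(1e-6)         # total assignment per cluster
    aggregated = (soft.t() @ tokens) / mass.unsqueeze(-1)  # soft aggregation (k, d)
    hard = logits.argmax(dim=-1)                   # hard assignment per token
    return aggregated, hard
```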
Arc Gradient Descent: A Mathematically Derived Reformulation of Gradient Descent with Phase-Aware, User-Controlled Step Dynamics
Positive · Artificial Intelligence
The paper introduces Arc Gradient Descent (ArcGD), a new optimizer that reformulates traditional gradient descent to incorporate phase-aware, user-controlled step dynamics. Evaluations show ArcGD outperforming the Adam optimizer on a non-convex benchmark and a real-world ML dataset, particularly in challenging settings such as the Rosenbrock function and CIFAR-10 image classification.
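Purely for illustration of what "phase-aware, user-controlled step dynamics" can look like, the sketch below layers a cosine phase modulation on plain SGD; this is not the paper's mathematically derived ArcGD update:

```python
import math
import torch

def phase_sgd_step(params, t: int, lr: float = 0.01,
                   alpha: float = 0.5, period: int = 100):
    """SGD step whose size oscillates with a user-controlled phase schedule."""
    phase = 2 * math.pi * (t % period) / period
    step = lr * (1.0 + alpha * math.cos(phase))   # phase-aware step size
    with torch.no_grad():
        for p in params:
            if p.grad is not None:
                p -= step * p.grad
```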