Source-Optimal Training is Transfer-Suboptimal

arXiv — stat.ML · Tuesday, November 25, 2025, 5:00 AM
  • A recent study highlights a fundamental misalignment in transfer learning: the source regularization that minimizes source risk is rarely the one that maximizes transfer benefit. The misalignment is characterized through phase boundaries for L2-SP ridge regression, showing that the optimal source penalty depends on task alignment and signal-to-noise ratio (a minimal numerical sketch of this setup follows below).
  • This finding challenges the common practice of tuning source regularization purely for source performance, suggesting that practitioners may need to choose source penalties with the downstream transfer task in mind.
  • The result connects to ongoing discussions in the AI community about how training strategies transfer across settings, including long-tailed datasets and class imbalance, and to broader questions about model training and evaluation in diverse applications.
— via World Pulse Now AI Editorial System
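The mismatch can be illustrated with a toy L2-SP ridge setup. The sketch below is a minimal synthetic example, not the paper's analysis: it fits a source ridge estimator under several source penalties, reuses each estimate as the L2-SP anchor on a related target task, and reports source and transfer parameter error as simple stand-ins for the two risks. The alignment parameter, penalty grid, and fixed target penalty are all assumptions made for illustration.

```python
# Minimal synthetic sketch of the source-vs-transfer mismatch described above.
# The alignment rho, penalty grid, and target penalty are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
d, n_src, n_tgt = 20, 100, 30
w_src_true = rng.normal(size=d)
rho = 0.6                                   # task alignment (assumed)
w_tgt_true = rho * w_src_true + np.sqrt(1 - rho**2) * rng.normal(size=d)

def ridge(X, y, lam, anchor=None):
    """Closed-form ridge / L2-SP: argmin ||y - Xw||^2 + lam * ||w - anchor||^2."""
    anchor = np.zeros(X.shape[1]) if anchor is None else anchor
    A = X.T @ X + lam * np.eye(X.shape[1])
    return np.linalg.solve(A, X.T @ y + lam * anchor)

Xs = rng.normal(size=(n_src, d)); ys = Xs @ w_src_true + rng.normal(size=n_src)
Xt = rng.normal(size=(n_tgt, d)); yt = Xt @ w_tgt_true + rng.normal(size=n_tgt)

for lam_src in [0.01, 0.1, 1.0, 10.0, 100.0]:
    w_hat_src = ridge(Xs, ys, lam_src)                     # source training
    src_err = np.mean((w_hat_src - w_src_true) ** 2)       # source parameter error
    w_hat_tgt = ridge(Xt, yt, lam=5.0, anchor=w_hat_src)   # L2-SP transfer, fixed target penalty
    tgt_err = np.mean((w_hat_tgt - w_tgt_true) ** 2)       # transfer parameter error
    print(f"lam_src={lam_src:7.2f}  source error={src_err:.3f}  transfer error={tgt_err:.3f}")

# The lam_src that minimizes source error need not minimize transfer error,
# illustrating the misalignment highlighted in the abstract.
```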


Continue Reading
DAASH: A Meta-Attack Framework for Synthesizing Effective and Stealthy Adversarial Examples
Positive · Artificial Intelligence
The introduction of DAASH, a meta-attack framework, marks a significant advancement in generating effective and perceptually aligned adversarial examples, addressing the limitations of traditional Lp-norm constrained methods. The framework strategically composes existing attack methods in a multi-stage process to improve both attack effectiveness and perceptual quality.
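As a rough illustration of composing existing attacks in stages (not DAASH itself), the sketch below chains an FGSM warm start with a few projected PGD refinement steps against a stand-in classifier; the model, epsilon, and step sizes are placeholders.

```python
# Not DAASH: a generic two-stage composition (FGSM warm start, then PGD refinement)
# to illustrate what chaining existing attacks can look like. Model and data are stand-ins.
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))   # stand-in classifier
x = torch.rand(4, 1, 28, 28); y = torch.randint(0, 10, (4,))
eps, alpha = 0.1, 0.02
loss_fn = nn.CrossEntropyLoss()

def grad_sign(x_adv):
    """Sign of the loss gradient with respect to the input."""
    x_adv = x_adv.clone().detach().requires_grad_(True)
    loss_fn(model(x_adv), y).backward()
    return x_adv.grad.sign()

# Stage 1: single FGSM step as a coarse starting point.
x_adv = (x + eps * grad_sign(x)).clamp(0, 1)

# Stage 2: PGD-style refinement, projected back into the eps-ball around x.
for _ in range(10):
    x_adv = x_adv + alpha * grad_sign(x_adv)
    x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)

print("mean perturbation:", (x_adv - x).abs().mean().item())
```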
Oscillations Make Neural Networks Robust to Quantization
Positive · Artificial Intelligence
Recent research challenges the notion that weight oscillations during Quantization Aware Training (QAT) are merely undesirable effects, proposing instead that they are crucial for enhancing the robustness of neural networks. The study demonstrates that these oscillations, induced by a new regularizer, can help maintain performance across various quantization levels, particularly in models like ResNet-18 and Tiny Vision Transformer evaluated on CIFAR-10 and Tiny ImageNet datasets.
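For context, quantization-aware training typically passes weights through a fake-quantization step whose gradient is approximated with a straight-through estimator; weights near bin boundaries can then flip between bins from step to step, which is the oscillation behavior the paper studies. The sketch below shows only this standard building block, not the paper's oscillation-inducing regularizer.

```python
# Minimal fake-quantization with a straight-through estimator (STE), the standard
# QAT building block; the paper's new regularizer is not reproduced here.
import torch

class FakeQuant(torch.autograd.Function):
    @staticmethod
    def forward(ctx, w, n_bits=4):
        qmax = 2 ** (n_bits - 1) - 1
        scale = w.abs().max().clamp(min=1e-8) / qmax
        return torch.round(w / scale) * scale        # quantize-dequantize

    @staticmethod
    def backward(ctx, grad_out):
        return grad_out, None                        # STE: pass the gradient through

w = torch.randn(8, requires_grad=True)
opt = torch.optim.SGD([w], lr=0.1)
target = torch.zeros(8)
for step in range(5):
    loss = ((FakeQuant.apply(w) - target) ** 2).mean()
    opt.zero_grad(); loss.backward(); opt.step()
    # Weights near quantization-bin boundaries can oscillate between bins across
    # steps; the paper argues such oscillations help robustness to quantization.
    print(step, loss.item())
```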
Learning effective pruning at initialization from iterative pruning
Positive · Artificial Intelligence
A recent study explores the potential of pruning at initialization (PaI) by drawing inspiration from iterative pruning methods, aiming to enhance performance in deep learning models. The research highlights the significance of identifying surviving subnetworks based on initial features, which could lead to more efficient pruning strategies and reduced training costs, especially as neural networks grow in size.
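A common pruning-at-initialization baseline scores each weight on a single batch before any training and keeps the highest-scoring fraction. The sketch below uses a SNIP-style saliency score |w · ∂L/∂w| as an illustrative example; it is not the method proposed in the paper.

```python
# SNIP-style pruning-at-initialization sketch: score weights by |w * dL/dw| on one
# batch at init and keep the top fraction globally. Illustrative baseline only.
import torch
import torch.nn as nn

torch.manual_seed(0)
net = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))
x, y = torch.randn(64, 784), torch.randint(0, 10, (64,))

loss = nn.CrossEntropyLoss()(net(x), y)
weights = [p for p in net.parameters() if p.dim() > 1]       # weight matrices only
grads = torch.autograd.grad(loss, weights)

scores = torch.cat([(w * g).abs().flatten() for w, g in zip(weights, grads)])
keep = 0.1                                                   # keep 10% of weights globally
threshold = torch.quantile(scores, 1 - keep)

with torch.no_grad():
    for w, g in zip(weights, grads):
        mask = ((w * g).abs() >= threshold).float()
        w.mul_(mask)                                         # zero out low-saliency weights
        print("kept fraction in layer:", mask.mean().item())
```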
Fully Decentralized Certified Unlearning
Neutral · Artificial Intelligence
A recent study has introduced a method for fully decentralized certified unlearning in machine learning, focusing on the removal of specific data influences from trained models without a central coordinator. This approach, termed RR-DU, employs a random-walk procedure to enhance privacy and mitigate data poisoning risks, providing convergence guarantees in convex scenarios and stationarity in nonconvex cases.
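To make the random-walk idea concrete, the sketch below shows a generic decentralized update in which a token hops between neighboring nodes and each visited node takes a local gradient step on its retained data, with the forgotten samples simply excluded. The topology, step size, and exclusion rule are assumptions for illustration and do not reproduce RR-DU or its certification guarantees.

```python
# Not RR-DU: a generic random-walk decentralized update. A token hops between nodes;
# the visited node takes a local ridge-regression gradient step on retained data only.
import numpy as np

rng = np.random.default_rng(0)
d, n_nodes = 5, 4
w_true = rng.normal(size=d)

data = []
for _ in range(n_nodes):
    X = rng.normal(size=(30, d)); y = X @ w_true + 0.1 * rng.normal(size=30)
    retain = np.ones(30, dtype=bool); retain[:5] = False     # first 5 samples "unlearned"
    data.append((X[retain], y[retain]))

neighbors = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}     # ring topology (assumed)
w, node, lr, lam = np.zeros(d), 0, 0.01, 0.1

for step in range(500):
    X, y = data[node]
    grad = X.T @ (X @ w - y) / len(y) + lam * w              # local regularized gradient
    w -= lr * grad
    node = rng.choice(neighbors[node])                       # random walk to a neighbor

print("parameter error:", np.linalg.norm(w - w_true))
```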
Conditional Morphogenesis: Emergent Generation of Structural Digits via Neural Cellular Automata
Positive · Artificial Intelligence
A novel Conditional Neural Cellular Automata (c-NCA) architecture has been proposed, enabling the generation of distinct topological structures, specifically MNIST digits, from a single seed. This approach emphasizes local interactions and translation equivariance, diverging from traditional generative models that rely on global receptive fields.
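A neural cellular automaton updates every cell from its local neighborhood with a shared small network. The sketch below shows one untrained update step with fixed identity/Sobel perception filters and a one-hot digit label concatenated as the condition; the conditioning scheme is an assumption and may differ from the paper's c-NCA.

```python
# One untrained conditional-NCA update step: local perception via fixed filters,
# a shared per-cell network, a one-hot class condition, and a stochastic update mask.
# The conditioning scheme is assumed for illustration.
import torch
import torch.nn.functional as F

C, H, W, n_classes = 16, 28, 28, 10
state = torch.zeros(1, C, H, W); state[:, :, H // 2, W // 2] = 1.0    # single seed
label = F.one_hot(torch.tensor([3]), n_classes).float()               # condition: digit "3"

sobel_x = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]])
ident = torch.zeros(3, 3); ident[1, 1] = 1.0
kernels = torch.stack([ident, sobel_x, sobel_x.t()])                  # identity + Sobel x/y
kernels = kernels.repeat(C, 1, 1).unsqueeze(1)                        # (3C, 1, 3, 3)

update_net = torch.nn.Sequential(
    torch.nn.Linear(3 * C + n_classes, 64), torch.nn.ReLU(), torch.nn.Linear(64, C))

def nca_step(state, label):
    percept = F.conv2d(state, kernels, padding=1, groups=C)           # local perception
    feats = percept.permute(0, 2, 3, 1).reshape(-1, 3 * C)            # one row per cell
    cond = label.repeat(feats.shape[0], 1)
    delta = update_net(torch.cat([feats, cond], dim=1))
    delta = delta.reshape(1, H, W, C).permute(0, 3, 1, 2)
    alive = (torch.rand(1, 1, H, W) < 0.5).float()                    # stochastic cell updates
    return state + alive * delta

for _ in range(8):
    state = nca_step(state, label)
print("state mean/std:", state.mean().item(), state.std().item())
```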
Discovering Influential Factors in Variational Autoencoders
Neutral · Artificial Intelligence
A recent study has focused on the influential factors extracted by variational autoencoders (VAEs), highlighting the challenge of supervising learned representations without manual intervention. The research emphasizes the role of mutual information between inputs and learned factors as a key indicator for identifying influential factors, revealing that some factors may be non-influential and can be disregarded in data reconstruction.
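A common proxy for how much information a latent factor carries is the per-dimension KL divergence of the encoder posterior from the prior, averaged over inputs: dimensions with near-zero KL are effectively unused. The sketch below applies this diagnostic to synthetic encoder outputs; it is not necessarily the mutual-information estimator used in the study, and the threshold is an assumed value.

```python
# Per-dimension KL(q(z_i|x) || p(z_i)), averaged over a batch, as a proxy for how
# much information each latent factor carries; near-zero dimensions are candidates
# to disregard. Synthetic encoder outputs and threshold are illustrative assumptions.
import torch

torch.manual_seed(0)
n, d_z = 512, 8
scales = torch.tensor([2.0, 1.0, 0.5, 0.1, 0.01, 0.0, 1.5, 0.0])   # some dims "collapsed"
mu = torch.randn(n, d_z) * scales
logvar = torch.zeros(n, d_z)          # unit posterior variance for simplicity

# KL of a diagonal Gaussian posterior against a standard normal prior, per dimension.
kl_per_dim = 0.5 * (mu ** 2 + logvar.exp() - 1.0 - logvar).mean(dim=0)

influential = kl_per_dim > 0.05       # assumed threshold for "carries information"
for i, (kl, keep) in enumerate(zip(kl_per_dim.tolist(), influential.tolist())):
    print(f"z_{i}: avg KL = {kl:.3f}  influential = {keep}")
```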
Nonlinear Optimization with GPU-Accelerated Neural Network Constraints
Neutral · Artificial Intelligence
A new reduced-space formulation for optimizing trained neural networks has been proposed, which evaluates the network's outputs and derivatives on a GPU. This method treats the neural network as a 'gray box,' leading to faster solves and fewer iterations compared to traditional full-space formulations. The approach has been demonstrated on two optimization problems, including adversarial generation for a classifier trained on MNIST images.
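The gray-box idea can be sketched by exposing a trained network to a generic solver only through its value and input gradient, computed by autograd (on GPU when available), rather than encoding the network as explicit constraints. The example below hands such an oracle to SciPy's L-BFGS-B to push an input toward a chosen class under box bounds; the model and problem are placeholders, not the paper's formulation or benchmarks.

```python
# Gray-box reduced-space sketch: the network is visible to the solver only through
# its value and input gradient, evaluated by autograd (GPU if available).
# Model, objective, and bounds are placeholders for illustration.
import numpy as np
import torch
from scipy.optimize import minimize

device = "cuda" if torch.cuda.is_available() else "cpu"
net = torch.nn.Sequential(torch.nn.Linear(784, 64), torch.nn.Tanh(),
                          torch.nn.Linear(64, 10)).to(device)
target_class = 3

def objective(x_np):
    """Return f(x) and df/dx, where f is the negative logit of the target class."""
    x = torch.tensor(x_np, dtype=torch.float32, device=device, requires_grad=True)
    f = -net(x.unsqueeze(0))[0, target_class]
    f.backward()
    return f.item(), x.grad.cpu().numpy().astype(np.float64)

x0 = np.full(784, 0.5)
res = minimize(objective, x0, jac=True, method="L-BFGS-B",
               bounds=[(0.0, 1.0)] * 784, options={"maxiter": 50})
print("final objective:", res.fun)
```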
PrunedCaps: A Case For Primary Capsules Discrimination
Positive · Artificial Intelligence
A recent study has introduced a pruned version of Capsule Networks (CapsNets), demonstrating that it can operate up to 9.90 times faster than traditional architectures by eliminating 95% of Primary Capsules while maintaining accuracy across various datasets, including MNIST and CIFAR-10.
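As a rough illustration of discarding most primary capsules, the sketch below ranks capsules by their mean activation length over a batch and keeps only the top 5%; the ranking criterion is an assumption and may differ from the discrimination measure used in the paper.

```python
# Illustrative capsule pruning: rank primary capsules by mean activation length over a
# batch and keep the top 5%. The criterion is assumed, not the paper's discrimination measure.
import torch

torch.manual_seed(0)
batch, n_caps, caps_dim = 32, 1152, 8                  # typical CapsNet primary-capsule shape
primary_caps = torch.randn(batch, n_caps, caps_dim)    # stand-in activations

lengths = primary_caps.norm(dim=-1).mean(dim=0)        # mean length per capsule
keep = int(0.05 * n_caps)                              # retain 5%, prune 95%
top_idx = lengths.topk(keep).indices

pruned = primary_caps[:, top_idx, :]                   # forward only the surviving capsules
print("capsules kept:", pruned.shape[1], "of", n_caps)
```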