stable-pretraining-v1: Foundation Model Research Made Simple

arXiv — cs.LG · Wednesday, November 26, 2025, 5:00:00 AM
  • The stable-pretraining library has been introduced as a modular, performance-optimized tool for foundation model research, built on PyTorch, Lightning, Hugging Face, and TorchMetrics. It aims to simplify self-supervised learning (SSL) by packaging the essential training utilities and making training dynamics visible through comprehensive logging; a rough workflow sketch follows the summary below.
  • This development is significant as it addresses the challenges faced by researchers in the AI field, such as complex codebases and the engineering burden of scaling experiments. By streamlining the process, stable-pretraining promotes faster iterations and more effective experimentation.
  • The introduction of stable-pretraining aligns with ongoing efforts to enhance AI safety and efficiency, as seen in advancements like SaFeR-CLIP, which mitigates unsafe content in vision-language models. Additionally, the emphasis on modularity and flexibility reflects a broader trend in AI research towards creating integrated systems that can adapt to various tasks and domains.
— via World Pulse Now AI Editorial System
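The digest above describes the library only at a high level. As a rough illustration of the kind of self-supervised training loop such a tool streamlines, the sketch below implements a minimal SimCLR-style contrastive step directly on PyTorch Lightning; the module layout, two-view batch format, and NT-Xent loss are generic assumptions for illustration, not the stable-pretraining API.

```python
# A minimal SimCLR-style contrastive step written directly on PyTorch Lightning.
# Everything here (module layout, two-view batch format, NT-Xent loss) is a
# generic illustration, not the stable-pretraining API.
import torch
import torch.nn.functional as F
import pytorch_lightning as pl
from torchvision.models import resnet18


class ContrastivePretrainer(pl.LightningModule):
    def __init__(self, temperature: float = 0.1):
        super().__init__()
        self.backbone = resnet18(num_classes=128)  # final fc doubles as a projector
        self.temperature = temperature

    def training_step(self, batch, batch_idx):
        # assumed batch format: two augmented views of the same images, plus labels we ignore
        (view1, view2), _ = batch
        z1 = F.normalize(self.backbone(view1), dim=1)
        z2 = F.normalize(self.backbone(view2), dim=1)
        z = torch.cat([z1, z2], dim=0)              # (2N, D)
        sim = z @ z.t() / self.temperature          # pairwise cosine similarities
        sim.fill_diagonal_(float("-inf"))           # mask self-similarity
        n = z1.size(0)
        # positives: view1[i] pairs with view2[i] and vice versa
        targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)]).to(z.device)
        loss = F.cross_entropy(sim, targets)
        self.log("train/nt_xent", loss)             # training dynamics surfaced via logging
        return loss

    def configure_optimizers(self):
        return torch.optim.AdamW(self.parameters(), lr=1e-3, weight_decay=1e-4)
```

A standard `pl.Trainer(...).fit(model, dataloader)` call would drive this loop; per the summary, stable-pretraining's value lies in packaging the surrounding utilities and surfacing training dynamics through logging rather than in the loop itself.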

Continue Reading
Face, Whole-Person, and Object Classification in a Unified Space Via The Interleaved Multi-Domain Identity Curriculum
Positive · Artificial Intelligence
A new study introduces the Interleaved Multi-Domain Identity Curriculum (IMIC), enabling models to perform object recognition, face recognition from varying image qualities, and person recognition in a unified embedding space without significant catastrophic forgetting. This approach was tested on foundation models DINOv3, CLIP, and EVA-02, demonstrating comparable performance to domain experts across all tasks.
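The abstract gives no implementation detail for the curriculum itself. One plausible reading of "interleaved" multi-domain training is a sampler that alternates batches across domain-specific datasets so that no domain dominates any stretch of optimization; the round-robin generator below is a hypothetical sketch, not the IMIC procedure.

```python
# Hypothetical round-robin interleaving of domain-specific dataloaders: one
# plausible reading of a multi-domain identity curriculum, not the IMIC code.
from itertools import cycle
from typing import Dict, Iterator, Tuple

from torch.utils.data import DataLoader


def interleave_domains(loaders: Dict[str, DataLoader]) -> Iterator[Tuple[str, object]]:
    """Yield (domain_name, batch) pairs, visiting each domain in turn."""
    iterators = {name: iter(dl) for name, dl in loaders.items()}
    for name in cycle(loaders):                     # e.g. faces -> bodies -> objects -> faces ...
        try:
            yield name, next(iterators[name])
        except StopIteration:
            iterators[name] = iter(loaders[name])   # restart an exhausted domain
            yield name, next(iterators[name])
```

A training loop would draw a fixed number of steps from this generator and apply the same embedding-space objective to face, whole-person, and object batches alike.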
QiMeng-Kernel: Macro-Thinking Micro-Coding Paradigm for LLM-Based High-Performance GPU Kernel Generation
Positive · Artificial Intelligence
The QiMeng-Kernel framework introduces a Macro-Thinking Micro-Coding paradigm aimed at enhancing the generation of high-performance GPU kernels for AI and scientific computing. This approach addresses the challenges of correctness and efficiency in existing LLM-based methods by decoupling optimization strategies from implementation details, thereby improving both aspects significantly.
MMTU: A Massive Multi-Task Table Understanding and Reasoning Benchmark
Neutral · Artificial Intelligence
A new benchmark called MMTU has been introduced, featuring over 28,000 questions across 25 real-world table tasks, aimed at enhancing the evaluation of large language models (LLMs) in table-based applications. This initiative addresses the current limitations in benchmarking table-related tasks, which have been largely overlooked compared to other NLP benchmarks.
STAlloc: Enhancing Memory Efficiency in Large-Scale Model Training with Spatio-Temporal Planning
Positive · Artificial Intelligence
The introduction of STAlloc, a new GPU memory allocator for deep learning frameworks, aims to enhance memory efficiency during large-scale model training by reducing fragmentation caused by existing online memory allocators that overlook tensor lifespans. This innovation is particularly relevant as the demand for large language models (LLMs) continues to grow, leading to increased GPU memory pressure and potential out-of-memory errors.
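The summary's core idea, planning placements around tensor lifespans instead of allocating purely online, can be illustrated with a toy offline planner in which tensors whose live ranges never overlap may share the same address range. The greedy first-fit scheme below is an illustrative assumption, not STAlloc's algorithm.

```python
# Toy offline memory planner: place each tensor at the lowest offset that does
# not collide, in both time and address space, with tensors already placed.
# A greedy illustration of lifespan-aware planning, not STAlloc's algorithm.
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class TensorSpec:
    name: str
    start: int   # first training step the tensor is live
    end: int     # last training step the tensor is live
    size: int    # bytes


def plan(tensors: List[TensorSpec]) -> Dict[str, int]:
    placed: List[tuple] = []                 # (spec, assigned offset)
    offsets: Dict[str, int] = {}
    # place larger, longer-lived tensors first to reduce fragmentation
    for t in sorted(tensors, key=lambda s: (s.end - s.start) * s.size, reverse=True):
        offset = 0
        while True:
            blocker = next(
                (p_off + p.size
                 for p, p_off in placed
                 if not (t.end < p.start or p.end < t.start)                        # lifetimes overlap
                 and not (offset + t.size <= p_off or p_off + p.size <= offset)),   # addresses overlap
                None)
            if blocker is None:
                break
            offset = blocker                 # jump past the blocking tensor and retry
        placed.append((t, offset))
        offsets[t.name] = offset
    return offsets


# act1 and act2 are never live together, so they can share offset 0
print(plan([TensorSpec("act1", 0, 2, 4), TensorSpec("act2", 3, 5, 4), TensorSpec("grad", 1, 4, 2)]))
```

In the example, the planned peak is 6 bytes instead of the 10 required if every tensor were given its own region, which is the kind of fragmentation saving the abstract describes.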
NNGPT: Rethinking AutoML with Large Language Models
Positive · Artificial Intelligence
NNGPT has been introduced as an open-source framework that transforms large language models into self-improving AutoML engines, particularly for neural network development in computer vision. This framework enhances neural network datasets by generating new models, allowing for continuous fine-tuning through a closed-loop system of generation, assessment, and self-improvement.
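The closed loop described here (generation, assessment, self-improvement) can be sketched abstractly; every callable in the snippet below is a hypothetical placeholder standing in for NNGPT's actual components.

```python
# Abstract skeleton of a generate -> assess -> self-improve loop; every callable
# here is a hypothetical placeholder, not an NNGPT component.
from typing import Callable, List, Tuple

History = List[Tuple[str, float]]            # (proposed model code, score)


def closed_loop(generate: Callable[[History], str],    # LLM proposes a network
                evaluate: Callable[[str], float],      # train/score the proposal
                finetune: Callable[[History], None],   # update the LLM on outcomes
                rounds: int = 10) -> History:
    history: History = []
    for _ in range(rounds):
        candidate = generate(history)        # generation conditioned on past results
        score = evaluate(candidate)          # assessment, e.g. validation accuracy
        history.append((candidate, score))
        finetune(history)                    # self-improvement on the growing record
    return sorted(history, key=lambda item: item[1], reverse=True)
```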
Concept-Aware Batch Sampling Improves Language-Image Pretraining
Positive · Artificial Intelligence
A recent study introduces Concept-Aware Batch Sampling (CABS), a novel framework designed to enhance language-image pretraining by utilizing a dynamic, concept-based approach to data curation. This method builds on DataConcept, a dataset of 128 million annotated image-text pairs, allowing for more adaptive and efficient training processes in vision-language models.
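The summary describes curating batches around concepts rather than drawing image-text pairs uniformly. A rough reading is a sampler that assembles each batch to cover a sampled set of concepts; the index structure and sampling rule below are illustrative assumptions, not the CABS algorithm.

```python
# Hypothetical concept-aware batch sampler: each batch is assembled to cover a
# sampled set of concepts rather than drawing pairs uniformly at random.
# An illustrative assumption, not the CABS algorithm.
import random
from collections import defaultdict
from typing import Dict, List, Sequence


def build_concept_index(concepts_per_example: Sequence[Sequence[str]]) -> Dict[str, List[int]]:
    """Map each concept to the indices of examples annotated with it."""
    index: Dict[str, List[int]] = defaultdict(list)
    for i, concepts in enumerate(concepts_per_example):
        for c in concepts:
            index[c].append(i)
    return index


def sample_batch(index: Dict[str, List[int]], batch_size: int, rng: random.Random) -> List[int]:
    concepts = rng.sample(sorted(index), k=min(batch_size, len(index)))
    batch = [rng.choice(index[c]) for c in concepts]      # one example per chosen concept
    while len(batch) < batch_size:                        # top up if concepts are scarce
        batch.append(rng.choice(index[rng.choice(concepts)]))
    return batch


rng = random.Random(0)
index = build_concept_index([["dog", "grass"], ["cat"], ["dog", "ball"], ["skyline"]])
print(sample_batch(index, batch_size=3, rng=rng))
```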
Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition
Positive · Artificial Intelligence
A novel framework called the correlation adaptation prompt network (CAPNET) has been proposed to enhance long-tailed multi-label visual recognition, addressing the challenges posed by imbalanced class distributions in datasets. This approach leverages pre-trained vision-language models like CLIP to better model label correlations, aiming to improve performance on tail classes that are often neglected in traditional methods.
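For context on the kind of CLIP usage such a framework builds on, the snippet below scores each label independently against an image with off-the-shelf CLIP from Hugging Face transformers, the usual zero-shot multi-label starting point; the prompt template and labels are arbitrary, and none of this is CAPNET itself.

```python
# Per-label CLIP similarity scores for multi-label recognition: the off-the-shelf
# zero-shot baseline that prompt-based methods build on. The prompt template and
# labels are arbitrary; this is not the CAPNET model.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

labels = ["person", "dog", "bicycle", "traffic light"]
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("street.jpg")                          # any RGB image
inputs = processor(text=[f"a photo of a {label}" for label in labels],
                   images=image, return_tensors="pt", padding=True)

with torch.no_grad():
    img = model.get_image_features(pixel_values=inputs["pixel_values"])
    txt = model.get_text_features(input_ids=inputs["input_ids"],
                                  attention_mask=inputs["attention_mask"])

# cosine similarity between the image and each label prompt; higher = more likely present
img = img / img.norm(dim=-1, keepdim=True)
txt = txt / txt.norm(dim=-1, keepdim=True)
scores = (img @ txt.t()).squeeze(0)
for label, s in zip(labels, scores.tolist()):
    print(f"{label}: {s:.3f}")
```

Each label is scored in isolation here; CAPNET's stated contribution is learning prompts that capture correlations between labels so that rare tail classes can borrow signal from related head classes.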
What enterprises should know about the White House's new AI 'Manhattan Project', the Genesis Mission
Neutral · Artificial Intelligence
President Donald Trump announced the Genesis Mission on November 24, 2025, likening it to the Manhattan Project and presenting it as an effort to revolutionize scientific research in the U.S. The initiative directs the Department of Energy to create a closed-loop AI experimentation platform that integrates national laboratories and government data for enhanced research capabilities.