An Empirical Survey of Model Merging Algorithms for Social Bias Mitigation

arXiv cs.CL • Wednesday, December 3, 2025 at 5:00:00 AM

Neutral • Artificial Intelligence

A recent empirical survey examined seven model merging algorithms, including Linear, Karcher Mean, and SLERP, as tools for mitigating social bias in large language models (LLMs). The study applied them to 13 open-weight models from the GPT, LLaMA, and Qwen families, measured bias on three datasets (BBQ, BOLD, and HONEST), and assessed downstream performance on tasks from the SuperGLUE benchmark.
The survey matters because societal biases in LLMs can undermine fairness and social trust, and mitigating them remains an open challenge. The findings indicate a trade-off between bias reduction and model accuracy, with the accuracy cost falling particularly on tasks that require reading comprehension and reasoning. That trade-off raises questions about how practical these bias mitigation techniques are.
Bias in AI models is drawing growing attention across studies of how LLMs perform, how they are evaluated, and how they make decisions. The survey adds to that broader discussion of AI ethics by underscoring the need for frameworks that improve model performance while also ensuring fairness and accountability in AI applications.
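
For readers unfamiliar with the merging methods named in the summary, the following is a minimal sketch of two of them, Linear and SLERP, applied to flat weight vectors. It is an illustration under stated assumptions, not the survey's implementation: the function names, the interpolation weight `t`, and the toy vectors are all hypothetical, and merging real models would apply the same arithmetic to each parameter tensor.

```python
import math
from typing import List

Vector = List[float]


def linear_merge(a: Vector, b: Vector, t: float) -> Vector:
    """Linear merge: elementwise (1 - t) * a + t * b."""
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]


def slerp_merge(a: Vector, b: Vector, t: float, eps: float = 1e-8) -> Vector:
    """Spherical linear interpolation (SLERP) between two weight vectors.

    Falls back to linear_merge when either vector is near zero or the
    two vectors are nearly parallel, where the SLERP formula is unstable.
    """
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a < eps or norm_b < eps:
        return linear_merge(a, b, t)

    # Angle between the two vectors, clamped to guard against rounding.
    cos_omega = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    cos_omega = max(-1.0, min(1.0, cos_omega))
    omega = math.acos(cos_omega)
    sin_omega = math.sin(omega)
    if abs(sin_omega) < eps:
        return linear_merge(a, b, t)

    weight_a = math.sin((1.0 - t) * omega) / sin_omega
    weight_b = math.sin(t * omega) / sin_omega
    return [weight_a * x + weight_b * y for x, y in zip(a, b)]


if __name__ == "__main__":
    # Hypothetical toy "weights" standing in for two models' parameters.
    base = [1.0, 0.0]
    other = [0.0, 1.0]
    print("linear:", linear_merge(base, other, 0.5))
    print("slerp: ", slerp_merge(base, other, 0.5))
```

With two unit-length, orthogonal inputs, the linear merge at `t = 0.5` shrinks the vector norm, while SLERP keeps the merged vector on the unit sphere. This geometric difference is one reason the two methods can behave differently in practice.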

— via World Pulse Now AI Editorial System
