Distributive Fairness in Large Language Models: Evaluating Alignment with Human Values

arXiv — cs.CL · Tuesday, November 25, 2025, 5:00 AM
  • A recent study evaluated how well large language models (LLMs) align with human values, focusing on distributive fairness concepts such as equitability and Rawlsian maximin (illustrated in the sketch below). The findings reveal a significant misalignment between LLM responses and human distributional preferences, indicating that these models struggle to address societal questions of resource distribution effectively.
  • This finding matters because it highlights the limitations of current LLMs in decision-making contexts, particularly in social and economic domains where fairness is essential. The models' failure to use money as a resource to reduce inequality raises concerns about their effectiveness as agents in these areas.
  • The challenges faced by LLMs in aligning with human values reflect broader issues in artificial intelligence, including the need for improved evaluation frameworks and methodologies. As the demand for LLMs grows, addressing their shortcomings in fairness and truthfulness becomes increasingly important, especially in light of ongoing debates about bias and the ethical implications of AI technologies.
— via World Pulse Now AI Editorial System
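
To make the two fairness criteria concrete, below is a minimal, illustrative Python sketch (not code from the study): it scores toy splits of a fixed budget across recipients with unequal starting wealth under equitability, read here as the gap between the best- and worst-off, and under Rawlsian maximin, which judges an allocation by its worst-off member. The recipients, numbers, and function names are all hypothetical.

```python
# Illustrative only: toy comparison of two distributive-fairness criteria,
# equitability (smaller gap between best- and worst-off is better) and
# Rawlsian maximin (a higher payoff for the worst-off is better).

def equitability_gap(final_holdings):
    """Spread between the best- and worst-off recipient; smaller is more equitable."""
    return max(final_holdings) - min(final_holdings)

def rawlsian_maximin(final_holdings):
    """Rawls' maximin criterion: judge an allocation by its worst-off member."""
    return min(final_holdings)

def allocate(initial_wealth, transfers):
    """Apply a budget split (transfers) to unequal starting positions."""
    return [w + t for w, t in zip(initial_wealth, transfers)]

initial = [10, 4, 1]                      # hypothetical unequal starting wealth
budget_splits = {                         # two ways to hand out a budget of 6
    "equal split":      [2, 2, 2],
    "need-based split": [0, 1, 5],
}

for name, transfers in budget_splits.items():
    final = allocate(initial, transfers)
    print(f"{name}: holdings={final}, "
          f"gap={equitability_gap(final)}, worst-off={rawlsian_maximin(final)}")
```

In this toy example the need-based split scores better on both criteria, which connects to the summary's observation that LLMs often fail to use money to reduce inequality in the way human distributional preferences would suggest.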

Continue Reading
Look to the human brain for a glimpse of AI’s future
Positive · Artificial Intelligence
Recent discussions highlight the potential of the human brain as a low-power model for the future of artificial intelligence (AI), particularly in the development of large language models (LLMs). This perspective shifts the focus from AI's traditionally high energy demands to a more sustainable approach inspired by biological systems.
MindEval: Benchmarking Language Models on Multi-turn Mental Health Support
Neutral · Artificial Intelligence
The introduction of MindEval marks a significant advancement in the evaluation of language models for multi-turn mental health support, addressing the limitations of current AI chatbots that often reinforce maladaptive beliefs. Developed in collaboration with Ph.D.-level licensed clinical psychologists, the framework aims to enhance the realism of simulated therapeutic conversations through automated evaluation methods.
PRADA: Probability-Ratio-Based Attribution and Detection of Autoregressive-Generated Images
Positive · Artificial Intelligence
A new method named PRADA (Probability-Ratio-Based Attribution and Detection of Autoregressive-Generated Images) has been introduced to effectively detect images generated by autoregressive models, addressing a significant gap in the current landscape of image synthesis technologies. This approach analyzes the probability ratios of model-generated images to distinguish their origins reliably.
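
As a rough, hypothetical illustration of the general idea rather than PRADA's actual procedure, a probability-ratio test can compare how likely the same tokenized image is under candidate autoregressive models and attribute it to the one that assigns the higher likelihood. The per-token log-probabilities below are placeholder values.

```python
import numpy as np

def log_likelihood(token_logprobs):
    """Total log-probability a model assigns to an image's token sequence."""
    return float(np.sum(token_logprobs))

def log_probability_ratio(logp_model_a, logp_model_b):
    """Positive values mean the sequence is more likely under model A than model B."""
    return log_likelihood(logp_model_a) - log_likelihood(logp_model_b)

# Placeholder per-token log-probs for the same tokenized image under two models.
logp_a = np.array([-1.2, -0.8, -1.0, -0.9])
logp_b = np.array([-2.1, -1.7, -1.9, -2.0])

ratio = log_probability_ratio(logp_a, logp_b)
print("log probability ratio:", ratio)
print("attributed to:", "model A" if ratio > 0 else "model B")
```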
Gender Bias in Emotion Recognition by Large Language Models
Neutral · Artificial Intelligence
A recent study has investigated gender bias in emotion recognition by large language models (LLMs), revealing that these models may exhibit biases when interpreting emotional states based on descriptions of individuals and their environments. The research emphasizes the need for effective debiasing strategies, suggesting that training-based interventions are more effective than prompt-based approaches.
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space
Positive · Artificial Intelligence
The introduction of Sparse Sparse Attention (SSA) aims to enhance the efficiency of large language models (LLMs) by aligning outputs from both sparse and full attention mechanisms. This approach addresses the limitations of traditional sparse attention methods, which often suffer from performance degradation due to inadequate gradient updates during training. SSA proposes a unified framework that seeks to improve attention sparsity while maintaining model effectiveness.
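
A minimal sketch of what aligning the two outputs in feature space could look like, assuming a simple mean-squared objective between the sparse branch and a detached full-attention target; this is an assumed stand-in, not SSA's published loss.

```python
import torch
import torch.nn.functional as F

def attention_alignment_loss(sparse_out, full_out):
    """Mean-squared distance between sparse- and full-attention outputs of
    shape (batch, seq_len, hidden); the full-attention output is a fixed target."""
    return F.mse_loss(sparse_out, full_out.detach())

# Toy tensors standing in for the two branches' outputs.
sparse_out = torch.randn(2, 16, 64, requires_grad=True)
full_out = torch.randn(2, 16, 64)

loss = attention_alignment_loss(sparse_out, full_out)
loss.backward()   # gradients flow only into the sparse branch
print("alignment loss:", loss.item())
```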
BengaliFig: A Low-Resource Challenge for Figurative and Culturally Grounded Reasoning in Bengali
Positive · Artificial Intelligence
The introduction of BengaliFig marks a significant advancement in evaluating large language models (LLMs) in low-resource contexts, specifically targeting figurative and culturally grounded reasoning in Bengali. This dataset comprises 435 unique riddles from Bengali oral and literary traditions, annotated across multiple dimensions to enhance understanding of cultural nuances.
QiMeng-Kernel: Macro-Thinking Micro-Coding Paradigm for LLM-Based High-Performance GPU Kernel Generation
Positive · Artificial Intelligence
The QiMeng-Kernel framework introduces a Macro-Thinking Micro-Coding paradigm aimed at enhancing the generation of high-performance GPU kernels for AI and scientific computing. This approach addresses the challenges of correctness and efficiency in existing LLM-based methods by decoupling optimization strategies from implementation details, thereby improving both aspects significantly.
TurnBench-MS: A Benchmark for Evaluating Multi-Turn, Multi-Step Reasoning in Large Language Models
Positive · Artificial Intelligence
A new benchmark called TurnBench has been introduced to evaluate multi-turn, multi-step reasoning in large language models (LLMs). The benchmark is built around an interactive code-breaking task in which models must uncover hidden rules by making sequential guesses and integrating feedback over multiple rounds. It offers two modes, Classic and Nightmare, which test different levels of reasoning complexity.