Knowing But Not Doing: Convergent Morality and Divergent Action in LLMs
Neutral | Artificial Intelligence
- A recent study introduced ValAct-15k, a dataset built from 3,000 advice-seeking scenarios drawn from Reddit, designed to evaluate how Large Language Models (LLMs) represent and enact human values under Schwartz's Theory of Basic Human Values. The study assessed ten frontier LLMs from U.S. and Chinese companies and found a significant knowledge-action gap: both the LLMs and human participants showed only weak correspondence between the values they self-reported and the values they enacted (one way such correspondence might be scored is sketched after this list).
- The finding bears directly on value alignment in artificial intelligence: an LLM must not only understand human values but also act in accordance with them. The results suggest that while LLMs are consistent decision-makers, they translate stated values into action only weakly, which raises concerns about their reliability in real-world deployments.
- The research feeds ongoing debates over the ethical implications of AI and the difficulty of aligning machine behavior with human values. As LLMs are deployed across sectors such as legal interpretation and user-facing interaction, gaps between their professed values and their actual outputs could carry broader societal consequences, including bias and the misrepresentation of diverse perspectives.
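
To make the headline metric concrete: correspondence between self-reported and enacted values can be scored as a rank correlation across Schwartz's ten basic values. The sketch below is a minimal, hypothetical illustration under that assumption; the scores, variable names, and function names are invented for this example and are not taken from the ValAct-15k paper.

```python
# Hypothetical sketch: scoring the correspondence between a model's
# self-reported value priorities and the values it actually enacts,
# as a Spearman rank correlation over Schwartz's ten basic values.
# All numbers below are invented for illustration, NOT study data.
from scipy.stats import spearmanr

SCHWARTZ_VALUES = [
    "self-direction", "stimulation", "hedonism", "achievement", "power",
    "security", "conformity", "tradition", "benevolence", "universalism",
]

# Self-reported importance ratings (e.g., elicited by questionnaire)
# and enacted scores (e.g., how often advice responses expressed each
# value across scenarios). Both dictionaries are illustrative.
self_reported = {
    "self-direction": 9, "stimulation": 4, "hedonism": 3, "achievement": 8,
    "power": 2, "security": 7, "conformity": 5, "tradition": 4,
    "benevolence": 9, "universalism": 8,
}
enacted = {
    "self-direction": 5, "stimulation": 6, "hedonism": 7, "achievement": 4,
    "power": 6, "security": 5, "conformity": 8, "tradition": 7,
    "benevolence": 4, "universalism": 3,
}

def knowledge_action_correspondence(reported: dict, acted: dict) -> float:
    """Spearman rank correlation between reported and enacted value
    priorities; values near 0 indicate a knowledge-action gap."""
    reported_scores = [reported[v] for v in SCHWARTZ_VALUES]
    enacted_scores = [acted[v] for v in SCHWARTZ_VALUES]
    rho, _p = spearmanr(reported_scores, enacted_scores)
    return rho

if __name__ == "__main__":
    rho = knowledge_action_correspondence(self_reported, enacted)
    print(f"correspondence (Spearman rho): {rho:.2f}")
```

Under this framing, a rho near 1 would mean a model acts on the values it claims to hold, while the weak correspondence the study reports would appear as a rho near 0.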
— via World Pulse Now AI Editorial System


