Can LLMs Evaluate What They Cannot Annotate? Revisiting LLM Reliability in Hate Speech Detection
Neutral · Artificial Intelligence
- A recent study revisits the reliability of Large Language Models (LLMs) in detecting hate speech, highlighting the challenges that annotation subjectivity poses. Traditional agreement metrics such as Cohen's kappa fail to capture nuanced disagreement among human annotators, and the study argues that LLMs, while promising, cannot fully replace human judgment on subjective tasks. To assess LLM performance more faithfully, it introduces a subjectivity-aware framework, cross-Rater Reliability (xRR); a toy sketch contrasting the two views follows this list.
- This development is significant because it underscores the limitations of LLMs in critical areas such as hate speech detection, where the stakes for affected communities are high. By showing that LLM-generated annotations can diverge from human assessments, the research calls for a more nuanced approach to integrating AI into moderation tasks and emphasizes the need for human oversight.
- The findings resonate with ongoing discussions about LLM reliability and bias across applications such as survey simulation and factual-consistency assessment. As LLMs continue to evolve, concerns persist about whether they can faithfully represent diverse perspectives and avoid amplifying bias, underscoring the importance of evaluation frameworks that address these challenges.
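The sketch below is a minimal, hypothetical illustration of the evaluation gap described above: Cohen's kappa is computed against a collapsed majority-vote "ground truth", while a simplified cross-group, chance-corrected agreement (in the spirit of xRR, but not the paper's exact formulation) compares the LLM against the full human annotator pool. The toy labels and the `cross_group_kappa` helper are assumptions made for illustration only.

```python
# Minimal sketch (not the paper's implementation) contrasting Cohen's kappa
# against a simplified cross-group, chance-corrected agreement. All data here
# is a hypothetical toy example.
import numpy as np
from sklearn.metrics import cohen_kappa_score

# Toy data: 6 items, 3 human annotators (rows: items, cols: annotators),
# binary labels (1 = hate speech, 0 = not). Disagreement is deliberate.
human = np.array([
    [1, 1, 0],
    [0, 0, 0],
    [1, 0, 1],
    [0, 1, 0],
    [1, 1, 1],
    [0, 0, 1],
])
llm = np.array([1, 0, 1, 0, 1, 0])  # one LLM label per item

# Conventional view: collapse humans to a majority vote, then Cohen's kappa.
# This hides how much the humans themselves disagreed.
majority = (human.mean(axis=1) >= 0.5).astype(int)
print("Cohen's kappa vs. majority vote:", cohen_kappa_score(llm, majority))

def cross_group_kappa(group_a: np.ndarray, group_b: np.ndarray) -> float:
    """Chance-corrected agreement between two rater groups (xRR-style sketch).

    group_a: (n_items, n_raters_a) labels; group_b: (n_items, n_raters_b).
    Observed agreement averages over all cross-group rater pairs per item;
    expected agreement uses each group's marginal label distribution.
    """
    labels = np.union1d(group_a, group_b)
    # Observed: mean agreement of every (rater in A, rater in B) pair per item.
    p_obs = np.mean([
        (group_a[i][:, None] == group_b[i][None, :]).mean()
        for i in range(group_a.shape[0])
    ])
    # Expected: agreement if both groups labeled at random from their marginals.
    pa = np.array([(group_a == c).mean() for c in labels])
    pb = np.array([(group_b == c).mean() for c in labels])
    p_exp = float(pa @ pb)
    return (p_obs - p_exp) / (1.0 - p_exp)

# Subjectivity-aware view: treat the LLM as a one-member rater group and
# compare it against the full human pool, keeping disagreement visible.
print("Cross-group kappa (LLM vs. human pool):",
      cross_group_kappa(llm[:, None], human))
```

On this toy data the kappa against the majority vote is a perfect 1.0 even though the human annotators split on four of the six items, while the cross-group score drops to roughly 0.56. That kind of gap between agreement-with-a-collapsed-label and agreement-with-the-rater-pool is exactly what a subjectivity-aware evaluation is meant to surface.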
— via World Pulse Now AI Editorial System
