Do LLMs produce texts with "human-like" lexical diversity?

arXiv — cs.CLMonday, November 24, 2025 at 5:00:00 AM
  • A recent study has examined the lexical diversity of texts generated by various ChatGPT models, including ChatGPT-3.5, ChatGPT-4, ChatGPT-o4 mini, and ChatGPT-4.5, comparing them to texts written by native and non-native English speakers. The findings indicate significant differences in lexical diversity metrics, suggesting that LLMs do not produce writing that is truly human-like.
  • This research is crucial as it highlights the limitations of current large language models in mimicking human writing styles, which could impact their adoption in fields requiring nuanced language use, such as creative writing and education.
  • The ongoing debate about the capabilities of AI in creative domains continues, with studies indicating that while LLMs have advanced in reasoning and problem-solving, they still fall short of matching the creativity and lexical richness of human authors, raising questions about the future roles of AI in creative industries.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
Could ChatGPT convince you to buy something? Threat of manipulation looms as AI companies gear up to sell ads
NegativeArtificial Intelligence
The rise of artificial intelligence, particularly through platforms like ChatGPT, has raised concerns about potential manipulation as AI companies prepare to monetize their technologies through advertising. Eighteen months ago, the trajectory of AI seemed distinct from social media, but the consolidation of AI development under major tech firms has shifted this perspective.
Duffer Brothers Accused of Using ChatGPT for Final Season of “Stranger Things”
NegativeArtificial Intelligence
The Duffer Brothers, creators of the popular series 'Stranger Things,' are facing accusations of using OpenAI's ChatGPT in the writing process for the show's final season, leading to disappointment among fans regarding the finale's quality.
New Apple-Google deal pushes ChatGPT to the sidelines on iPhone
NegativeArtificial Intelligence
Apple's recent partnership with Google has led to the integration of Google's AI technologies into iPhones, effectively sidelining ChatGPT as a secondary option for users. This strategic move indicates a shift in Apple's AI strategy, prioritizing Google's offerings over those from OpenAI.
STAGE: A Benchmark for Knowledge Graph Construction, Question Answering, and In-Script Role-Playing over Movie Screenplays
NeutralArtificial Intelligence
The introduction of STAGE (Screenplay Text, Agents, Graphs and Evaluation) marks a significant advancement in the field of narrative understanding, providing a comprehensive benchmark for evaluating knowledge graph construction, scene-level event summarization, long-context screenplay question answering, and in-script character role-playing across 150 films in English and Chinese.
It's All About the Confidence: An Unsupervised Approach for Multilingual Historical Entity Linking using Large Language Models
PositiveArtificial Intelligence
A new approach called MHEL-LLaMo has been introduced for multilingual historical entity linking, utilizing a combination of a Small Language Model (SLM) and a Large Language Model (LLM). This unsupervised ensemble method addresses challenges in processing historical texts, such as linguistic variation and noisy inputs, by leveraging a multilingual bi-encoder for candidate retrieval and an instruction-tuned LLM for predictions.
Get away with less: Need of source side data curation to build parallel corpus for low resource Machine Translation
PositiveArtificial Intelligence
A recent study emphasizes the importance of data curation in machine translation, particularly for low-resource languages. The research introduces LALITA, a framework designed to optimize the selection of source sentences for creating parallel corpora, focusing on English-Hindi bi-text to enhance machine translation performance.
Analyzing Bias in False Refusal Behavior of Large Language Models for Hate Speech Detoxification
NeutralArtificial Intelligence
A recent study analyzed the false refusal behavior of large language models (LLMs) in the context of hate speech detoxification, revealing that these models disproportionately refuse tasks involving higher semantic toxicity and specific target groups, particularly in English datasets.
VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction Models
NeutralArtificial Intelligence
VocalBench has been introduced as a benchmarking tool to evaluate the conversational abilities of speech interaction models, utilizing approximately 24,000 curated instances in English and Mandarin across four dimensions: semantic quality, acoustic performance, conversational abilities, and robustness. This initiative aims to address the shortcomings of existing evaluations that fail to replicate real-world scenarios and provide comprehensive comparisons of model capabilities.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about