Incentives or Ontology? A Structural Rebuttal to OpenAI's Hallucination Thesis
Neutral | Artificial Intelligence
- OpenAI's recent thesis holds that hallucinations in large language models (LLMs) stem from misaligned evaluation incentives, and that better-designed benchmarks could therefore mitigate them. A new paper challenges this view, arguing that hallucination is inherent to the transformer architecture rather than a mere optimization failure. The authors contend that transformers build a pseudo-ontology from patterns of linguistic co-occurrence, and that in sparse regions of the data they interpolate fictions instead of abstaining (a minimal toy sketch of this interpolation claim appears below the summary).
- This development matters for OpenAI because it questions the foundational assumptions behind its approach to model evaluation and training. If hallucinations are structural rather than contingent, addressing them may require a fundamental rethinking of how LLMs are designed and assessed, with consequences for their reliability and their deployment across fields.
- The discourse around AI transparency and accountability is intensifying, particularly as OpenAI introduces methods like 'confessions' to enhance model honesty. This reflects a broader industry trend toward addressing ethical concerns in AI, where output reliability and a model's ability to self-report errors are becoming critical to public trust and regulatory scrutiny.
— via World Pulse Now AI Editorial System
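
To make the interpolation claim concrete, here is a minimal toy sketch. This is our own illustration, not code or formalism from the paper: it uses a word-level bigram model with additive smoothing over a hypothetical three-sentence corpus (the corpus, the smoothing constant `alpha`, and the function names are all illustrative assumptions). The point it demonstrates is structural: a model that only encodes co-occurrence statistics has no native way to say "I don't know"; for a context it has never seen, it still distributes probability mass over the vocabulary and produces a continuation, i.e. it interpolates rather than abstains.

```python
# Toy illustration (not from the paper): a smoothed bigram model still
# "answers" in sparse regions of its co-occurrence statistics instead of
# signaling missing knowledge.
from collections import defaultdict, Counter

corpus = [
    "paris is the capital of france",
    "berlin is the capital of germany",
    "tokyo is the capital of japan",
]

# Count bigram transitions and build the vocabulary.
counts = defaultdict(Counter)
vocab = set()
for sentence in corpus:
    words = sentence.split()
    vocab.update(words)
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1

def next_word_distribution(prev, alpha=0.1):
    """Smoothed P(next | prev); nonzero even for contexts never observed."""
    total = sum(counts[prev].values()) + alpha * len(vocab)
    return {w: (counts[prev][w] + alpha) / total for w in vocab}

# A context well supported by the data: the model is sharply peaked.
seen = next_word_distribution("of")
print("P(next | 'of'):", max(seen, key=seen.get), round(max(seen.values()), 3))

# A context the corpus never supports: probability mass is still assigned,
# and argmax still yields an answer -- a fluent gap-fill, not an abstention.
unseen = next_word_distribution("atlantis")
best = max(unseen, key=unseen.get)
print(f"P(next | 'atlantis'): argmax = {best!r} with p = {unseen[best]:.3f}")
```

A transformer's softmax over its vocabulary has the same property at vastly larger scale: some continuation always receives probability mass, so if the paper's structural reading is right, "unknown" is not a state the architecture can natively express, regardless of how evaluation incentives are tuned.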





