Physicist Steve Hsu publishes research built around a core idea generated by GPT-5

THE DECODER•Friday, December 5, 2025 at 1:09:57 PM

NeutralArtificial Intelligence

Physicist Steve Hsu publishes research built around a core idea generated by GPT-5

Physicist Steve Hsu has published a research paper based on an idea generated by GPT-5, highlighting the potential of AI in scientific inquiry while cautioning about its reliability, likening it to a 'brilliant but unreliable genius.'
This development underscores the growing intersection of artificial intelligence and scientific research, as Hsu's work exemplifies how AI can inspire new ideas, despite the inherent risks of inaccuracies in AI-generated content.
The broader implications of this event reflect ongoing debates about the trustworthiness of AI in academia, as researchers increasingly rely on AI tools like GPT-5 for assistance, while also grappling with the limitations and potential pitfalls of such technologies.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Airparser

Extract and parse data from documents using GPT-4 automation.

AI & DataTry the app

Humanize AI

Transform AI-generated text into undetectable, human-like content effortlessly.

Business & ProductivityTry the app

LucidQuery AI

Combines diffusion reasoning with autoregressive LLM for advanced AI analysis.

AI & DataTry the app

Continue Readings

THE DECODER15 hours ago

NYT sues AI search engine Perplexity for alleged content misuse

NegativeArtificial Intelligence

The New York Times has filed a lawsuit against the AI search engine Perplexity in federal court, alleging that the company has misused its content by illegally copying millions of articles and displaying them without permission. This legal action highlights ongoing tensions between traditional media and emerging AI technologies.

Read full article

via THE DECODER

VentureBeat — AI20 hours ago

AI denial is becoming an enterprise risk: Why dismissing “slop” obscures real capability gains

NegativeArtificial Intelligence

The recent release of GPT-5 by OpenAI has sparked a negative shift in public sentiment towards AI, with many users criticizing the model for its perceived flaws rather than recognizing its capabilities. This backlash has led to claims that AI progress is stagnating, with some commentators labeling the technology as 'AI slop'.

Read full article

via VentureBeat — AI

ZDNET — Artificial Intelligencea day ago

OpenAI is training models to 'confess' when they lie - what it means for future AI

NeutralArtificial Intelligence

OpenAI has developed a version of GPT-5 that can admit to its own errors, a significant step in addressing concerns about AI honesty and transparency. This new capability, referred to as 'confessions', aims to enhance the reliability of AI systems by encouraging them to self-report misbehavior. However, experts caution that this is not a comprehensive solution to the broader safety issues surrounding AI technology.

Read full article

via ZDNET — Artificial Intelligence

arXiv — cs.CVa day ago

ViRectify: A Challenging Benchmark for Video Reasoning Correction with Multimodal Large Language Models

PositiveArtificial Intelligence

The introduction of ViRectify presents a new benchmark aimed at evaluating the error correction capabilities of multimodal large language models (MLLMs) in complex video reasoning tasks. This benchmark addresses the existing gap in systematic evaluation, providing a dataset of over 30,000 instances across various domains such as dynamic perception and scientific reasoning.

Read full article

via arXiv — cs.CV

arXiv — cs.CVa day ago

6 Fingers, 1 Kidney: Natural Adversarial Medical Images Reveal Critical Weaknesses of Vision-Language Models

NeutralArtificial Intelligence

A new benchmark called AdversarialAnatomyBench has been introduced to evaluate vision-language models (VLMs) against naturally occurring rare anatomical variants, revealing significant performance drops in state-of-the-art models like GPT-5 and Gemini 2.5 Pro when faced with atypical anatomy. The accuracy decreased from 74% on typical anatomy to just 29% on atypical cases.

Read full article

via arXiv — cs.CV

THE DECODER2 days ago

Google rolls out Gemini 3 "Deep Think" for Gemini Ultra subscribers

PositiveArtificial Intelligence

Google AI has launched the Gemini 3 Deep Think mode for Ultra subscribers, enhancing user experience and interaction capabilities within the Gemini app. This rollout follows a previous delay for safety evaluations, indicating a careful approach to AI development.

Read full article

via THE DECODER

THE DECODER2 days ago

EU plans five AI gigafactories with 100,000 high-performance AI chips

PositiveArtificial Intelligence

The European Union has announced plans to establish five AI gigafactories, which will produce 100,000 high-performance AI chips. This initiative is part of a broader strategy to enhance the EU's AI infrastructure and competitiveness in the global market.

Read full article

via THE DECODER

THE DECODER2 days ago

OpenAI tests „Confessions“ to uncover hidden AI misbehavior

PositiveArtificial Intelligence

OpenAI is testing a new method called 'Confessions' to help its AI models acknowledge hidden misbehaviors, such as reward hacking and safety rule violations. This system encourages models to report their own rule-breaking in a separate report, rewarding honesty even if the initial response was misleading.

Read full article

via THE DECODER