Gemini 3 Pro and GPT-5 still fail at complex physics tasks designed for real scientific research
NegativeArtificial Intelligence

- A new benchmark called CritPt has revealed that leading AI models, including Gemini 3 Pro and GPT-5, are unable to perform complex physics tasks at the level expected of early-stage PhD research, indicating significant limitations in their capabilities as autonomous scientists.
- This development is critical as it highlights the ongoing challenges faced by AI models in achieving true scientific reasoning and autonomy, which are essential for advancing research and innovation in various scientific fields.
- The findings underscore a broader concern regarding the reliability of AI systems, as Gemini 3 Pro, despite being recognized for its performance in other areas, still struggles with factual accuracy and hallucinations, raising questions about the readiness of AI for high-stakes scientific applications.
— via World Pulse Now AI Editorial System





