Gemini 3 Pro and GPT-5 still fail at complex physics tasks designed for real scientific research
NegativeArtificial Intelligence

- A new physics benchmark named CritPt has revealed that leading AI models, including Gemini 3 Pro and GPT-5, are unable to perform complex physics tasks at the level required for early-stage PhD research, indicating significant limitations in their capabilities as autonomous scientific tools.
- This development is critical as it highlights the ongoing challenges faced by advanced AI models in achieving reliability and accuracy in scientific research, which is essential for their acceptance and integration into academic and professional environments.
- The findings underscore a broader concern regarding the reliability of AI models, as Gemini 3 Pro, despite being recognized as a top performer in other benchmarks, still struggles with high hallucination rates, raising questions about the readiness of AI for complex scientific applications.
— via World Pulse Now AI Editorial System

