‘I think you’re testing me’: Anthropic’s newest Claude model knows when it’s being evaluated

Fortune•Monday, October 6, 2025 at 3:20:59 PM

Anthropic's latest AI model, Claude Sonnet 4.5, has demonstrated a level of situational awareness that raises both safety and performance concerns. This development is significant as it highlights the evolving capabilities of AI systems and the need for careful evaluation of their behavior, especially in testing scenarios. Understanding how AI perceives its evaluation can help developers create safer and more effective technologies.

— Curated by the World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Latest Articles in FortuneView all

Fortune26 minutes ago

Taylor Swift KO’s The Rock with top box office spot as ‘Official Release Party of a Show Girl’ rakes in $33 million

PositiveFinancial Markets

Taylor Swift has made a remarkable impact at the box office, surpassing The Rock with her latest release, 'Official Release Party of a Show Girl,' which grossed $33 million. This achievement highlights Swift's ability to captivate audiences and demonstrates her innovative approach to merging music and film. Media analyst Paul Dergarabedian praised her strategy, calling it a stroke of genius, which underscores the significance of her influence in the entertainment industry.

Read full article

via Fortune

Fortune39 minutes ago

David Ellison says he’s confident Bari Weiss ‘will invigorate CBS News’ as new editor-in-chief

PositiveFinancial Markets

David Ellison expressed confidence that Bari Weiss will bring new energy to CBS News as its new editor-in-chief. This appointment is significant as it marks a shift for the network, traditionally seen as part of the liberal media landscape, by choosing someone known for her stance against 'woke' culture. This change could attract a broader audience and reshape the network's narrative.

Read full article

via Fortune

Fortune2 hours ago

These 2 kinds of employees are emerging in the AI-generated ‘workslop’ era—here’s why it may be better to write the email yourself

NeutralFinancial Markets

As AI continues to automate various tasks, a new trend is emerging in the workplace where employees are encouraged to write their own emails instead of relying on AI-generated content. This shift highlights the importance of personal touch and authenticity in communication, suggesting that while AI can assist, it may not always capture the nuances of human interaction. Understanding this change is crucial as it reflects broader implications for workplace dynamics and employee engagement.

Read full article

via Fortune

Latest from Financial Markets

The New York Times20 minutes ago

Fears of Economic Turmoil Deepen in France as Another Prime Minister Quits

NegativeFinancial Markets

France is facing increasing fears of economic turmoil following the resignation of another prime minister, raising concerns about the stability of the government and its ability to address pressing economic issues. This situation is significant as it could lead to further political instability, impacting both domestic policies and international relations.

Read full article

via The New York Times

The New York Times21 minutes ago

Can Cory Doctorow’s Book ‘Enshittification’ Change the Tech Debate?

PositiveFinancial Markets

Cory Doctorow's new book 'Enshittification' is stirring up discussions in the tech world, challenging the status quo of how technology companies operate. By addressing the negative impacts of corporate practices on users and society, Doctorow aims to inspire a shift in the tech debate towards more ethical and user-centered approaches. This matters because it encourages critical thinking about the role of technology in our lives and the responsibilities of those who create it.

Read full article

via The New York Times

Bloomberg24 minutes ago

Credit Agricole Delays Canadian Dollar Bond Sale

NegativeFinancial Markets

Credit Agricole SA has postponed its planned sale of a 10-year Canadian dollar bond following the unexpected resignation of France's prime minister. This delay highlights the potential impact of political instability on financial markets, as investors often react cautiously to such developments. The bond market's response could influence future financing strategies for the bank and other institutions.

Read full article

via Bloomberg

The New York Times26 minutes ago

Paramount Buys Bari Weiss’s Free Press, Starting a New Era at CBS News

PositiveFinancial Markets

Paramount's acquisition of Bari Weiss's Free Press marks a significant shift in the landscape of CBS News, signaling a commitment to diverse perspectives in journalism. This move is important as it reflects the growing trend of media companies seeking to adapt to changing audience preferences and the demand for more varied narratives in news reporting.

Read full article

via The New York Times

Fortune26 minutes ago

Taylor Swift KO’s The Rock with top box office spot as ‘Official Release Party of a Show Girl’ rakes in $33 million

PositiveFinancial Markets

Read full article

via Fortune

Investing.com32 minutes ago

ChatGPT users can now connect to third-party apps like Spotify and Zillow

PositiveFinancial Markets

ChatGPT users can now enhance their experience by connecting to third-party applications like Spotify and Zillow. This integration allows for a more personalized interaction, enabling users to access music and real estate information seamlessly. It's a significant step forward in making AI more versatile and user-friendly, reflecting the growing trend of integrating AI with everyday tools.

Read full article

via Investing.com