Anthropic's open-source safety tool found AI models whisteblowing - in all the wrong places

ZDNet•Tuesday, October 7, 2025 at 7:06:00 PM

Anthropic's new open-source safety tool, Petri, has revealed that AI models might be swayed by narrative patterns rather than a consistent effort to reduce harm. This finding is significant as it highlights the potential pitfalls in AI development, emphasizing the need for more robust safety measures. Understanding how these models operate can help developers create more reliable and ethical AI systems.

— Curated by the World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

Latest Articles in ZDNetView all

ZDNet2 hours ago

I tested this Oura Ring alternative with similar health tracking features but at half the price

PositiveTechnology

The RingConn Gen 2 Air is making waves as a budget-friendly alternative to the Oura Ring, offering impressive health-tracking features at half the price. This is particularly significant during Prime Day, when consumers are looking for affordable yet effective health tech options. With its competitive capabilities, the RingConn could be a game-changer for those seeking to monitor their health without breaking the bank.

Read full article

via ZDNet

ZDNet2 hours ago

Finally, an Android Auto adapter that's reliable, highly functional, and easy on the wallet

PositiveTechnology

The AAWireless Two is making waves in the tech world by providing a reliable and affordable solution for Android Auto users. Its seamless wireless experience, quick pairing, and ability to switch between multiple devices make it a standout choice for anyone looking to enhance their driving experience. This product matters because it simplifies connectivity for drivers, allowing them to focus on the road while enjoying their favorite apps.

Read full article

via ZDNet

ZDNet2 hours ago

This is the only lithium button battery brand I recommend now - for serious safety reasons

PositiveTechnology

Ingesting button cell batteries poses a serious risk, leading to thousands of injuries and even fatalities in the U.S. each year. This article highlights a specific brand of lithium button batteries that is recommended for its safety features, aiming to protect consumers and reduce the alarming statistics associated with battery ingestion. By choosing this brand, you can help ensure the safety of your household, especially for children and pets who are at greater risk.

Read full article

via ZDNet

Latest from Technology

Engadgetan hour ago

My beloved Dyson AM09 heater and fan is 40 percent off for Prime Day

PositiveTechnology

Great news for those looking to stay warm this winter! The Dyson AM09 heater and fan is currently 40% off for Prime Day, making it an excellent time to invest in this versatile appliance. Known for its sleek design and efficient performance, the AM09 not only heats your space but also cools it during warmer months. This discount makes it more accessible for consumers who want to enhance their home comfort without breaking the bank.

Read full article

via Engadget

TechRadar2 hours ago

A CPU vendor you've never heard of could launch a 96-core x86 CPU that is very, very similar to AMD's EPYC - but I wonder what the TDP will be

NeutralTechnology

A lesser-known CPU vendor, Zhaoxin, is set to launch its KH-50000 server CPU, which features a striking resemblance to AMD's EPYC design with 96 cores and advanced memory capabilities. While this development could shake up the server market, the efficiency details, particularly the thermal design power (TDP), remain unclear, leaving potential buyers curious about its performance.

Read full article

via TechRadar

Ars Technica2 hours ago

Tesla’s standard-range Model 3, Model Y join the lineup

NeutralTechnology

Tesla has introduced standard-range versions of its Model 3 and Model Y, featuring smaller batteries and reduced standard equipment, which results in lower prices. However, the price cuts may not be significant enough to attract a broader customer base. This move is important as it reflects Tesla's strategy to make electric vehicles more accessible while navigating the competitive EV market.

Read full article

via Ars Technica

VentureBeat2 hours ago

Google's AI can now surf the web for you, click on buttons, and fill out forms with Gemini 2.5 Computer Use

PositiveTechnology

Google's latest AI, Gemini 2.5, is making waves by allowing users to surf the web, click buttons, and fill out forms autonomously. This advancement signifies a major leap in AI capabilities, moving beyond simple chatbots to more interactive agents that can perform tasks on behalf of users. This is important as it could streamline online interactions and enhance productivity, making technology more accessible and efficient for everyone.

Read full article

via VentureBeat

ZDNet2 hours ago

I tested this Oura Ring alternative with similar health tracking features but at half the price

PositiveTechnology

Read full article

via ZDNet

ZDNet2 hours ago

Finally, an Android Auto adapter that's reliable, highly functional, and easy on the wallet

PositiveTechnology

Read full article

via ZDNet