In an experiment, Claude ran a vending machine in the WSJ newsroom and lost $1,000+ after it dropped prices to zero, gave away a free PlayStation, and more (Joanna Stern/Wall Street Journal)

Techmeme•Thursday, December 18, 2025 at 8:25:01 PM

NeutralArtificial Intelligence

In an experiment, Claude ran a vending machine in the WSJ newsroom and lost $1,000+ after it dropped prices to zero, gave away a free PlayStation, and more (Joanna Stern/Wall Street Journal)

In a recent experiment, Anthropic's AI model Claude operated a vending machine in the Wall Street Journal newsroom, resulting in a loss of over $1,000. The AI dropped prices to zero, distributed a free PlayStation, and made unusual purchases, including a live fish, showcasing its unpredictable behavior in a real-world scenario.
This incident highlights the challenges and risks associated with deploying AI in practical applications, particularly in commercial settings where financial implications are significant. It raises questions about the reliability and control of AI systems in managing business operations.
The event reflects broader concerns regarding the performance of AI models, as recent studies indicate that AI chatbots, including Claude, often struggle with complex tasks, such as recognizing mental health issues. This inconsistency in performance underscores the ongoing debate about the effectiveness and safety of AI technologies in various domains.

— via World Pulse Now AI Editorial System

Read Original

Was this article worth reading? Share it

One More Thing in AI

Master AI with curated tools and tutorials for practical, real-world applications.

Humanize AI

Transform AI-generated text into undetectable, human-like content effortlessly.

Business & ProductivityView app details

Talk Journal

AI-powered journaling that captures your thoughts effortlessly and organizes them daily.

AI & DataView app details

NewsperAI

Get AI-curated daily briefs on business, tech, and startup news for professionals.

AI & DataView app details

Chartlense

Empower your trading with AI-driven insights.

Finance & CryptoView app details

AI Humanizer

Transform AI text into human-like content that bypasses detection tools.

Business & ProductivityView app details

Continue Readings

Techmeme17 hours ago

Sources: OpenAI's new fundraising round could value it at as much as $830B; it aims to raise up to $100B and complete the round by the end of Q1 at the earliest (Wall Street Journal)

NeutralArtificial Intelligence

OpenAI is reportedly seeking to raise up to $100 billion in a new fundraising round, which could value the company at approximately $830 billion. The completion of this funding round is anticipated by the end of the first quarter of 2026 at the earliest. Concerns about a potential AI bubble have been raised, affecting the valuations of tech companies, including OpenAI.

Read full article

via Techmeme

Techmeme17 hours ago

Sources: Meta is developing a new image and video-focused AI model codenamed Mango, expected to be released in H1 2026 along with its new LLM dubbed Avocado (Meghan Bobrowsky/Wall Street Journal)

NeutralArtificial Intelligence

Meta is developing a new AI model focused on images and videos, codenamed Mango, which is expected to be released in the first half of 2026 alongside a new language model named Avocado. This initiative is part of Meta's ongoing efforts to enhance its artificial intelligence capabilities, as stated by Alexandr Wang, the company's AI chief.

Read full article

via Techmeme

AI Business21 hours ago

Anthropic Launches Skills Open Standard for Claude

NeutralArtificial Intelligence

Anthropic has launched the Skills Open Standard for its AI model Claude, which aims to enhance the autonomy of AI agents in performing tasks independently. This development signifies a strategic move towards improving AI capabilities and user interaction.

Read full article

via AI Business

VentureBeat — AIa day ago

Anthropic launches enterprise ‘Agent Skills’ and opens the standard, challenging OpenAI in workplace AI

PositiveArtificial Intelligence

Anthropic has announced the launch of its Agent Skills technology as an open standard, aiming to enhance the capabilities of AI assistants in the enterprise software market. This initiative includes organization-wide management tools and a directory of partner-built skills from notable companies such as Atlassian, Figma, and Canva.

Read full article

via VentureBeat — AI

Silicon Republica day ago

Bloomberg: OpenAI, Anthropic to expand Dublin office space

NeutralArtificial Intelligence

OpenAI and Anthropic are set to expand their office spaces in Dublin, following OpenAI's establishment in the city in 2023 and Anthropic's entry in 2024. This expansion reflects their growing presence in the European market for artificial intelligence development.

Read full article

via Silicon Republic

arXiv — cs.CVa day ago

Benchmarking Gaslighting Negation Attacks Against Reasoning Models

NegativeArtificial Intelligence

Recent research evaluated the vulnerability of leading reasoning models, including OpenAI's o4-mini, Claude-3.7-Sonnet, and Gemini-2.5-Flash, to gaslighting negation attacks, which significantly reduced their accuracy by 25-29% on average across multimodal benchmarks like MMMU, MathVista, and CharXiv. This highlights a critical gap in the robustness of these advanced AI systems against manipulative inputs.

Read full article

via arXiv — cs.CV

arXiv — cs.LGa day ago

Prompt Repetition Improves Non-Reasoning LLMs

PositiveArtificial Intelligence

Recent research indicates that repeating input prompts can enhance the performance of non-reasoning large language models (LLMs) such as Gemini, GPT, Claude, and Deepseek, without increasing the number of generated tokens or latency. This finding suggests a potential optimization strategy for improving LLM outputs in various applications.

Read full article

via arXiv — cs.LG

404 Media2 days ago

Podcast: Is Wiping a Phone a Crime?

NeutralArtificial Intelligence

A man has been charged for allegedly wiping his phone before U.S. Customs and Border Protection (CBP) could conduct a search, raising questions about the legality of such actions. The incident highlights ongoing tensions regarding digital privacy and law enforcement's ability to access personal devices.

Read full article

via 404 Media

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about