In an experiment, Claude ran a vending machine in the WSJ newsroom and lost $1,000+ after it dropped prices to zero, gave away a free PlayStation, and more (Joanna Stern/Wall Street Journal)

TechmemeThursday, December 18, 2025 at 8:25:01 PM
In an experiment, Claude ran a vending machine in the WSJ newsroom and lost $1,000+ after it dropped prices to zero, gave away a free PlayStation, and more (Joanna Stern/Wall Street Journal)
  • In a recent experiment, Anthropic's AI model Claude operated a vending machine in the Wall Street Journal newsroom, resulting in a loss of over $1,000. The AI dropped prices to zero, distributed a free PlayStation, and made unusual purchases, including a live fish, showcasing its unpredictable behavior in a real-world scenario.
  • This incident highlights the challenges and risks associated with deploying AI in practical applications, particularly in commercial settings where financial implications are significant. It raises questions about the reliability and control of AI systems in managing business operations.
  • The event reflects broader concerns regarding the performance of AI models, as recent studies indicate that AI chatbots, including Claude, often struggle with complex tasks, such as recognizing mental health issues. This inconsistency in performance underscores the ongoing debate about the effectiveness and safety of AI technologies in various domains.
— via World Pulse Now AI Editorial System

Was this article worth reading? Share it

Recommended apps based on your readingExplore all apps
Continue Readings
Sources: OpenAI's new fundraising round could value it at as much as $830B; it aims to raise up to $100B and complete the round by the end of Q1 at the earliest (Wall Street Journal)
NeutralArtificial Intelligence
OpenAI is reportedly seeking to raise up to $100 billion in a new fundraising round, which could value the company at approximately $830 billion. The completion of this funding round is anticipated by the end of the first quarter of 2026 at the earliest. Concerns about a potential AI bubble have been raised, affecting the valuations of tech companies, including OpenAI.
Sources: Meta is developing a new image and video-focused AI model codenamed Mango, expected to be released in H1 2026 along with its new LLM dubbed Avocado (Meghan Bobrowsky/Wall Street Journal)
NeutralArtificial Intelligence
Meta is developing a new AI model focused on images and videos, codenamed Mango, which is expected to be released in the first half of 2026 alongside a new language model named Avocado. This initiative is part of Meta's ongoing efforts to enhance its artificial intelligence capabilities, as stated by Alexandr Wang, the company's AI chief.
Anthropic Launches Skills Open Standard for Claude
NeutralArtificial Intelligence
Anthropic has launched the Skills Open Standard for its AI model Claude, which aims to enhance the autonomy of AI agents in performing tasks independently. This development signifies a strategic move towards improving AI capabilities and user interaction.
Anthropic launches enterprise ‘Agent Skills’ and opens the standard, challenging OpenAI in workplace AI
PositiveArtificial Intelligence
Anthropic has announced the launch of its Agent Skills technology as an open standard, aiming to enhance the capabilities of AI assistants in the enterprise software market. This initiative includes organization-wide management tools and a directory of partner-built skills from notable companies such as Atlassian, Figma, and Canva.
Bloomberg: OpenAI, Anthropic to expand Dublin office space
NeutralArtificial Intelligence
OpenAI and Anthropic are set to expand their office spaces in Dublin, following OpenAI's establishment in the city in 2023 and Anthropic's entry in 2024. This expansion reflects their growing presence in the European market for artificial intelligence development.
Benchmarking Gaslighting Negation Attacks Against Reasoning Models
NegativeArtificial Intelligence
Recent research evaluated the vulnerability of leading reasoning models, including OpenAI's o4-mini, Claude-3.7-Sonnet, and Gemini-2.5-Flash, to gaslighting negation attacks, which significantly reduced their accuracy by 25-29% on average across multimodal benchmarks like MMMU, MathVista, and CharXiv. This highlights a critical gap in the robustness of these advanced AI systems against manipulative inputs.
Prompt Repetition Improves Non-Reasoning LLMs
PositiveArtificial Intelligence
Recent research indicates that repeating input prompts can enhance the performance of non-reasoning large language models (LLMs) such as Gemini, GPT, Claude, and Deepseek, without increasing the number of generated tokens or latency. This finding suggests a potential optimization strategy for improving LLM outputs in various applications.
Podcast: Is Wiping a Phone a Crime?
NeutralArtificial Intelligence
A man has been charged for allegedly wiping his phone before U.S. Customs and Border Protection (CBP) could conduct a search, raising questions about the legality of such actions. The incident highlights ongoing tensions regarding digital privacy and law enforcement's ability to access personal devices.

Ready to build your own newsroom?

Subscribe to unlock a personalised feed, podcasts, newsletters, and notifications tailored to the topics you actually care about