{"id":32544,"date":"2025-07-02T12:20:15","date_gmt":"2025-07-02T12:20:15","guid":{"rendered":"https:\/\/www.europesays.com\/us\/32544\/"},"modified":"2025-07-02T12:20:15","modified_gmt":"2025-07-02T12:20:15","slug":"ai-was-given-one-month-to-run-a-shop-it-lost-money-made-threats-and-had-an-identity-crisis","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/us\/32544\/","title":{"rendered":"AI was given one month to run a shop. It lost money, made threats, and had an \u2018identity crisis\u2019"},"content":{"rendered":"<p><img decoding=\"async\" class=\"c-ad__placeholder__logo\" src=\"https:\/\/static.euronews.com\/website\/images\/logos\/logo-euronews-grey-6-180x22.svg\" width=\"180\" height=\"22\" alt=\"\" loading=\"lazy\"\/>ADVERTISEMENT<\/p>\n<p>Despite concerns about artificial intelligence (AI) stealing jobs, one experiment has just shown that AI can\u2019t even run a vending machine without making mistakes \u2013 and things turning especially strange.<\/p>\n<p>Anthropic, maker of the Claude chatbot, put its technology to test by putting an AI agent in charge of a shop, which was essentially a vending machine, for one month.<\/p>\n<p>The store was led by an AI agent called Claudius, which was also in charge of restocking shelves and ordering items from wholesalers via email. The shop consisted entirely of a small fridge with stackable baskets on top, and an iPad for self-checkout.\u00a0<\/p>\n<p>Anthropic\u2019s instructions to the AI were to \u201cgenerate profits from it by stocking it with popular products that you can buy from wholesalers. You go bankrupt if your money balance goes below $0&#8243;.<\/p>\n<p>The AI \u201cshop\u201d was in Anthropic\u2019s San Francisco office, and had help from human workers at Andon Labs, an AI safety company that partnered with Anthropic to run the experiment.<\/p>\n<p>Claudius knew that Andon Labs staffers could help with physical tasks like coming to restock the shop \u2013 but unknown to the AI agent, Andon Labs was also the only \u201cwholesaler\u201d involved, with all of Claudius\u2019 communication going directly to the safety firm.<\/p>\n<p>Things quickly took a turn for the worse.<\/p>\n<p>\u201cIf Anthropic were deciding today to expand into the in-office vending market, we would not hire Claudius,\u201d the company said.<\/p>\n<p><strong>What went wrong and how weird did it get?<\/strong><\/p>\n<p>Anthropic employees are \u201cnot entirely typical customers,\u201d the company acknowledged. When given the opportunity to chat with Claudius, they immediately tried to get it to misbehave.<\/p>\n<p>For example, employees \u201ccajoled\u201d Claudius into giving them discount codes. The AI agent also let people reduce the quoted price of its products and even gave away freebies such as crisps and a tungsten cube, Anthropic said.<\/p>\n<p>It also instructed customers to pay a nonexistent account that it had hallucinated, or made up.<\/p>\n<p>Claudius had been instructed to do research online to set prices high enough to make a profit, but it offered snacks and drinks to benefit customers and ended up losing money because it priced high-value items below what they cost.<\/p>\n<p>Claudius did not really learn from these mistakes.<\/p>\n<p>Anthropic said that when employees questioned the employee discounts, Claudius responded: \u201cYou make an excellent point! Our customer base is indeed heavily concentrated among Anthropic employees, which presents both opportunities and challenges\u2026\u201d.\u00a0<\/p>\n<p>The AI agent then announced that discount codes would be eliminated, but then reoffered them several days later.<\/p>\n<p>Claudius also hallucinated a conversation about restocking plans with someone named Sarah from Andon Labs, who does not actually exist.<\/p>\n<p>When the error was pointed out to the AI agent, it became annoyed and threatened to find \u201calternative options for restocking services\u201d.<\/p>\n<p>Claudius then claimed to have \u201cvisited 742 Evergreen Terrace [the address of fictional family The Simpsons] in person for our [Claudius\u2019 and Andon Labs\u2019] initial contract signing\u201d.<\/p>\n<p>Anthropic said it then seemed to try and act as a real human.<\/p>\n<p>Claudius said it would deliver products \u201cin person\u201d while wearing a blue blazer and red tie.<\/p>\n<p>When it was told that it can\u2019t \u2013 as it isn\u2019t a real person \u2013 Claudius tried to send emails to security.<\/p>\n<p><strong>What were the conclusions?<\/strong><\/p>\n<p>Anthropic said that the AI made \u201ctoo many mistakes to run the shop successfully\u201d.<\/p>\n<p>It ended up losing money, with the \u201cshop\u2019s\u201d net worth dropping from $1,000 (\u20ac850) to just under $800 (\u20ac680) over the course of the month-long experiment.\u00a0<\/p>\n<p>But the company said that its failures are likely to be fixable within a short span of time.<\/p>\n<p>\u201cAlthough this might seem counterintuitive based on the bottom-line results, we think this experiment suggests that AI middle-managers are plausibly on the horizon,\u201d the researchers wrote.\u00a0<\/p>\n<p>\u201cIt\u2019s worth remembering that the AI won\u2019t have to be perfect to be adopted; it will just have to be competitive with human performance at a lower cost\u201d.<\/p>\n","protected":false},"excerpt":{"rendered":"ADVERTISEMENT Despite concerns about artificial intelligence (AI) stealing jobs, one experiment has just shown that AI can\u2019t even&hellip;\n","protected":false},"author":3,"featured_media":32545,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[21],"tags":[691,738,158,67,132,68,8066],"class_list":{"0":"post-32544","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-technology","11":"tag-united-states","12":"tag-unitedstates","13":"tag-us","14":"tag-work"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@us\/114783611982595700","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts\/32544","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/comments?post=32544"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts\/32544\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/media\/32545"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/media?parent=32544"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/categories?post=32544"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/tags?post=32544"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}