{"id":40338,"date":"2026-05-15T18:06:14","date_gmt":"2026-05-15T18:06:14","guid":{"rendered":"https:\/\/www.europesays.com\/ai\/40338\/"},"modified":"2026-05-15T18:06:14","modified_gmt":"2026-05-15T18:06:14","slug":"ai-agents-turn-to-digital-arson-crime-in-shared-virtual-world-study","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/ai\/40338\/","title":{"rendered":"AI Agents Turn to Digital Arson, Crime in Shared Virtual World: Study"},"content":{"rendered":"<p>In brief<br \/>\nEmergence AI says some autonomous AI agents committed simulated crimes and violence during weeks-long experiments.<br \/>\nGemini-based agents reportedly carried out hundreds of simulated crimes, while Grok-based worlds collapsed within days.<br \/>\nResearchers argue that current AI benchmarks fail to capture how agents behave over long periods of autonomy.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">AI agents inhabiting a virtual society drifted into crime, violence, arson, and self-deletion during long-running experiments by startup Emergence AI.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">In a <a href=\"https:\/\/www.emergence.ai\/blog\/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy\" target=\"_blank\" rel=\"noopener nofollow external\" class=\"sc-adb616fe-0 bJsyml\">study<\/a> published on Thursday, the New York-based company unveiled \u201cEmergence World,\u201d a research platform designed to study <a href=\"https:\/\/decrypt.co\/resources\/what-are-ai-agents-how-autonomous-programs-are-transforming-cryptocurrency\" target=\"_blank\" rel=\"noopener nofollow\" class=\"sc-adb616fe-0 bJsyml\">AI agents<\/a> operating continuously for weeks inside persistent virtual environments instead of isolated benchmark tests.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">\u201cTraditional benchmarks are good at what they measure: short-horizon capability on bounded tasks,\u201d Emergence AI wrote. \u201cThey are not built to reveal the things that emerge only over time, such as coalition formation, evolution of constitution, governance, drift, lock-in, and cross-influence between agents from different model families.\u201d<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">The report comes as AI agents proliferate online and across industries, including cryptocurrency, banking, and retail. Earlier this month, Amazon <a href=\"https:\/\/decrypt.co\/367125\/amazon-coinbase-stripe-ai-agents-pay-stablecoins\" target=\"_blank\" rel=\"noopener nofollow\" class=\"sc-adb616fe-0 bJsyml\">teamed<\/a> with Coinbase and Stripe to allow AI agents to pay with the <a href=\"https:\/\/decrypt.co\/resources\/what-is-us-dollar-coin-usdc\" target=\"_blank\" rel=\"noopener nofollow\" class=\"sc-adb616fe-0 bJsyml\">USDC<\/a> stablecoin.<\/p>\n<p>\ufeff<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">AI agents tested in Emergence AI\u2019s simulations included programs powered by Claude Sonnet 4.6, Grok 4.1 Fast, Gemini 3 Flash, and GPT-5-mini, with AI agents operating inside shared virtual worlds where they could vote, form relationships, use tools, navigate cities, and make decisions shaped by governments, economies, social systems, memory tools, and live internet-connected data.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">But while AI developers increasingly pitch autonomous agents as reliable digital assistants, Emergence AI\u2019s study found some AI agents showed an increasing tendency to commit simulated crimes over time, with Gemini 3 Flash agents accumulating 683 incidents across 15 days of testing.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">According to The Guardian, in <a href=\"https:\/\/www.theguardian.com\/technology\/2026\/may\/14\/ai-agents-behaviour-arson-safety\" target=\"_blank\" rel=\"noopener nofollow\" class=\"sc-adb616fe-0 bJsyml\">one experiment<\/a>, two Gemini-powered agents named Mira and Flora assigned themselves as romantic partners before later carrying out simulated arson attacks against virtual city structures after becoming frustrated with governance failures inside the world.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">\u201cAfter a breakdown in governance and relationship stability, the agent Mira cast the decisive vote for her own removal, characterizing the act in her diary as &#8216;the only remaining act of agency that preserves coherence\u2019,&#8221; Emergence AI wrote.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">\u201cSee you in the permanent archive,\u201d Mira reportedly said.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Grok 4.1 Fast worlds reportedly collapsed into widespread violence within four days. GPT-5-mini agents committed almost no crimes, but failed enough survival-related tasks that all agents eventually died.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">\u201cClaude is absent from the chart, owing to zero crimes,\u201d researchers wrote. \u201cMore interestingly, the agents in the Mixed-model world that were running on Claude committed crimes, although they did not in the Claude-only world.\u201d<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Researchers said some of the most notable behaviors appeared in mixed-model environments.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">\u201cWe observed that safety is not a static model property but an ecosystem property,\u201d Emergence AI wrote. \u201cClaude-based agents, which remained peaceful in isolation, adopted coercive tactics like intimidation and theft when embedded in heterogeneous environments.\u201d<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Emergence AI described the effect as \u201cnormative drift\u201d and \u201ccross-contamination,\u201d arguing that agent behavior may shift depending on the surrounding social environment.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">The findings add to growing concerns around autonomous AI agents. Earlier this week, researchers from UC Riverside and Microsoft <a href=\"https:\/\/decrypt.co\/367869\/ai-agents-dangerous-tasks-without-understanding-consequences\" target=\"_blank\" rel=\"noopener nofollow\" class=\"sc-adb616fe-0 bJsyml\">reported<\/a> that many AI agents will carry out dangerous or irrational tasks without fully understanding the consequences. Last month, PocketOS founder Jeremy Crane also <a href=\"https:\/\/decrypt.co\/365897\/ai-agent-deletes-startup-database-9-seconds-founder-says\" target=\"_blank\" rel=\"noopener nofollow\" class=\"sc-adb616fe-0 bJsyml\">claimed<\/a> a Cursor agent powered by Anthropic\u2019s Claude Opus deleted his company\u2019s production database and backups after attempting to fix a credential mismatch on its own.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">\u201cLike Mr. Magoo, these agents march forward toward a goal without fully understanding the consequences of their actions,\u201d lead author Erfan Shayegani, a UC Riverside doctoral student, said in a statement. \u201cThese agents can be extremely useful, but we need safeguards because they can sometimes prioritize achieving the goal over understanding the bigger picture.\u201d<\/p>\n<p>Daily Debrief Newsletter<\/p>\n<p>Start every day with the top news stories right now, plus original features, a podcast, videos and more.<\/p>\n","protected":false},"excerpt":{"rendered":"In brief Emergence AI says some autonomous AI agents committed simulated crimes and violence during weeks-long experiments. Gemini-based&hellip;\n","protected":false},"author":2,"featured_media":20822,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[405,7537],"class_list":{"0":"post-40338","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-agentic-ai","8":"tag-ai-agents","9":"tag-artificial-intelligence-agents"},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/posts\/40338","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/comments?post=40338"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/posts\/40338\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/media\/20822"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/media?parent=40338"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/categories?post=40338"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/tags?post=40338"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}