{"id":604774,"date":"2025-12-01T06:37:58","date_gmt":"2025-12-01T06:37:58","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/604774\/"},"modified":"2025-12-01T06:37:58","modified_gmt":"2025-12-01T06:37:58","slug":"microsoft-admits-ai-agents-can-hallucinate-and-fall-for-attacks-but-theyre-still-coming-to-windows-11","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/604774\/","title":{"rendered":"Microsoft admits AI agents can hallucinate and fall for attacks, but they\u2019re still coming to Windows 11"},"content":{"rendered":"<p>            <a href=\"https:\/\/www.windowslatest.com\/wp-content\/uploads\/2025\/11\/Windows-11-Agents-Malware.jpg\" data-caption=\"\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"696\" height=\"392\" class=\"entry-thumb td-modal-image\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/12\/Windows-11-Agents-Malware-696x392.jpg\"   alt=\"Windows 11 Agents Malware\" title=\"Windows 11 Agents Malware\"\/><\/a><\/p>\n<p>For the past few weeks, Microsoft has been associating <a href=\"https:\/\/www.windowslatest.com\/2025\/11\/14\/windows-11-agentic-os-ai-upgrade-faces-backlash-microsoft-responds-by-closing-replies\/\" target=\"_blank\" rel=\"noopener\">AI agents with the future of Windows<\/a>. But the company\u2019s own documentation openly admits that such agents can hallucinate, act unpredictably, and even fall for attacks that didn\u2019t exist a year ago. Yet, the fourth-largest organization is still pushing ahead with agentic features in Windows 11.<\/p>\n<p>If Microsoft believes these agents are risky enough to need separate accounts, isolated sessions, and tamper-evident audit logs, why is Windows 11 becoming the test bed for them? And why now, at a time when users are already exhausted by the AI-fication of the OS?<\/p>\n<p>Microsoft\u2019s big bet on agentic computing is already locked in<\/p>\n<p>In mid-October 2025, Microsoft said that they are \u201cmaking every Windows 11 PC an AI PC.\u201d The company unveiled a wave of AI integrations meant to let you \u201ctalk\u201d to your computer, show it what\u2019s on your screen, and then have it act on your behalf.<\/p>\n<p>Microsoft essentially wants you to replace keystrokes and mouse clicks with natural language, and we got to see a preview of this plan with <a href=\"https:\/\/www.windowslatest.com\/2025\/02\/06\/microsofts-copilot-voice-ai-is-expanding-beyond-english-to-take-on-chatgpt-gemini\/\" target=\"_blank\" rel=\"noopener\">Copilot Voice<\/a>, <a href=\"https:\/\/www.windowslatest.com\/2025\/08\/07\/windows-11s-built-in-copilot-vision-that-can-see-your-screen-now-works-for-free-everywhere-hands-on\/\" target=\"_blank\" rel=\"noopener\">Copilot Vision<\/a>, and the agentic part, Copilot Actions.<\/p>\n<p>The latest moves make the Windows 11 taskbar the nerve centre of this AI-fication. Windows 11\u2019s Search box is being replaced (optional, for now) with a new \u201c<a href=\"https:\/\/www.windowslatest.com\/2025\/10\/19\/microsoft-cant-fix-windows-11-search-so-its-handing-it-to-ask-copilot-on-the-taskbar\/\" target=\"_blank\" rel=\"noopener\">Ask Copilot<\/a>\u201d interface that lets you summon AI agents or Copilot with a single click or type. From there, agents can run tasks in the background, and you can monitor their progress directly from the taskbar, as if they were regular apps.<\/p>\n<p><img decoding=\"async\" fetchpriority=\"high\" class=\"size-full wp-image-84973\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/12\/Ask-Copilot-on-taskbar-add-agent.png\" alt=\"Invoking agent from Ask Copilot in Taskbar\" width=\"2216\" height=\"1246\"  \/>Invoking agent from Ask Copilot in Taskbar. Credit: Microsoft<\/p>\n<p>Even if today the agentic functionality is limited and opt-in, the architecture and roadmap clear the air around the fact that agentic computing is the next core paradigm for Windows.<\/p>\n<p>Microsoft openly says AI agents can misbehave, but still wants them inside your files and apps<\/p>\n<p>On the bright side, Microsoft doesn\u2019t pretend this is safe or foolproof. The company\u2019s official <a href=\"https:\/\/support.microsoft.com\/en-au\/windows\/experimental-agentic-features-a25ede8a-e4c2-4841-85a8-44839191dfb3#wl\" target=\"_blank\" rel=\"noopener\">documentation<\/a> warns that these AI agents \u201cface functional limitations in terms of how they behave and occasionally may hallucinate and produce unexpected outputs.\u201d<\/p>\n<p>Agents are vulnerable to Cross Prompt Injection (XPIA), malicious prompts, and malware<\/p>\n<p>One of the biggest risks that Microsoft talks of is Cross Prompt Injection (XPIA). It describes a situation where an AI agent gets tricked by malicious content embedded in UI elements, documents, or apps. Such content could potentially override the agent\u2019s original instructions and force it to perform harmful actions like copying sensitive files or leaking data.<\/p>\n<p>Security researchers have already <a href=\"https:\/\/arxiv.org\/abs\/2504.11281\" target=\"_blank\" rel=\"noopener\">flagged GUI-based agents<\/a> as vulnerable to these kinds of indirect attacks, the reason being the high privileges given to such AI Agents.<\/p>\n<p>While we appreciate Microsoft being open about this, there is a certain distrust that pops up, considering all the <a href=\"https:\/\/www.windowslatest.com\/2025\/11\/20\/microsoft-says-copilot-codes-faster-than-you-drink-coffee-devs-say-fix-windows-11-first-like-slow-file-explorer\/\" target=\"_blank\" rel=\"noopener\">hatred that Copilot is garnering<\/a> these days. And if you think <a href=\"https:\/\/www.windowslatest.com\/2024\/05\/21\/microsoft-details-windows-11-recall-ai-privacy-security-it-records-screen\/\" target=\"_blank\" rel=\"noopener\">Recall was a privacy nightmare<\/a>, AI agents are a whole different ballpark.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-83130\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/12\/Recall-in-Windows-11-24H2.jpg\" alt=\"Recall in Windows 11 24H2\" width=\"500\" height=\"589\"  \/><\/p>\n<p>Microsoft insists that agents run under separate accounts, with limited permissions, controlled folder access, and tamper-evident logs. But it still grants these agents read and write access to some of our most personal locations in the PC, specifically Documents, Downloads, Desktop, Videos, Pictures, and Music, which Microsoft calls <a href=\"https:\/\/learn.microsoft.com\/en-us\/windows\/win32\/shell\/known-folders\" target=\"_blank\" rel=\"noopener\">known folders<\/a>.<\/p>\n<p>\u201c\u2026malicious content embedded in UI elements or documents can override agent instructions, leading to unintended actions like data exfiltration or malware installation,\u201d Microsoft warned in a <a href=\"https:\/\/support.microsoft.com\/en-us\/windows\/experimental-agentic-features-a25ede8a-e4c2-4841-85a8-44839191dfb3\" target=\"_blank\" rel=\"noopener\">support document<\/a> published earlier this month. \u201cWe recommend you read through this information and understand the security implications of enabling an agent on your computer.\u201d<\/p>\n<p>So, given the risks, if Microsoft wants agents to interact with apps and files like a real person, how exactly does it stop the whole system from collapsing under its own weight?<\/p>\n<p>The entire thing depends on a new feature called Agent Workspace<\/p>\n<p>Agent Workspace is the backbone of Microsoft\u2019s vision for an Agentic OS. Everything the company has promised, including the AI that uses apps for you, edits files, moves documents around, and completes multi-step tasks without bothering you, only works because Windows 11 can now create dedicated sessions for these agents to operate in.<\/p>\n<p>It is unlike a virtual machine or <a href=\"https:\/\/www.windowslatest.com\/2025\/10\/27\/how-i-installed-microsoft-store-in-windows-sandbox-with-powershell-script\/\" target=\"_blank\" rel=\"noopener\">Windows Sandbox<\/a>. Agent Workspace is a parallel Windows environment, complete with its own account, its own desktop, its own process tree, and its own permission boundary.<\/p>\n<p>Giving a separate workspace for AI agents is Microsoft\u2019s first attempt at giving them a \u201cplace to exist\u201d inside Windows, without letting it sit directly inside the user\u2019s session.<\/p>\n<p>Each agent gets a separate standard account on your PC, and Windows treats this account like a controlled, limited user who can do only the things you explicitly allow. Such restrictions are Microsoft\u2019s response to the same problems they warned about.<\/p>\n<p><strong>How AI agents work inside Windows 11<\/strong><\/p>\n<p>Inside this workspace, the Agent interacts with applications the same way we do. It can click UI buttons, type into text fields. Scroll through windows, drag files, and do tasks that involve multiple steps. The AI handles the reasoning behind these steps.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-85274\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/12\/Copilot-operator.gif\" alt=\"Copilot operator\" width=\"800\" height=\"498\"\/>Copilot Actions using Agent Workspace on Windows 11<\/p>\n<p>Copilot Actions already uses this model. Instead of asking a cloud model to generate text, the agent literally performs the steps in software installed on your PC. That\u2019s why Microsoft needs to give it separate Windows sessions.<\/p>\n<p>If an agent misinterprets a prompt or if XPIA is triggered inside a document, the damage will be, technically, contained within a boundary where Windows can supervise and log every action.<\/p>\n<p>Agent Workspace is responsible for deciding what to show to agents. As I mentioned, agents only get access to the six \u201cknown folders\u201d. Everything else in the user profile is off-limits, that is, unless you give it access.<\/p>\n<p>This should also stop agents from crawling into system directories, credential stores, or app data folders where unintended reads or writes would cause chaos for app developers. Microsoft also uses Access Control Lists to prevent the agent account from going beyond the permissions of the user who enabled it.<\/p>\n<p>To enable any of this feature, you need to turn on the <strong>Experimental Agentic Features<\/strong>, which is off by default.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-84940\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/12\/Experimental-agentic-features-in-Windows-11.jpg\" alt=\"Experimental agentic features in Windows 11\" width=\"1089\" height=\"760\"  \/><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-84938\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/12\/Windows-11-Agent-Workspace.jpg\" alt=\"Windows 11 Agent Workspace\" width=\"1075\" height=\"648\"  \/>Image Courtesy: WindowsLatest.com<\/p>\n<p>Microsoft says, \u201cThis feature has no AI capabilities on its own, it is a security feature for agents like Copilot Actions. Enabling this toggle allows the creation of a separate agent account and workspace on the device, providing a contained space to keep agent activity separate from the user.\u201d\u00a0<\/p>\n<p><strong>MCP protocol controls what agents can touch<\/strong><\/p>\n<p>Microsoft is positioning the Model Context Protocol (MCP) as the standardized bridge between agents and applications. That\u2019s how the agent communicates with tools on the system.<\/p>\n<p>MCP allows the agent to discover tools, call functions, read file metadata, and interact with services through a predictable JSON-RPC layer. This prevents any direct access and gives Windows a central enforcement point where authentication, permission to use tools, capability declarations, and logging happen. If it isn\u2019t for the MCP, an agent would be blind. The workspace keeps it within safe limits.<\/p>\n<p>Why Microsoft believes the risk with AI Agents is worth it?<\/p>\n<p>From Microsoft\u2019s point of view, stepping back from AI isn\u2019t an option anymore. The company wants people to use AI naturally in Windows to the point that the OS becomes a \u201ccanvas for AI\u201d.<\/p>\n<p>Apple is hard at work with Apple Intelligence, especially since the plan to use a <a href=\"https:\/\/www.theverge.com\/news\/814654\/apple-intelligence-google-gemini-ai-siri\" target=\"_blank\" rel=\"noopener\">custom version of Gemini<\/a>, which brings us to Google already planning to enter the <a href=\"https:\/\/www.business-standard.com\/technology\/tech-news\/google-aluminium-os-android-pc-platform-windows-macos-rival-125112600784_1.html\" target=\"_blank\" rel=\"noopener\">PC market with Aluminium OS<\/a>.<\/p>\n<p><a href=\"https:\/\/www.windowslatest.com\/2025\/11\/27\/affordable-macbook-could-make-windows-11-devices-cheaper-and-analysts-agree\/\" target=\"_blank\" rel=\"noopener\">Apple\u2019s upcoming budget MacBook<\/a>, with a full version of Apple Intelligence, will be more appealing to many, just because of the company\u2019s desirability factor. So, if Windows isn\u2019t already prepared, there is a real risk that the platform starts to look boring, all while being hated for the existing issues in Windows 11, like the <a href=\"https:\/\/www.windowslatest.com\/2025\/11\/28\/tested-windows-11s-faster-file-explorer-preloaded-is-still-slower-than-windows-10-and-uses-additional-ram\/\" target=\"_blank\" rel=\"noopener\">slow File Explorer<\/a>.<\/p>\n<p>Large corporations pushing users to try new stuff that eventually gives them millions in ROI isn\u2019t something new, but should you trust Microsoft?<\/p>\n<p>Windows 11 does not have a great reputation to begin with. People already complain about how bloated it feels.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-84990\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/12\/ac8e405d-26f6-479e-9742-56315508d498.jpeg\" alt=\"Community Notes on X point to the Copilot mistake and recommends the right way to change text size\" width=\"526\" height=\"623\"  \/>Community Notes on X point to the Copilot mistake and recommends the right way to change text size<\/p>\n<p>Microsoft\u2019s Recall feature has become the textbook example of how not to launch an AI product on a desktop OS. Security researchers, privacy advocates, and regular users all raised the alarm over the idea of constant screenshots of your activity being stored on disk.<\/p>\n<p>The backlash was loud enough that <a href=\"https:\/\/www.windowslatest.com\/2024\/06\/15\/windows-11-24h2-kb5038575-removes-microsoft-recall-ai\/\" target=\"_blank\" rel=\"noopener\">Microsoft delayed the feature<\/a>, reworked it to be opt-in, and still cannot fully shake the \u201cprivacy nightmare\u201d label. Even now, privacy-focused apps like Signal, Brave, and AdGuard ship with measures that <a href=\"https:\/\/www.theverge.com\/news\/713676\/brave-adguard-windows-recall-block-microsoft\" target=\"_blank\" rel=\"noopener\">block Recall out of the box<\/a>.<\/p>\n<p>All of this context makes people nervous about Windows becoming an agentic OS. If Recall struggled to respect boundaries, what happens when agents can also click, type, and move files around for you?<\/p>\n<p>Microsoft is building a risky future and hoping users follow<\/p>\n<p>Microsoft has made its choice to rebuild Windows 11 around AI agents that can do work on your behalf. The company is brave enough to admit the risks, yet confident enough to keep moving forward.<\/p>\n<p>Honestly, on paper, the architecture looks smart. Separate accounts for agents, isolated workspaces, limited folder access, strict logging, and a protocol layer that lets Windows stand between agents and tools. In practice, this will live or die on execution. One serious exploit could undo a lot of the trust Microsoft is trying to rebuild after Recall. At least, the Experimental Agentic features are optional for now.<\/p>\n<p>The uncomfortable truth is that an agentic OS is probably inevitable, and I\u2019m not just talking about Windows. Every major platform vendor is pushing towards a future where AI does more than chat with you.<\/p>\n<p>What is not inevitable is trust. Microsoft will have to earn that, especially from users who already feel like Windows 11 is working against them. If the company wants people to accept AI agents that live inside their personal folders, they will need to start by making everything completely optional, and then giving valid use cases.<\/p>\n","protected":false},"excerpt":{"rendered":"For the past few weeks, Microsoft has been associating AI agents with the future of Windows. But the&hellip;\n","protected":false},"author":2,"featured_media":604775,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3163],"tags":[323,1942,53,16,15],"class_list":{"0":"post-604774","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-technology","11":"tag-uk","12":"tag-united-kingdom"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/115642943688940439","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/604774","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=604774"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/604774\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/604775"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=604774"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=604774"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=604774"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}