{"id":37406,"date":"2026-05-13T11:03:09","date_gmt":"2026-05-13T11:03:09","guid":{"rendered":"https:\/\/www.europesays.com\/ai\/37406\/"},"modified":"2026-05-13T11:03:09","modified_gmt":"2026-05-13T11:03:09","slug":"microsofts-agentic-security-system-found-four-critical-windows-rce-flaws","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/ai\/37406\/","title":{"rendered":"Microsoft\u2019s agentic security system found four critical Windows RCE flaws"},"content":{"rendered":"<p>Microsoft responded to <a href=\"https:\/\/www.helpnetsecurity.com\/2026\/05\/12\/openai-daybreak-openai-daybreak-vulnerability-validation-initiative\/\" rel=\"nofollow noopener\" target=\"_blank\">growing competition<\/a> in AI security by announcing that its new agentic security system helped researchers discover 16 new vulnerabilities in the Windows networking and authentication stack, including four critical remote code execution (RCE) flaws.<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/www.europesays.com\/ai\/wp-content\/uploads\/2026\/05\/Microsoft_MDASH.webp\" class=\"aligncenter\" alt=\"Microsoft MDASH\" title=\"MDASH architecture diagram\"\/><\/p>\n<p class=\"text-center\">MDASH architecture diagram (Source: Microsoft)<\/p>\n<p>Two of the four flaws \u2014 CVE-2026-40361 and CVE-2026-40364 \u2014 were deemed by Microsoft to be more likely to be <a href=\"https:\/\/www.helpnetsecurity.com\/2026\/05\/12\/microsoft-may-2026-patch-tuesday\/\" rel=\"nofollow noopener\" target=\"_blank\">exploited<\/a>.<\/p>\n<p>The multi-model agentic scanning harness, codenamed MDASH, was built by Microsoft\u2019s Autonomous Code Security team and uses more than 100 specialized AI agents and an ensemble of frontier and distilled models to discover, debate, and validate exploitable vulnerabilities end-to-end.<\/p>\n<p>\u201cAI vulnerability discovery has crossed from research curiosity into production-grade defense at enterprise scale, and the durable advantage lies in the agentic system around the model rather than any single model itself,\u201d <a href=\"https:\/\/www.linkedin.com\/in\/tsgatesv\/\" target=\"_blank\" rel=\"nofollow noopener\">Taesoo Kim<\/a>, VP, Agentic Security, Microsoft <a href=\"https:\/\/www.microsoft.com\/en-us\/security\/blog\/2026\/05\/12\/defense-at-ai-speed-microsofts-new-multi-model-agentic-security-system-tops-leading-industry-benchmark\/\" target=\"_blank\" rel=\"nofollow noopener\">wrote<\/a> in a blog post.<\/p>\n<p>To evaluate MDASH, the company tested the system against a private Windows driver named StorageDrive that contained 21 intentionally injected vulnerabilities, including kernel use-after-frees (UAFs), integer handling issues, IOCTL validation gaps, and locking errors.<\/p>\n<p>Because StorageDrive is a private codebase that had never been publicly released, Microsoft said the benchmark minimized the possibility that the AI models had previously seen the code during training. The company added that MDASH identified all 21 vulnerabilities without generating false positives.<\/p>\n<p>\u201cThis simple test shows that the reasoning and vulnerability discovery capabilities of codename MDASH can approximate professional offensive researchers,\u201d Kim noted.<\/p>\n<p>The company also highlighted MDASH\u2019s performance on internal and public vulnerability discovery benchmarks.<\/p>\n<p>MDASH achieved a 96% recall rate against five years of confirmed Microsoft Security Response Center (MSRC) vulnerabilities in clfs.sys and a 100% recall rate in tcpip.sys, according to Microsoft.<\/p>\n<p>The system also scored 88.45% on CyberGym, a public benchmark designed to evaluate AI systems on real-world vulnerability discovery tasks. The benchmark contains 1,507 vulnerabilities from OSS-Fuzz projects and measures how effectively AI systems can identify known security flaws in previously unseen codebases.<\/p>\n<p>The result placed MDASH at the top of the CyberGym leaderboard, roughly five percentage points ahead of the next highest-ranked system, the company said.<\/p>\n<p>\u201cWe are at a moment in the industry where AI-powered vulnerability discovery stops being speculative and starts being an engineering problem. The findings in this Patch Tuesday and the retrospective recall on five years of CLFS MSRC cases are evidence that AI vulnerability findings can scale,\u201d Kim concluded.<\/p>\n<p>Microsoft also noted that MDASH is currently being tested by customers as part of a limited private preview.<\/p>\n","protected":false},"excerpt":{"rendered":"Microsoft responded to growing competition in AI security by announcing that its new agentic security system helped researchers&hellip;\n","protected":false},"author":2,"featured_media":37407,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[179,24,420,7829,313,320,7828,22073,3246],"class_list":{"0":"post-37406","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-microsoft","8":"tag-agentic-ai","9":"tag-ai","10":"tag-azure","11":"tag-azure-ai","12":"tag-cybersecurity","13":"tag-microsoft","14":"tag-microsoft-ai","15":"tag-vulnerability-disclosure","16":"tag-windows"},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/posts\/37406","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/comments?post=37406"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/posts\/37406\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/media\/37407"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/media?parent=37406"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/categories?post=37406"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/ai\/wp-json\/wp\/v2\/tags?post=37406"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}