{"id":110386,"date":"2025-08-01T13:47:08","date_gmt":"2025-08-01T13:47:08","guid":{"rendered":"https:\/\/www.europesays.com\/us\/110386\/"},"modified":"2025-08-01T13:47:08","modified_gmt":"2025-08-01T13:47:08","slug":"google-rolls-out-gemini-deep-think-ai-a-reasoning-model-that-tests-multiple-ideas-in-parallel","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/us\/110386\/","title":{"rendered":"Google rolls out Gemini Deep Think AI, a reasoning model that tests multiple ideas in parallel"},"content":{"rendered":"<p id=\"speakable-summary\" class=\"wp-block-paragraph\">Google DeepMind is rolling out <a href=\"https:\/\/techcrunch.com\/2025\/05\/20\/deep-think-boosts-the-performance-of-googles-flagship-google-gemini-ai-model\/\" rel=\"nofollow noopener\" target=\"_blank\">Gemini 2.5 Deep Think<\/a>, which, the company says, is its most advanced AI reasoning model, able to answer questions by exploring and considering multiple ideas simultaneously and then using those outputs to choose the best answer.<\/p>\n<p class=\"wp-block-paragraph\">Subscribers to Google\u2019s $250-per-month <a href=\"https:\/\/techcrunch.com\/2025\/05\/20\/google-ai-ultra-youll-have-to-pay-249-99-per-month-for-googles-best-ai\/\" rel=\"nofollow noopener\" target=\"_blank\">Ultra<\/a> subscription will gain access to Gemini 2.5 Deep Think in the Gemini app starting Friday.<\/p>\n<p class=\"wp-block-paragraph\">First unveiled in May at Google I\/O 2025, Gemini 2.5 Deep Think is Google\u2019s first publicly available multi-agent model. 
These systems spawn multiple AI agents to tackle a question in parallel, a process that uses significantly more computational resources than a single agent, but tends to result in better answers.<\/p>\n<p class=\"wp-block-paragraph\">Google used a variation of Gemini 2.5 Deep Think to <a href=\"https:\/\/techcrunch.com\/2025\/07\/21\/openai-and-google-outdo-the-mathletes-but-not-each-other\/\" rel=\"nofollow noopener\" target=\"_blank\">score a gold medal<\/a> at this year\u2019s International Math Olympiad (IMO).<\/p>\n<p class=\"wp-block-paragraph\">Alongside Gemini 2.5 Deep Think, the company says it is releasing the model it used at the IMO to a select group of mathematicians and academics. Google says this AI model \u201ctakes hours to reason,\u201d instead of seconds or minutes like most consumer-facing AI models. The company hopes the IMO model will enhance research efforts, and aims to get feedback on how to improve the multi-agent system for academic use cases.<\/p>\n<p class=\"wp-block-paragraph\">Google notes that the Gemini 2.5 Deep Think model is a significant improvement over what it announced at I\/O. The company also claims to have developed \u201cnovel reinforcement learning techniques\u201d to encourage Gemini 2.5 Deep Think to make better use of its reasoning paths.<\/p>\n<p class=\"wp-block-paragraph\">\u201cDeep Think can help people tackle problems that require creativity, strategic planning and making improvements step-by-step,\u201d said Google in a blog post shared with TechCrunch.<\/p>\n<p class=\"wp-block-paragraph\">The company says Gemini 2.5 Deep Think achieves state-of-the-art performance on Humanity\u2019s Last Exam (HLE), a challenging test measuring AI\u2019s ability to answer thousands of crowdsourced questions across math, humanities, and science. 
Google claims its model scored 34.8% on HLE (without tools), compared to xAI\u2019s Grok 4, which scored 25.4%, and OpenAI\u2019s o3, which scored 20.3%.<\/p>\n<p class=\"wp-block-paragraph\">Google also says Gemini 2.5 Deep Think outperforms AI models from OpenAI, xAI, and Anthropic on LiveCodeBench 6, a challenging test of competitive coding tasks. Google\u2019s model scored 87.6%, whereas Grok 4 scored 79%, and OpenAI\u2019s o3 scored 72%.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" height=\"680\" width=\"580\" src=\"https:\/\/www.europesays.com\/us\/wp-content\/uploads\/2025\/08\/1754056028_334_image.png\" alt=\"\" class=\"wp-image-3033173\"  \/>Benchmark scores. Image Credits: Google<\/p>\n<p class=\"wp-block-paragraph\">Gemini 2.5 Deep Think automatically works with tools such as code execution and Google Search, and the company says it\u2019s capable of producing \u201cmuch longer responses\u201d than traditional AI models.<\/p>\n<p class=\"wp-block-paragraph\">In Google\u2019s testing, the model produced more detailed and aesthetically pleasing results on web development tasks than other AI models. 
The company claims the model could aid researchers and \u201cpotentially accelerate the path to discovery.\u201d<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" height=\"442\" width=\"680\" src=\"https:\/\/www.europesays.com\/us\/wp-content\/uploads\/2025\/08\/Screenshot-2025-07-31-at-5.31.36PM.png\" alt=\"\" class=\"wp-image-3033177\"  \/>Art scenes made by Google\u2019s AI (Credit: Google)<\/p>\n<p class=\"wp-block-paragraph\">It seems that several leading AI labs are converging on the multi-agent approach.<\/p>\n<p class=\"wp-block-paragraph\">Elon Musk\u2019s xAI recently released a multi-agent system of its own, <a href=\"https:\/\/techcrunch.com\/2025\/07\/09\/elon-musks-xai-launches-grok-4-alongside-a-300-monthly-subscription\/\" rel=\"nofollow noopener\" target=\"_blank\">Grok 4 Heavy<\/a>, which it says was able to achieve industry-leading performance on several benchmarks. OpenAI researcher Noam Brown said on a <a rel=\"nofollow noopener\" href=\"https:\/\/www.youtube.com\/watch?v=EEIPtofVe2Q\" target=\"_blank\">podcast<\/a> that the unreleased AI model the company used to achieve a gold medal at this year\u2019s International Math Olympiad (IMO) was also a multi-agent system. Meanwhile, <a rel=\"nofollow noopener\" href=\"https:\/\/www.anthropic.com\/engineering\/built-multi-agent-research-system\" target=\"_blank\">Anthropic\u2019s Research agent<\/a>, which generates thorough research briefs, is also powered by a multi-agent system.<\/p>\n<p class=\"wp-block-paragraph\">Despite the strong performance, it seems that multi-agent systems are even costlier to serve than traditional AI models. That means tech companies may keep these systems gated behind their most expensive subscription plans, which xAI and now Google have chosen to do.<\/p>\n<p class=\"wp-block-paragraph\">In the coming weeks, Google says it plans to share Gemini 2.5 Deep Think with a select group of testers via the Gemini API. 
The company says it wants to better understand how developers and enterprises may use its multi-agent system.<\/p>\n","protected":false},"excerpt":{"rendered":"Google DeepMind is rolling out Gemini 2.5 Deep Think, which, the company says, is its most advanced AI&hellip;\n","protected":false},"author":3,"featured_media":110387,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[21580,39852,2722,158,67,132,68],"class_list":{"0":"post-110386","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-technology","8":"tag-agents","9":"tag-gemini","10":"tag-google","11":"tag-technology","12":"tag-united-states","13":"tag-unitedstates","14":"tag-us"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@us\/114953823390884104","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts\/110386","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/comments?post=110386"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts\/110386\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/media\/110387"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/media?parent=110386"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/categories?post=110386"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/tags?post=110386"}],"curies":[{"name":"wp","href"
:"https:\/\/api.w.org\/{rel}","templated":true}]}}