{"id":614352,"date":"2025-12-05T18:39:14","date_gmt":"2025-12-05T18:39:14","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/614352\/"},"modified":"2025-12-05T18:39:14","modified_gmt":"2025-12-05T18:39:14","slug":"worlds-first-fully-agentic-ai-smartphone-is-this-chinas-second-deepseek-moment-technology-news","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/614352\/","title":{"rendered":"World\u2019s first fully agentic AI smartphone: Is this China\u2019s second DeepSeek moment? | Technology News"},"content":{"rendered":"<p>China is moving ahead briskly in the AI arms race. While the rest of the world has been seeing an influx of AI-driven smartphone features, mainly voice assistants and app-by-app interactions, China has taken a major leap. ZTE, a Shenzhen-based multinational telecom company, has introduced a smartphone powered by an AI agent. Built in collaboration with ByteDance, the device features an agent that doesn\u2019t just live inside apps but is integrated directly into the operating system. Its most striking capability is that it can operate the smartphone the same way a human would.<\/p>\n<p>Taylor Ogan, an entrepreneur from Shenzhen, took to his X (formerly Twitter) account to share the prototype named Nubia M153. The smartphone runs on a customised version of Android integrated with ByteDance\u2019s Doubao AI agent. For the uninitiated, Doubao is ByteDance\u2019s proprietary large-scale general-purpose AI model ecosystem that is widely deployed across China as a chatbot and tool for productivity.<\/p>\n<p><img class=\"lazyloading\" decoding=\"async\" data-lazy-type=\"lazyloading-image\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/04\/track_1x1.jpg\" data-lazy-src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/04\/track_1x1.jpg\" alt=\"\" width=\"1px\" height=\"1px\" style=\"display:none;\"\/><\/p>\n<p>This prototype is much more than a normal on-device assistant. Ogan\u2019s demo showed that the AI has full-stack control of the phone, meaning it can see the user interface, open apps, download apps, tap and type on screen, make calls, and execute multi-step tasks without the user having to know which apps are required. In simple words, the AI here uses the phone just like a human user would and not like an app would.\u00a0<\/p>\n<p><strong>What does the Agentic AI smartphone do?<\/strong><\/p>\n<p>Ogan began his thread showing him requesting the AI to find someone to wait in line for him. While this is not a norm yet in India, China\u2019s gig economy apps commonly offer queue-standing services to people at hospitals, government offices, and other venues with high demand. Ogan is seen asking the AI in English, to which it responds immediately. The AI can be seen choosing which local service app, configuring the task, filling the necessary fields, and offering a final confirmation screen. The CEO in his short video admits that he would not have known which app handled that job or how to set it up. The video shows the AI agent doing the entire process autonomously.\u00a0<\/p>\n<blockquote class=\"twitter-tweet\" data-media-max-width=\"560\">\n<p dir=\"ltr\" lang=\"en\">Another DeepSeek moment. This is the world\u2019s first actual smart phone. It\u2019s an engineering prototype of ZTE\u2019s Nubia M153 running ByteDance\u2019s Doubao AI agent fused into Android at the OS level. It has complete control over the phone. It can see the UI, choose\/download apps,\u2026 <a href=\"https:\/\/t.co\/lM9PYMoQek\" class=\"\" rel=\"nofollow, noopener\" target=\"_blank\">pic.twitter.com\/lM9PYMoQek<\/a><\/p>\n<p>\u2014 Taylor Ogan (@TaylorOgan) <a href=\"https:\/\/twitter.com\/TaylorOgan\/status\/1996538308697137277?ref_src=twsrc%5Etfw\" class=\"\" rel=\"nofollow, noopener\" target=\"_blank\">December 4, 2025<\/a><\/p>\n<\/blockquote>\n<p>This is groundbreaking, as most current AI assistants seen on smartphones can reason about tasks but cannot navigate through third-party apps on behalf of a user. Although Samsung, Apple, and other tech giants have been experimenting with AI actions, they are largely permission gated and limited to only partner apps. The ZTE-ByteDance prototype here is much ahead, as it allows its AI to act directly within the Graphical User Interface (GUI) as if it were a human.\u00a0<\/p>\n<p><strong>The hardware behind the Agentic AI<\/strong><\/p>\n<p>Ogan, in his thread, revealed that the prototype is powered by Qualcomm\u2019s new Snapdragon 8 Elite Gen 5 chipset with 16 GB of RAM. This is key as the agent divides its workload between cloud-based semantic reasoning and on-device screen control. According to the OP, running the \u2018vision of the screen\u2019 locally allows the AI to move quickly and maintain privacy for the sensitive UI interactions like payment flows and passwords.\u00a0<\/p>\n<p>When it comes to the AI model, ByteDance\u2019s Doubao is currently being used by over 175 million people in China. It is essentially a large, sparse Mixture-of-Experts model with multimodal, meaning text and vision, support. In the second instance, when Ogan clicks a picture of a NIO battery-swap station and asks, \u201cWhat is this thing?\u201d The model identifies the station from the image and links it to NIO\u2019s national EV-charging network and goes on to explain how it works.\u00a0<\/p>\n<blockquote class=\"twitter-tweet\" data-media-max-width=\"560\">\n<p dir=\"ltr\" lang=\"en\">This isn\u2019t a chat overlay, it\u2019s a true multimodal agent. It has the brand-new Snapdragon 8 Elite Gen 5 with 16GB RAM, so it can push a lot of the agentic workload on-device. Here I take a picture of a NIO battery swap station and ask, \u201cWhat is this thing?\u201d It\u2019s running\u2026 <a href=\"https:\/\/t.co\/b0rg7iJX3l\" class=\"\" rel=\"nofollow, noopener\" target=\"_blank\">pic.twitter.com\/b0rg7iJX3l<\/a><\/p>\n<p>\u2014 Taylor Ogan (@TaylorOgan) <a href=\"https:\/\/twitter.com\/TaylorOgan\/status\/1996539031421940124?ref_src=twsrc%5Etfw\" class=\"\" rel=\"nofollow, noopener\" target=\"_blank\">December 4, 2025<\/a><\/p>\n<\/blockquote>\n<p>\u00a0<\/p>\n<p><b>Cloud + on-device architecture<\/b><\/p>\n<p>Perhaps the coolest demonstration is that of booking a hotel. The CEO takes a single picture of the hotel entrance; he says nothing more than his intent to book a stay. The AI understands the assignment and divides its workloads.\u00a0<\/p>\n<p>Story continues below this ad<\/p>\n<p>Firstly, Doubao (cloud) translates the semantics, such as which hotel it is, that he wants to book for tonight, and that pet policies matter. Secondly, Nebula-GUI (on-device), which is reportedly a 7-billion-parameter model trained by ZTE, takes care of the physical actions such as opening a Ctrip (Chinese booking app), entering dates, locating the best rate, looking through the app for pet policies, and informing Ogan if dogs are allowed or not.<\/p>\n<p>Based on the demo, this two-layer architecture is what allows the task to run smoothly. In simple terms, Doubao plans and Nebula-GUI executes it.<\/p>\n<p><b>App-level knowledge and interaction with other bots<\/b><\/p>\n<p>In another demo, the agent is asked to book a robotaxi, and Doubao uses GPS data and looks for local ride-hailing apps to decide which operator serves the particular route. On Ogan\u2019s phone, Nebula-GUI opens the Baidu Apollo app, navigates through its menus, selects pickup points, and confirms the trip. Sometime later, Ogan asks it to change the drop-off location mid-ride. Again, the AI recognises the active Apollo session, opens the correct screen, changes the destination, and fires up a confirmation both on the phone and inside the robotaxi itself. This is a fine demonstration of the AI\u2019s app-specific knowledge.<\/p>\n<blockquote class=\"twitter-tweet\" data-media-max-width=\"560\">\n<p dir=\"ltr\" lang=\"en\">This is where it stops feeling like \u201cvoice commands\u201d and starts feeling like a real assistant. I don\u2019t remember which number I logged into the Baidu Apollo robotaxi app. Doubao digs into the app\u2019s settings and tells me the last four digits of this account\u2019s phone number so I can\u2026 <a href=\"https:\/\/t.co\/FT9I9q3QMi\" class=\"\" rel=\"nofollow, noopener\" target=\"_blank\">pic.twitter.com\/FT9I9q3QMi<\/a><\/p>\n<p>\u2014 Taylor Ogan (@TaylorOgan) <a href=\"https:\/\/twitter.com\/TaylorOgan\/status\/1996542159521223034?ref_src=twsrc%5Etfw\" class=\"\" rel=\"nofollow, noopener\" target=\"_blank\">December 4, 2025<\/a><\/p>\n<\/blockquote>\n<p>During the demo, when Ogan forgets the phone number linked to his Apollo account, the AI navigates the app\u2019s settings and brings the last four digits. Now, this is something most AI assistants will not be able to do unless they have access and deep OS-level visibility.\u00a0<\/p>\n<p>Meanwhile, in another test, Ogan uses Meituan, a Chinese tech company that offers on-demand drone delivery services. He asks the agent to order two drinks, and it updates his cart, makes the payment, and arranges delivery to a nearby locker. And, when Meituan\u2019s automated system makes a confirmation call, Doubao answers on his behalf and speaks to Meituan\u2019s bot. Thus, both the bots complete the exchange without any user intervention. This is an example of how agents can negotiate with other agents on behalf of a user.\u00a0<\/p>\n<blockquote class=\"twitter-tweet\" data-media-max-width=\"560\">\n<p dir=\"ltr\" lang=\"en\">I tell it to order me two of the drinks in front of me. It reuses the cart, updates quantity, pays, and a Meituan drone flies the order to a nearby locker. When Meituan\u2019s automated phone system calls to say the delivery arrived, Doubao auto-answers and talks to their bot on my\u2026 <a href=\"https:\/\/t.co\/rpGvGUVOvA\" class=\"\" rel=\"nofollow, noopener\" target=\"_blank\">pic.twitter.com\/rpGvGUVOvA<\/a><\/p>\n<p>\u2014 Taylor Ogan (@TaylorOgan) <a href=\"https:\/\/twitter.com\/TaylorOgan\/status\/1996545229600797164?ref_src=twsrc%5Etfw\" class=\"\" rel=\"nofollow, noopener\" target=\"_blank\">December 4, 2025<\/a><\/p>\n<\/blockquote>\n<p>Ogan admits that through his walk, he uses the device as a passive layer of intelligence, identifying whether a store is part of a Shenzhen brand network, checking trademark and business registry data, or evaluating whether a passerby wearing an NYPD jacket is an actual police officer. In the demo, the system correctly contextualises location (Shenzhen) and identifies the jacket as a civilian fashion item.<\/p>\n<p>Story continues below this ad<\/p>\n<p>The demo also shows ByteDance\u2019s image-generation tools, modifying only the clothes in a photo while leaving the scene intact. This allows the agent to re-render the person in a Chinese police uniform or FBI jacket on request.<\/p>\n<p><b>What does this mean for us?<\/b><\/p>\n<p>This device is essentially an OS-native GUI agent that has been trained on Chinese mobile UI flows and is backed by a large, multimodal reasoning model. It eliminates the need to understand apps, menus, or workflows. Simply give the phone intent; it handles the execution.<\/p>\n<p>As of today, nothing in the global smartphone market demonstrates this level of autonomy. It remains to be seen if this becomes a commercial product, but the prototype clearly shows how agentic smartphones may change our lives. It also shows that the first true agentic smartphones may not come from Silicon Valley, but from China\u2019s integrated AI and mobile ecosystem.<\/p>\n<p><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n","protected":false},"excerpt":{"rendered":"China is moving ahead briskly in the AI arms race. While the rest of the world has been&hellip;\n","protected":false},"author":2,"featured_media":614353,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3159],"tags":[191513,191525,191520,191517,191514,191526,191518,191509,191524,191515,191512,191521,547,191519,191522,191510,191523,191511,191516,191528,191527,53,16,15,191508],"class_list":{"0":"post-614352","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-mobile","8":"tag-agentic-smartphone-china","9":"tag-ai-book-robotaxi","10":"tag-ai-for-gig-economy-services","11":"tag-ai-navigating-third-party-apps","12":"tag-ai-operate-phone-like-human","13":"tag-ai-ui-vision","14":"tag-autonomous-task-execution-smartphone","15":"tag-bytedance-doubao-ai","16":"tag-china-ai-arms-race","17":"tag-doubao-nebula-gui-architecture","18":"tag-full-stack-control-ai","19":"tag-large-sparse-mixture-of-experts-model","20":"tag-mobile","21":"tag-multi-agent-negotiation-ai","22":"tag-multimodal-reasoning-doubao","23":"tag-nubia-m153-prototype","24":"tag-on-device-screen-control-ai","25":"tag-os-native-gui-agent","26":"tag-snapdragon-8-elite-gen-5-ai","27":"tag-snow-bull-capital","28":"tag-taylor-ogan","29":"tag-technology","30":"tag-uk","31":"tag-united-kingdom","32":"tag-zte-ai-agent-smartphone"},"share_on_mastodon":{"url":"","error":"Validation failed: Text character limit of 500 exceeded"},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/614352","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=614352"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/614352\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/614353"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=614352"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=614352"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=614352"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}