{"id":262220,"date":"2026-01-01T20:32:09","date_gmt":"2026-01-01T20:32:09","guid":{"rendered":"https:\/\/www.europesays.com\/ie\/262220\/"},"modified":"2026-01-01T20:32:09","modified_gmt":"2026-01-01T20:32:09","slug":"deepseek-kicks-off-2026-with-paper-signalling-push-to-train-bigger-models-for-less","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/ie\/262220\/","title":{"rendered":"DeepSeek kicks off 2026 with paper signalling push to train bigger models for less"},"content":{"rendered":"<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">Chinese artificial intelligence start-up DeepSeek has ushered in 2026 with a new technical paper, co-authored by founder Liang Wenfeng, that proposes a rethink of the fundamental architecture used to train foundational AI models.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">The method \u2013 dubbed Manifold-Constrained Hyper-Connections (mHC) \u2013 forms part of the Hangzhou firm\u2019s push to make its models more cost-effective as it strives to keep pace with better-funded US rivals with deeper access to computing power.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">It also reflected the increasingly open, collaborative culture among Chinese AI companies, which have published a growing share of their research in public.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">For industry watchers, DeepSeek\u2019s papers often provide an important early signal of the engineering choices that will shape the start-up\u2019s next major model release.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">In the paper, released on Thursday, a team of 19 DeepSeek researchers said they tested mHC on models with 3 billion, 9 billion and 27 billion parameters, and found it scaled without adding significant computational burden.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">\u201cEmpirical results confirm that mHC effectively \u2026 [enables] stable large-scale training with superior scalability compared with conventional HC (hyper-connections),\u201d wrote the researchers, led by Zhenda Xie, Yixuan Wei and Huanqi Cao.<\/p>\n","protected":false},"excerpt":{"rendered":"Chinese artificial intelligence start-up DeepSeek has ushered in 2026 with a new technical paper, co-authored by founder Liang&hellip;\n","protected":false},"author":2,"featured_media":262221,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[261],"tags":[291,289,290,2786,79,381,179,18,4202,19,17,3521,5,2336,19453,119,82,107,65],"class_list":{"0":"post-262220","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-asia","12":"tag-business","13":"tag-china","14":"tag-economy","15":"tag-eire","16":"tag-hong-kong","17":"tag-ie","18":"tag-ireland","19":"tag-lifestyle","20":"tag-news","21":"tag-opinion","22":"tag-south-china-morning-post","23":"tag-sport","24":"tag-technology","25":"tag-us","26":"tag-world"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@ie\/115821749394957910","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/posts\/262220","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/comments?post=262220"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/posts\/262220\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/media\/262221"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/media?parent=262220"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/categories?post=262220"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/tags?post=262220"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}