{"id":468872,"date":"2026-05-05T02:47:09","date_gmt":"2026-05-05T02:47:09","guid":{"rendered":"https:\/\/www.europesays.com\/ie\/468872\/"},"modified":"2026-05-05T02:47:09","modified_gmt":"2026-05-05T02:47:09","slug":"ai-model-reads-genetic-code-to-trace-evolutionary-history","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/ie\/468872\/","title":{"rendered":"AI model reads genetic code to trace evolutionary history"},"content":{"rendered":"<p>Researchers at the University of Oregon have developed an artificial intelligence tool that can read genetic code the way large language models like ChatGPT read text.\u00a0Scanning the genome for biological mutation patterns, the computer model traces pairs of genes back in time to their last common ancestor.\u00a0<\/p>\n<p>It&#8217;s\u00a0the first language model designed for population genetics, said\u00a0Andrew Kern, a computational biologist in the UO\u00a0College of Arts and Sciences. As described in a paper published April 10 in the\u00a0Proceedings of the National Academy of Sciences, the AI tool offers scientists a fast and flexible alternative to classical methods for reconstructing evolutionary history.\u00a0<\/p>\n<p>In practice, it\u00a0can help\u00a0researchers like Kern\u00a0understand when disease-resistance genes\u00a0emerged\u00a0in a population, for example, or when\u00a0species\u00a0evolved key traits.\u00a0<\/p>\n<p>&#8220;Advances in generative AI and the\u00a0architectures\u00a0behind them are potentially useful to a number of fields outside a chatbot,&#8221; said Kern, an Evergreen professor of biology. &#8220;We&#8217;re borrowing strengths from the world of AI and applying them in this different context that&#8217;s largely been untapped.&#8221;\u00a0<\/p>\n<p>Genomes are often compared to a written language, with combinations of DNA&#8217;s four-letter alphabet &#8211; A, T, C and G &#8211; forming the basis for genes and chromosomes. Kern and his lab are most interested in\u00a0what&#8217;s\u00a0misspelled, which scientists call mutations: changes in DNA sequences, like swapped or missing letters, that accumulate over time as part of evolution.\u00a0<\/p>\n<p>Often harmless, mutations can be passed down from generation to generation, leaving a trail of breadcrumbs for tracing ancestral relationships.\u00a0<\/p>\n<p>Traditional methods based on math and statistics are the gold standard for translating mutations into ancestry.\u00a0They&#8217;re\u00a0difficult to beat in most cases, said Kevin Korfmann, lead author of the study and former postdoctoral researcher at the UO. But\u00a0those\u00a0classical probabilistic approaches can be slow and struggle with large or incomplete genomic datasets, he\u00a0added.\u00a0<\/p>\n<p>So, the researchers looked\u00a0to\u00a0AI to efficiently interpret the language of life by\u00a0modifying\u00a0a GPT-2 model, the older machine learning architecture behind ChatGPT. But instead of being trained on large volumes of English text, the language model was trained on simulations of genetic evolution across\u00a0different species\u00a0&#8211; including bacteria, rodents,\u00a0mosquitoes\u00a0and primates &#8211; to learn and recognize <a href=\"https:\/\/www.news-medical.net\/health\/How-do-Genetic-Mutations-Cause-Disease.aspx\" class=\"linked-term\" rel=\"nofollow noopener\" target=\"_blank\">mutation<\/a> patterns.\u00a0<\/p>\n<p>&#8220;We can&#8217;t repeat evolution, so one of the key workflows we have is developing simulations,&#8221; Korfmann said. &#8220;The simulations\u00a0mimic\u00a0evolutionary processes, and then we use the outcomes as training data for our deep learning models.&#8221;\u00a0<\/p>\n<p>In general, stretches of DNA with many mutations likely trace back to a distant common ancestor, whereas those with few mutations are likely to share a more recent ancestor.\u00a0This helps explain why chimpanzees are considered humans&#8217;\u00a0closest living relatives, with similar DNA, while sea sponges are the most distant, having diverged\u00a0genetically\u00a0more than 700 million years ago.\u00a0<\/p>\n<p>Based on those mutation patterns and other biological principles, the AI model can predict when gene pairs last shared a common ancestor, known as the &#8220;coalescence time.&#8221;\u00a0<\/p>\n<p>In tests, the tool performed as well as\u00a0state-of-the-art\u00a0statistical methods,\u00a0which was\u00a0surprising\u00a0to\u00a0the research team.\u00a0<\/p>\n<p>&#8220;You never really know what&#8217;s going to work when you&#8217;re essentially borrowing techniques from a totally different world and applying them to a new problem,&#8221; Kern said. &#8220;But this was a case where things worked really well.&#8221;\u00a0<\/p>\n<p>The computer model\u00a0was\u00a0also dramatically faster.\u00a0While\u00a0traditional methods can take hours or even days to decode\u00a0a single mosquito chromosome,\u00a0the\u00a0new approach\u00a0can do it in minutes.\u00a0That\u00a0efficiency is especially\u00a0beneficial for\u00a0scientists\u00a0handling\u00a0large amounts\u00a0of genetic sequence data.\u00a0<\/p>\n<p>&#8220;Compared to classical inferential approaches, the AI tool doesn&#8217;t have to reason\u00a0about\u00a0every mutation individually,&#8221; Korfmann said. &#8220;It just reads the patterns because all of the expensive statistical work was done up front, during training, which sidesteps the bottleneck.&#8221;\u00a0<\/p>\n<p>The model&#8217;s simulation-based training also enables scientists to use DNA datasets that are incomplete or missing genetic code &#8211; an issue Kern\u00a0frequently\u00a0faces when working with mosquito genetic databases for\u00a0his research on malaria transmission.\u00a0<\/p>\n<p>That versatility comes at a crucial moment for malaria control, Kern said. For decades, insecticides have been a cornerstone for\u00a0the\u00a0prevention of malaria-spreading mosquitoes. But evolution, as Kern puts it, &#8220;did its thing.&#8221;\u00a0<\/p>\n<p>&#8220;Insecticide resistance is being observed in all of these mosquito populations today,&#8221; he said.\u00a0&#8220;A major challenge in preventing the spread of malaria has been understanding the evolution of insecticide resistance. Now, we can go in with our AI model, ask how long ago these resistance genes arose in the population, and learn about the evolutionary history of this critical carrier of malaria.&#8221;\u00a0<\/p>\n<p>Looking ahead, Kern and Korfmann\u00a0aim to advance the biological model\u00a0beyond tracing shared ancestry between\u00a0two\u00a0lineages\u00a0towards\u00a0reconstructing full\u00a0genealogical\u00a0trees across multiple\u00a0lineages.\u00a0Some traditional methods can already do this,\u00a0but Kern said\u00a0they&#8217;d\u00a0like to\u00a0chase\u00a0that goal\u00a0from\u00a0a\u00a0machine-learning angle.\u00a0<\/p>\n<p>&#8220;There&#8217;s so much going on in the machine learning field that we haven&#8217;t applied yet in our field,&#8221; Korfmann said. &#8220;There&#8217;s tons of translational work to do to get these novel algorithms working in biology.&#8221;\u00a0<\/p>\n<p>Source:<\/p>\n<p>Journal reference:<\/p>\n<p>DOI:\u00a0<a href=\"http:\/\/dx.doi.org\/10.1073\/pnas.2518956123\" rel=\"noopener nofollow\" target=\"_blank\">10.1073\/pnas.2518956123<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"Researchers at the University of Oregon have developed an artificial intelligence tool that can read genetic code the&hellip;\n","protected":false},"author":2,"featured_media":112634,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[77],"tags":[289,3286,18,8101,5506,5052,458,3289,19,17,8659,610,8704,8447,5053,172,133],"class_list":{"0":"post-468872","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-science","8":"tag-artificial-intelligence","9":"tag-dna","10":"tag-eire","11":"tag-evolution","12":"tag-genes","13":"tag-genetic","14":"tag-genetics","15":"tag-genome","16":"tag-ie","17":"tag-ireland","18":"tag-language","19":"tag-machine-learning","20":"tag-malaria","21":"tag-mosquito","22":"tag-mutation","23":"tag-research","24":"tag-science"},"share_on_mastodon":{"url":"","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/posts\/468872","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/comments?post=468872"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/posts\/468872\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/media\/112634"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/media?parent=468872"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/categories?post=468872"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/ie\/wp-json\/wp\/v2\/tags?post=468872"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}