{"id":430809,"date":"2025-09-17T09:09:09","date_gmt":"2025-09-17T09:09:09","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/430809\/"},"modified":"2025-09-17T09:09:09","modified_gmt":"2025-09-17T09:09:09","slug":"origins-of-the-genetic-code-traced-to-dipeptides","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/430809\/","title":{"rendered":"Origins of the Genetic Code Traced to Dipeptides"},"content":{"rendered":"<p>Genes are the building blocks of life, and the genetic code provides the instructions for the complex processes that make organisms function. But how and why did it come to be the way it is? A\u00a0<a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0022283625004620?via%3Dihub\" target=\"_blank\" rel=\"noopener\">recent study<\/a>\u00a0from the\u00a0<a href=\"https:\/\/illinois.edu\/\" target=\"_blank\" rel=\"noopener\">University of Illinois Urbana-Champaign<\/a>\u00a0sheds new light on the origin and evolution of the genetic code, providing valuable insights for genetic engineering and bioinformatics.<\/p>\n<p>\u201cWe find the origin of the genetic code mysteriously linked to the dipeptide composition of a proteome, the collective of proteins in an organism,\u201d said corresponding author\u00a0<a href=\"https:\/\/cropsciences.illinois.edu\/directory\/gca\" target=\"_blank\" rel=\"noopener\">Gustavo Caetano-Anoll\u00e9s<\/a>, professor in the\u00a0<a href=\"https:\/\/cropsciences.illinois.edu\/\" target=\"_blank\" rel=\"noopener\">Department of Crop Sciences<\/a>, the Carl R. Woese\u00a0<a href=\"https:\/\/www.igb.illinois.edu\/\" target=\"_blank\" rel=\"noopener\">Institute for Genomic Biology<\/a>, and\u00a0<a href=\"https:\/\/medicine.illinois.edu\/academics\/departments\/biomedical-and-translational-sciences\" target=\"_blank\" rel=\"noopener\">Biomedical and Translation Sciences<\/a>\u00a0of\u00a0<a href=\"https:\/\/medicine.illinois.edu\/\" target=\"_blank\" rel=\"noopener\">Carle Illinois College of Medicine\u00a0<\/a>at U. of I.<\/p>\n<p>Caetano-Anoll\u00e9s\u2019 work focuses on phylogenomics, which is the study of evolutionary relationships between the genomes of organisms. His research team previously built phylogenetic trees mapping the evolutionary timelines of protein domains (structural units in proteins) and transfer RNA (tRNA), an RNA molecule that delivers amino acids to the ribosome during protein synthesis. In this study, they explored the evolution of dipeptide sequences (basic modules of two amino acids linked by a peptide bond), finding the histories of domains, tRNA, and dipeptides all match.<\/p>\n<p>Life on Earth began 3.8 billion years ago, but genes and the genetic code did not emerge until 800,000 million years later, and there are competing theories about how it happened.<\/p>\n<p>Some scientists believe RNA-based enzymatic activity came first, while others suggest proteins first started working together. The research of Caetano-Anoll\u00e9s and his colleagues over the past decades supports the latter view, showing that ribosomal proteins and tRNA interactions appeared later in the evolutionary timeline.<\/p>\n<p>Life runs on two codes that work hand in hand, Caetano-Anoll\u00e9s explained. The genetic code stores instructions in nucleic acids (DNA and RNA), while the protein code tells enzymes and other molecules how to keep cells alive and running. Bridging the two is the ribosome, the cell\u2019s protein factory, which assembles amino acids carried by tRNA molecules into proteins. The enzymes that load the amino acids onto the tRNAs are called aminoacyl tRNA synthetases. These synthetase enzymes serve as guardians of the genetic code, monitoring that everything works properly.<\/p>\n<p>\u201cWhy does life rely on two languages \u2013 one for genes and one for proteins?\u201d Caetano-Anoll\u00e9s asked. \u201cWe still don\u2019t know why this dual system exists or what drives the connection between the two. The drivers couldn\u2019t be in RNA, which is functionally clumsy. Proteins, on the other hand, are experts in operating the sophisticated molecular machinery of the cell.\u201d<\/p>\n<p>The proteome appeared to be a better fit to hold the early history of the genetic code, with dipeptides playing a particularly significant role as early structural modules of proteins. There are 400 possible dipeptide combinations whose abundances vary across different organisms.<\/p>\n<p>The research team analyzed a dataset of 4.3 billion dipeptide sequences across 1,561 proteomes representing organisms from the three superkingdoms of life: Archaea, Bacteria, and Eukarya. They used the information to construct a phylogenetic tree and a chronology of dipeptide evolution. They also mapped the dipeptides to a tree of protein structural domains to see if similar patterns arose.<\/p>\n<p>In previous work, the researchers had built a phylogeny of tRNA that helped provide a timeline of the entry of amino acids into the genetic code, categorizing amino acids into three groups based on when they appeared. The oldest were Group 1, which included tyrosine, serine, and leucine, and Group 2, with 8 additional amino acids. These two groups were associated with the origin of editing in synthetase enzymes, which corrected inaccurate loading of amino acids, and an early operational code, which established the first rules of specificity, ensuring each codon corresponds to a single amino acid. Group 3 included amino acids that came later and were linked to derived functions related to the standard genetic code.\u00a0<\/p>\n<p>The team had already demonstrated the co-evolution of synthetases and tRNA in relation to the appearance of amino acids. Now, they could add dipeptides to the analysis.\u00a0\u00a0<\/p>\n<p>\u201cWe found the results were congruent,\u201d Caetano-Anoll\u00e9s explained. \u201cCongruence is a key concept in phylogenetic analysis. It means that a statement of evolution obtained with one type of data is confirmed by another. In this case, we examined three sources of information: protein domains, tRNAs, and dipeptide sequences. All three reveal the same progression of amino acids being added to the genetic code in a specific order.\u201d\u00a0<\/p>\n<p>Another novel finding was duality in the appearance of dipeptide pairs. Each dipeptide combines two amino acids, for example, alanine-leucine (AL), while a symmetrical one \u2014 an anti-dipeptide \u2014 has the opposite combination of leucine-alanine (LA). The two dipeptides in a pair are complementary; they can be considered mirror images of each other.\u00a0<\/p>\n<p>\u201cWe found something remarkable in the phylogenetic tree,\u201d Caetano-Anoll\u00e9s said. \u201cMost dipeptide and anti-dipeptide pairs appeared very close to each other on the evolutionary timeline. This synchronicity was unanticipated. The duality reveals something fundamental about the genetic code with potentially transformative implications for biology. It suggests dipeptides were arising encoded in complementary strands of nucleic acid genomes, likely minimalistic tRNAs that interacted with primordial synthetase enzymes.\u201d\u00a0<\/p>\n<p>Dipeptides did not arise as arbitrary combinations but as critical structural elements that shaped protein folding and function. The study suggests that dipeptides represent a primordial protein code emerging in response to the structural demands of early proteins, alongside an early RNA-based operational code. This process was shaped by co-evolution, molecular editing, catalysis, and specificity, ultimately giving rise to the synthetase enzymes, the modern guardians of the genetic code.<\/p>\n<p>Uncovering the evolutionary roots of the genetic code deepens our understanding of life\u2019s origin, and it informs modern fields such as genetic engineering, synthetic biology, and biomedical research.<\/p>\n<p>\u201cSynthetic biology is recognizing the value of an evolutionary perspective. It strengthens genetic engineering by letting nature guide the design. Understanding the antiquity of biological components and processes is important because it highlights their resilience and resistance to change. To make meaningful modifications, it is essential to understand the constraints and underlying logic of the genetic code,\u201d Caetano-Anoll\u00e9s said.<\/p>\n<p><b>Reference:\u00a0<\/b>Wang M, Aziz MF, Caetano-Anoll\u00e9s G. Tracing the origin of the genetic code and thermostability to dipeptide sequences in proteomes. J Mol Biol. 2025:169396. doi:\u00a0<a href=\"https:\/\/doi.org\/10.1016\/j.jmb.2025.169396\" style=\"background-color: rgb(255, 255, 255);\" target=\"_blank\" rel=\"noopener\">10.1016\/j.jmb.2025.169396<\/a><\/p>\n<p>This article has been republished from the following <a href=\"https:\/\/aces.illinois.edu\/news\/u-i-researchers-trace-genetic-codes-origins-early-protein-structures\" target=\"_blank\" rel=\"noopener\">materials<\/a>. Note: material may have been edited for length and content. For further information, please contact the cited source. Our press release publishing policy can be accessed <a href=\"https:\/\/www.technologynetworks.com\/tn\/editorial-policies#republishing\" target=\"_blank\" rel=\"noopener\">here<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"Genes are the building blocks of life, and the genetic code provides the instructions for the complex processes&hellip;\n","protected":false},"author":2,"featured_media":430810,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3846],"tags":[267,70,16,15],"class_list":{"0":"post-430809","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-genetics","9":"tag-science","10":"tag-uk","11":"tag-united-kingdom"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/115218858904944236","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/430809","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=430809"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/430809\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/430810"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=430809"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=430809"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=430809"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}