{"id":53774,"date":"2025-04-27T03:44:17","date_gmt":"2025-04-27T03:44:17","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/53774\/"},"modified":"2025-04-27T03:44:17","modified_gmt":"2025-04-27T03:44:17","slug":"genomic-and-genetic-insights-into-mendels-pea-genes","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/53774\/","title":{"rendered":"Genomic and genetic insights into Mendel\u2019s pea genes"},"content":{"rendered":"<p>Plant materials and methodsGermplasm panel<\/p>\n<p>A total of 697 accessions, maximizing genetic diversity, were selected from the JI Pisum Germplasm Collection for this study (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM4\" target=\"_blank\" rel=\"noopener\">1<\/a> and <a href=\"https:\/\/www.seedstor.ac.uk\/\" target=\"_blank\" rel=\"noopener\">https:\/\/www.seedstor.ac.uk\/<\/a>). These germplasm accessions were introduced in 2019 to the Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China, where they are grown and investigated annually.<\/p>\n<p>DNA extraction for whole-genome resequencing<\/p>\n<p>Genomic DNA was extracted from approximately 50\u2009mg leaf tissue of 3-week-old seedlings. Extraction used the oKtopure system (LGC Biosearch Technology) following tissue desiccation with silica for 48\u2009h. A bespoke protocol was used with the following volumes per sample: 250\u2009\u00b5l lysis buffer, 170\u2009\u00b5l binding buffer, 20\u2009\u00b5l sbeadexTM suspension, 300\u2009\u00b5l PN1 wash buffer, 300\u2009\u00b5l PN2 wash buffer, 300\u2009\u00b5l PN2 wash buffer (\u00d73 wash cycles) and using 75\u2009\u00b5l final elution buffer. For each accession, a minimum of 6\u2009\u03bcg of genomic DNA was used to construct a 150-bp paired-end sequencing library with an insert size of 500\u2009bp, following the manufacturer\u2019s protocols (employing PCR-free methods), which was subsequently sequenced on the DNBSEQ Platform at BGI-Shenzhen resulting in ~80\u2009Gb clean reads with a coverage of ~20\u00d7 for each accession.<\/p>\n<p>Phenotyping<\/p>\n<p>DNA was extracted from a single plant whose seed was bulked up for progeny phenotyping. The diversity panel was planted in three different sites, Norwich, UK (52.62\u00b0 N, 1.28\u00b0 E), Shenzhen (Southern China, 22.61\u00b0 N, 114.51\u00b0 E) and Harbin (northern China, 45.86\u00b0 N, 126.83\u00b0 E). In China, four rounds of phenotyping were conducted. Specific subsets of accessions and some F2 populations were grown indoors in the greenhouse of Shenzhen Agricultural Field Farm, with 16\u2009h of light\/8\u2009h of darkness. Phenotypes collected at the three stations (2020\u20132023) and a historical JIC phenotype dataset were curated in Seedstor (<a href=\"https:\/\/www.seedstor.ac.uk\/\" target=\"_blank\" rel=\"noopener\">https:\/\/www.seedstor.ac.uk\/<\/a>). In Shenzhen, peas were planted in winter (October) and harvested in March the following year, whereas in Norwich and Harbin they were planted in spring (March to April) and harvested in August to October of the same year.<\/p>\n<p>For the phenotyping of pod colour (green versus yellow-podded lines), a field trial of three 1\u2009m2 microplots of 100 seeds each was sown in spring 2023, in which a 1:1 ratio of BC6 S3 Gp\/Gp and gp\/gp seeds (selfed seed of S2 homozygotes) were mixed and sown at random in each plot. At the pod filling stage, the Gp plants were tagged and at plot harvest seed was collected from individual plants to determine the yield of Gp and gp homozygotes. Seeds were weighed and counted on a Data Count R25+ machine (<a href=\"https:\/\/data-technologies.com\/\" target=\"_blank\" rel=\"noopener\">https:\/\/data-technologies.com\/<\/a>). Pod length and width were measured on 25 randomly selected pods. For the phenotyping of organ size, pod width (PW) and HSW were measured in mature pods of the F2 and F2:3 populations after\u00a0harvest. In the F2 populations, PW was assessed using 15 representative pods, divided into 3 groups of 5, with the total width of each group measured sequentially. For the F2:3 populations, PW was determined using 5 representative pods, with their total width measured in a similar manner. HSW was calculated by randomly weighing 100 seeds from each accession and repeating the process 3 times to obtain an average weight for each accession. Other more specific phenotypes were collected as described in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">37<\/a> and in line with published descriptors (<a href=\"https:\/\/www.seedstor.ac.uk\/search-phenotypes.php\" target=\"_blank\" rel=\"noopener\">https:\/\/www.seedstor.ac.uk\/search-phenotypes.php<\/a>).<\/p>\n<p>Construction of the pea genomic variation mapRead mapping, SNP calling and SNP annotation<\/p>\n<p>The trimmed clean reads of each accession were aligned against the reference genome of pea (P. sativum) cultivar, ZW6\u00a0(ref.\u00a0<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 18\" title=\"Yang, T. et al. Improved pea reference genome and pan-genome highlight genomic features and evolutionary characteristics. Nat. Genet. 54, 1553&#x2013;1563 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR18\" id=\"ref-link-section-d261742149e3407\" target=\"_blank\" rel=\"noopener\">18<\/a>) and Cam\u00e9or v.1.0\u00a0(ref.\u00a0<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 17\" title=\"Kreplak, J. et al. A reference genome for pea provides insight into legume genome evolution. Nat. Genet. 51, 1411&#x2013;1422 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR17\" id=\"ref-link-section-d261742149e3411\" target=\"_blank\" rel=\"noopener\">17<\/a>), using BWA-MEM (v.0.7.17) with default parameters<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 63\" title=\"Li, H. &amp; Durbin, R. Fast and accurate short read alignment with Burrows&#x2013;Wheeler transform. Bioinformatics 25, 1754&#x2013;1760 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR63\" id=\"ref-link-section-d261742149e3415\" target=\"_blank\" rel=\"noopener\">63<\/a>. Unmapped, non-unique and duplicated reads were filtered out using SAMtools<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 64\" title=\"Li, H. et al. The Sequence Alignment\/Map format and SAMtools. Bioinformatics 25, 2078&#x2013;2079 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR64\" id=\"ref-link-section-d261742149e3419\" target=\"_blank\" rel=\"noopener\">64<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 65\" title=\"DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491&#x2013;498 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR65\" id=\"ref-link-section-d261742149e3422\" target=\"_blank\" rel=\"noopener\">65<\/a> (v.1.9) and Picard (v.2.20.3-SNAPSHOT) before variants were called by a standard pipeline of Genome Analysis Toolkit (GATK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 65\" title=\"DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491&#x2013;498 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR65\" id=\"ref-link-section-d261742149e3427\" target=\"_blank\" rel=\"noopener\">65<\/a> v.4.1.2) and Sentieon<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 66\" title=\"Freed, D., Aldana, R., Weber, J. A. &amp; Edwards, J. S. The Sentieon Genomics Tools&#x2014;a fast and accurate solution to variant calling from next-generation sequence data. Preprint at BioRxiv &#010;                https:\/\/doi.org\/10.1101\/115717&#010;                &#010;               (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR66\" id=\"ref-link-section-d261742149e3431\" target=\"_blank\" rel=\"noopener\">66<\/a> (v.202112.01). SNPs were further filtered to remove low-quality variants defined as (1) SNPs with more than two alleles; (2) SNPs with QD\u2009\u200960.0, MQ\u2009\u20093.0, MQRankSum\u2009Hobs) exceeding the maximum calculated value (Hobs_max) based on the inbreeding coefficient (F), where F was calculated as 1\u2009\u2212\u2009(Hobs\/Hexp), with Hexp defined as 2p(1\u2009\u2212\u2009p) using the frequency of the non-reference allele, and Hobs_max was determined as 10\u2009\u00d7\u2009(1\u2009\u2212\u2009Fmedian)\u2009\u00d7\u2009Hexp for variants with F\u2009&gt;\u20090 and minor allele frequency (MAF)\u2009&gt;\u20090.05; (4) SNPs with missing rate &gt;20% and MAF\u2009<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 67\" title=\"Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly 6, 80&#x2013;92 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR67\" id=\"ref-link-section-d261742149e3485\" target=\"_blank\" rel=\"noopener\">67<\/a> (v.4.3t) was used to annotate the SNPs, and functional significance was then categorized on the basis of\u00a0their positions with respect to genes (intergenic regions, exons, introns, splicing sites, untranslated regions, upstream and downstream regions) and mutation consequences (missense, start codon gain or loss, stop codon gain or loss and splicing mutations).<\/p>\n<p>Identification of indels, gene PAV and gene CNVs, and SV<\/p>\n<p>Small insertion\u2013deletion mutants (indels; \u226450\u2009bp) were called using GATK (v.4.1.2) and filtered following the criteria: QD\u2009\u2009200.0 || high_FS || ReadPosRankSum\u2009<\/p>\n<p>Different categories of structural variants (SVs: duplication, inversion, translocation and large-scale deletion or insertion) were detected on the basis of\u00a0read mapping (read depth and read pair relationships) on PCR-duplicate-marked bam files using Delly (v 0.8.7) with default parameters; a summary of SVs identified is given in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">11<\/a>.<\/p>\n<p>Linkage disequilibrium analysis and pea haplotype map<\/p>\n<p>A two-step LD pruning process was implemented to generate a high-quality core SNP dataset for the construction of a haplotype map<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 68\" title=\"Wang, W. et al. Genomic variation in 3,010 diverse accessions of Asian cultivated rice. Nature 557, 43&#x2013;49 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR68\" id=\"ref-link-section-d261742149e3511\" target=\"_blank\" rel=\"noopener\">68<\/a>. Initially, SNPs were pruned on the basis of\u00a0linkage disequilibrium (LD) using PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 69\" title=\"Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR69\" id=\"ref-link-section-d261742149e3515\" target=\"_blank\" rel=\"noopener\">69<\/a>, with a window size of 10\u2009kb, a window step of one SNP, and an r2 threshold of 0.8. A second round of LD pruning was conducted with a window size of 50\u2009kb, a window step of one SNP, and the same r2 threshold of 0.8. For population LD-based haplotype analysis, the filtered SNPs were phased using Beagle (v.21Apr21.304)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 70\" title=\"Browning, S. R. &amp; Browning, B. L. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am. J. Hum. Genet. 81, 1084&#x2013;1097 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR70\" id=\"ref-link-section-d261742149e3528\" target=\"_blank\" rel=\"noopener\">70<\/a>. Subsequently, haplotype blocks were delineated utilizing PLINK with specific parameters (&#8211;blocks no-pheno-req &#8211;blocks-max-kb 1000 &#8211;geno 0.1 &#8211;blocks-min-maf 0.05). To merge adjacent blocks maintaining significant LD, D\u2019 statistic values were calculated between all SNPs of consecutive blocks. If the lower quartile (Q1) exceeded 0.98, the adjacent blocks were merged. After filtering for the inbreeding coefficient, HAPPE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 71\" title=\"Feng, C. et al. HAPPE: a tool for population haplotype analysis and visualization in editable Excel tables. Front. Plant Sci. 13, 927407 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR71\" id=\"ref-link-section-d261742149e3532\" target=\"_blank\" rel=\"noopener\">71<\/a> was employed to identify haplotype clusters (haplogroups) for each block.<\/p>\n<p>Construction of mapping and validation populationsJI2822\u2009\u00d7\u2009JI0816 F2 population<\/p>\n<p>Lines JI0816 and JI2822 (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">18<\/a>), both of short stature, are maintained in the John Innes Pisum germplasm collection (<a href=\"https:\/\/www.seedstor.ac.uk\/\" target=\"_blank\" rel=\"noopener\">https:\/\/www.seedstor.ac.uk\/<\/a>). JI0816, also known as WBH 1185, has pink flowers, a fasciated stem and yellow pods lacking pod parchment, corresponding to the mutant alleles b, fa, gp and p, respectively. JI2822, a recombinant inbred line derived from the cross JI0015\u2009\u00d7\u2009JI0399, is wild type at these four loci. JI0015 and JI0816 share the gp allele, indicating that these two lines had a common parent; therefore segments of the genetic map are devoid of segregating alleles. 1,000 F2 seeds from 9 F1 plants (JI2822\u2009\u00d7\u2009JI0816) were sown at the JIC field station in spring 2022. DNA preps from 942 plants were prepared from individual leaflets using the Qiagen DNeasy protocol (<a href=\"https:\/\/www.qiagen.com\" target=\"_blank\" rel=\"noopener\">https:\/\/www.qiagen.com<\/a>). Of these, 405 were genotyped using an Axiom SNP array as described<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Ellis, N. et al. Recombinant inbred lines derived from wide crosses in pisum. Sci. Rep. 13, 20408&#x2013;20408 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR49\" id=\"ref-link-section-d261742149e3593\" target=\"_blank\" rel=\"noopener\">49<\/a>. The phenotypic and genotypic data are available in Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">18<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">20<\/a>, and the sequences corresponding to the Axiom markers are available in supplementary table\u00a03 of Ellis et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Ellis, N. et al. Recombinant inbred lines derived from wide crosses in pisum. Sci. Rep. 13, 20408&#x2013;20408 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR49\" id=\"ref-link-section-d261742149e3603\" target=\"_blank\" rel=\"noopener\">49<\/a>.<\/p>\n<p>JI0015\u2009\u00d7\u2009JI0399 and JI2822\u2009\u00d7\u2009JI2833<\/p>\n<p>Three populations have been used for mapping Gp. The first to be used was the previously described recombinant inbred population JI0015\u2009\u00d7\u2009JI0399 (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">21<\/a>), later genotyped by Neogen, using an Infinium array as described previously<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Sawada, C. et al. An integrated linkage map of three recombinant inbred populations of pea (Pisum sativum L). Genes 13, 196&#x2013;196 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR51\" id=\"ref-link-section-d261742149e3621\" target=\"_blank\" rel=\"noopener\">51<\/a>. The second was an F2 population derived from a cross between two of these RILs (JI2822 Gp\/Gp and JI2833 gp\/gp), which was screened using PCR for markers already mapped in JI0015\u2009\u00d7\u2009JI0399 in order to identify informative individuals (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">22<\/a>). These, together with selected RILs with informative recombination events were genotyped with Axiom markers as described elsewhere<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Ellis, N. et al. Recombinant inbred lines derived from wide crosses in pisum. Sci. Rep. 13, 20408&#x2013;20408 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR49\" id=\"ref-link-section-d261742149e3637\" target=\"_blank\" rel=\"noopener\">49<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 65\" title=\"DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491&#x2013;498 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR65\" id=\"ref-link-section-d261742149e3640\" target=\"_blank\" rel=\"noopener\">65<\/a>. Gp also segregates in the JI2822\u2009\u00d7\u2009JI0816 F2 population as described above. The marker data are available in the supplementary file: Gp mapping in JI0015\u2009\u00d7\u2009JI0399 (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">21<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">22<\/a>).<\/p>\n<p>Other F2 mapping populations and BSA<\/p>\n<p>We selected parental lines with contrasting pairs of traits to map genetic loci of interest in F2 populations using mapping by sequencing<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 72\" title=\"Schneeberger, K. Using next-generation sequencing to isolate mutant genes from forward genetic screens. Nat. Rev. Genet. 15, 662&#x2013;676 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR72\" id=\"ref-link-section-d261742149e3669\" target=\"_blank\" rel=\"noopener\">72<\/a> of bulked segregants. For genetic loci controlling uncharacterized Mendel traits: flower position (axial versus terminal), pod colour (yellow versus green), and pod shape (inflated versus constricted), crosses were made between Cam\u00e9or (axial)\u2009\u00d7\u2009JI0814 (fasciated) and JI1995 (green pod)\u2009\u00d7\u2009JI2366 (yellow pod). F2 populations for the P\/V loci (pod shape) were derived from the cross between JI0074 (P\/P v\/v) as the male parent and JI1995 (P\/P V\/V) as the female parent, and the cross between JI2822\u2009\u00d7\u2009JI0816. F2 populations for the D locus (one (Dco) or two (Dw) axial rings of anthocyanin pigmentation) were derived from three crosses, with JI0191 (Dw), JI0794 (Dw) and JI1669 (Dw) as male parents and JI0328 (Dco) as the female parent. F2 populations for the Fn\/Fna loci (flower number per node, fpn) were derived from four crosses, with JI0441 (1fpn), JI2410 (3fpn), JI0745 (2fpn) and JI0746 (3fpn) as male parents and JI1995 (2fpn) as the female parent. The marker and BSA analysis of the F2 population is as described<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 36\" title=\"Karaca, D. E. Molecular Mapping of the v Locus in Pea (Pisum sativum l.). Master of Science Thesis, Washington State Univ. (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR36\" id=\"ref-link-section-d261742149e3735\" target=\"_blank\" rel=\"noopener\">36<\/a>.<\/p>\n<p>Approximately 300 plants from the F2 population of each of these crosses were planted in Shenzhen, China. Wild-type and mutant bulked DNA samples were prepared by mixing equal amounts of DNA from 30 accessions with the dominant and recessive phenotypes, respectively. DNA was isolated from fresh leaves using the CTAB method<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 73\" title=\"Aboul-Maaty, N. A.-F. &amp; Oraby, H. A.-S. Extraction of high-quality genomic DNA from different plant orders applying a modified CTAB-based method. Bull. Natl. Res. Cent. 43, 25 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR73\" id=\"ref-link-section-d261742149e3744\" target=\"_blank\" rel=\"noopener\">73<\/a>. 50\u00d7 depth genome sequences for each of the parents and the bulked samples were generated. Short reads were aligned against the ZW6 reference genome using BWA-MEM (v.0.7.17) and SNPs were identified using Samtools (v.1.9). The variation dataset was analysed using the G\u2019s value method of the QTLseqr package (v.0.7.5.2).<\/p>\n<p>Genetic mapping of Gp<\/p>\n<p>Green versus yellow pod colour segregates in the recombinant inbred (RIL) population derived from the cross between JI0015 (gp\/gp) and JI0399 (Gp\/Gp). The JI0015\u2009\u00d7\u2009JI0399 RIL population comprises 90 recombinant inbred lines, which, together with their parents, were genotyped using an Infinium array (Neogen) that detected 13,204 biallelic SNPs. This enabled us to position 5,209 PsCam markers on a genetic map (JI0015\u2009\u00d7\u2009JI0399) and place Gp between the markers PsCam005046 and PsCam056084 (and their co-segregating markers). Additional mapping was undertaken, using an Axiom SNP array with 84,691 features<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Ellis, N. et al. Recombinant inbred lines derived from wide crosses in pisum. Sci. Rep. 13, 20408&#x2013;20408 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR49\" id=\"ref-link-section-d261742149e3769\" target=\"_blank\" rel=\"noopener\">49<\/a> of selected F2 progeny of a cross between JI2822 (Gp) and JI2833 (gp) together with RILs from JI0015\u2009\u00d7\u2009JI0399 known to have recombination events at informative locations. JI2822 and JI2833 are both RILs from the JI0015\u2009\u00d7\u2009JI0399 population. With respect to the ZW6 assembly<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 18\" title=\"Yang, T. et al. Improved pea reference genome and pan-genome highlight genomic features and evolutionary characteristics. Nat. Genet. 54, 1553&#x2013;1563 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR18\" id=\"ref-link-section-d261742149e3782\" target=\"_blank\" rel=\"noopener\">18<\/a>, this placed Gp between the Axiom markers AX-183865165 (chr. 2:320968993) and AX-183571028 (chr. 3:325580858) (JI0015\u2009\u00d7\u2009JI0399). Analysis of an F2 population derived from crosses between JI2822 (Gp) and JI0816 (gp) placed Gp between the Axiom markers AX-183571050 (chr. 3:321020350) and AX-183879077, (chr. 3:324762848; Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">18<\/a>).<\/p>\n<p>We performed different association genomics analysis for pod colours, including the SNP-based GWAS, LD-based haplotype GWAS, kmer-derived IBS-based haplotype GWAS, and the SV-based GWAS (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM1\" target=\"_blank\" rel=\"noopener\">14<\/a>), all resulting in consistent and significant single GWAS peaks for pod colour located in the expected position of Gp, as seen in Manhattan plots (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM1\" target=\"_blank\" rel=\"noopener\">14<\/a>).<\/p>\n<p>Allelism tests for gp<\/p>\n<p>Crosses were made between pairs of yellow-podded lines in the JIC germplasm collection (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">18<\/a>). Seed and vegetative phenotypes were used to identify F1 progeny plants, and those accessions allelic, or non-allelic, to gp were identified by their yellow or green pod colour, respectively.<\/p>\n<p>Near-isogenic lines for Gp versus gp<\/p>\n<p>The JI0015 gp allele was introgressed into the Cam\u00e9or background by sequential back-crossing and F1 progeny testing using a codominant PCR marker assay with one forward (25994_F) and two reverse (25994_15R and 25994_399R) primers (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">18<\/a>). Gp (596\u2009bp) and gp (688\u2009bp) alleles were distinguished in a 35 cycle, 10s\u201330s\u201360s Touchdown PCR reaction that reduces the initial 62\u2009\u00b0C annealing temperature to 50\u2009\u00b0C in the first 10 cycles.<\/p>\n<p>Marker development and QTL mapping for PsOs1<\/p>\n<p>The organ size-related quantitative trait locus (PsOs1) was fine-mapped using 21 Kompetitive Allele Specific PCR (KASP) markers for SNPs distinguishing accessions JI0074 and JI1995 after whole-genome resequencing in the candidate region. Each KASP marker was designed with two allele-specific forward primers (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">47<\/a>) and one common reverse primer, on the basis of\u00a0200-bp sequences upstream and downstream of target genic SNPs, following the standards of LGC Biosearch Technologies. The genetic linkage map was constructed using JoinMap v.4.0 software. Windows QTL Cartographer v.2.5 software facilitated inclusive composite interval mapping (ICIM) for identifying and analysing QTLs. A logarithm of odds (LOD) score of \u22653.0 was deemed indicative of a QTL.<\/p>\n<p>Genome-wide association study<\/p>\n<p>The multi-location and multi-season phenotypic dataset was used to perform GWASs with the SNP matrix using GEMMA (v.0.98.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 74\" title=\"Zhou, X. &amp; Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 44, 821&#x2013;824 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR74\" id=\"ref-link-section-d261742149e3889\" target=\"_blank\" rel=\"noopener\">74<\/a>, and employing the following parameters (gemma-0.98.1-linux-static -miss 0.9 \u2013gk -o kinship.txt and gemma-0.98.1-linux-static -miss 0.9 -lmm -k kinship.txt). The structural variation matrix was used to test for association with phenotypic variation for each of the selected traits using the same parameters as above. The haplotype map was used to test for association with phenotypic variation for each of the selected traits using RTM-GWAS<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 75\" title=\"He, J. et al. An innovative procedure of genome-wide association analysis fits studies on germplasm population and plant breeding. Theor. Appl. Genet. 130, 2327&#x2013;2343 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR75\" id=\"ref-link-section-d261742149e3893\" target=\"_blank\" rel=\"noopener\">75<\/a> with the following parameters (rtm-gwas-gsc \u2013vcf in.vcf \u2013out out.matrix and rtm-gwas-assoc \u2013vcf in.vcf &#8211;covar out.matrix.evec &#8211;no-gxe). We used GEMMA\u2019s Wald tests and directly visualized the resulting P values as \u2013log10-transformed values in Manhattan plots. In-house R scripts were employed for data plotting. To identify SNPs of interest, we applied a Bonferroni correction to the significance threshold (\u03b1\u2009=\u20090.1) based on the total number of tested variants (9,214,461 SNPs). This yielded a threshold of \u2013log10(0.1\/9,214,461), and any SNP surpassing this cutoff was considered noteworthy and highlighted in the Manhattan plots.<\/p>\n<p>Orthologues and gene family analysis<\/p>\n<p>Phylogenetic analyses were conducted on key gene families in this study, such as MYB, CLE and CIK\/SERK, following a consistent workflow. Relevant A. thaliana orthologous genes containing the required domains were retrieved from TAIR (<a href=\"https:\/\/www.arabidopsis.org\/\" target=\"_blank\" rel=\"noopener\">https:\/\/www.arabidopsis.org<\/a>), and profile hidden Markov models (HMMs) were constructed using HMMER (v.3.1b1) on the basis of\u00a0multiple sequence alignments generated by MAFFT (v.7.475). These HMMs were then employed to identify putative homologues in the pea (P. sativum) ZW6 genome. Multiple sequence alignments for each family were trimmed with trimAl (v.1.5.rev.0), and the best-fit amino acid substitution models were selected using ModelTest-NG (v.0.1.7). Phylogenetic trees were constructed by IQ-TREE (v.2.1.2) with 1,000 ultrafast bootstrap replicates. For synteny analysis for each gene family, we used OrthoFinder (v.2.5.4) to identify orthologous clusters among pea and related legumes (for example, Vicia sativa, Medicago truncatula, Cicer arietinum, Lotus japonicus, Vigna radiata, Phaseolus vulgaris and Glycine max), and visualized collinearity blocks with JCVI (v.1.2.7).<\/p>\n<p>Gene functional validation experimentsFast neutron mutants<\/p>\n<p>Several Fast Neutron mutants from a population described by Domoney et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 76\" title=\"Domoney, C. et al. Exploiting a fast neutron mutant genetic resource in Pisum sativum (pea) for functional genomics. Funct. Plant Biol. 40, 1261 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR76\" id=\"ref-link-section-d261742149e3969\" target=\"_blank\" rel=\"noopener\">76<\/a>, were included in this project. These were: FN1453\/1sil-like; FN1091\/4 lacking axil ring pigmentation, allelic to d; FN1218\/6 lacking axil ring pigmentation, allelic to d; FN2026\/7coch2 candidate; FN2073\/5 lacking axil ring pigmentation, not allelic to d; and FN2076\/5VicA FN deletion line.<\/p>\n<p>Crosses were made between pairs of lines lacking axil ring pigmentation (Supplementary Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM1\" target=\"_blank\" rel=\"noopener\">32<\/a>) to test for complementation. Where possible, vegetative phenotypes were used to identify F1 progeny plants, and those accessions allelic, or non-allelic, to d were identified by the absence, or presence of pigmented axil rings, respectively.<\/p>\n<p>Complementation test<\/p>\n<p>A reverse genetics screen for the ChlG gene in pea was carried out in an ethane methane sulfonate-mutagenised targeting induced local lesions in genomes (TILLING) population in background Cam\u00e9or<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 26\" title=\"Dalmais, M. et al. Utilldb, a Pisum sativum in silicoforward and reverse genetics tool. Genome Biol. 9, R43 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR26\" id=\"ref-link-section-d261742149e4011\" target=\"_blank\" rel=\"noopener\">26<\/a>. Line 411.1, with a G&gt;A mutation 1,900 bp after the ATG, resulting in a W121* nonsense mutation, was identified. Eight M4 seeds were sown and seedlings were genotyped with a cleaved amplified polymorphic sequence (CAPS) marker (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM4\" target=\"_blank\" rel=\"noopener\">47<\/a>). No seedlings were homozygous mutants (signified by a single undigested 1,125-bp band), 6 were heterozygous (signified by 3 bands of sizes 1,125\u2009bp, 699\u2009bp and 426\u2009bp) and 2 were homozygous wild type (signified by 2 bands of sizes 699\u2009bp and 426\u2009bp). A complementation test was carried out by crossing heterozygous seedlings with a homozygous JI0015 gp mutant (13 crosses with male JI0015 and F1 identified by long internodes, and 3 crosses with female JI0015 and F1 identified by yellow cotyledons). Nine out of 16 F1 progeny plants had yellow pods, indicating non-complementation.<\/p>\n<p>Virus-induced gene silencing<\/p>\n<p>VIGS in peas was performed using a published methodology as described previously<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 77\" title=\"Gr&#xF8;nlund, M., Olsen, A., Johansen, E. I. &amp; Jakobsen, I. Protocol: using virus-induced gene silencing to study the arbuscular mycorrhizal symbiosis in Pisum sativum. Plant Methods 6, 28 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR77\" id=\"ref-link-section-d261742149e4034\" target=\"_blank\" rel=\"noopener\">77<\/a>. To target genes of interest, a 200\u2013500\u2009bp fragment from the CDS region of each gene were amplified. The primers for VIGS constructs, including VIGS-PsChlG, VIGS-PsOs1, VIGS-PsMYB26 and PsMYB16, are provided in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">47<\/a>. SpeI\/XbaI and EcoRI were used to linearize the pCAPE2 vector, which was kindly provided by Li et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 78\" title=\"Li, X. et al. BIGGER ORGANS and ELEPHANT EAR-LIKE LEAF1 control organ size and floral organ internal asymmetry in pea. J. Exp. Bot. 70, 179&#x2013;191 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR78\" id=\"ref-link-section-d261742149e4063\" target=\"_blank\" rel=\"noopener\">78<\/a>, and corresponding fragments of gene targets were ligated into the vector to construct the vectors for VIGS assays. For VIGS-PsChlG, the negative control vector, pCAPE2-Con, was constructed in the same way by replacing the PsChlG fragment in pCAPE2-PsChlG with a 529-bp insert derived from a cDNA fragment of Bean Yellow Mosaic Virus (GenBank accession AJ622899). The positive control vector, pCAPE2-PDS, targeting the phytoene desaturase gene, was also provided by Li et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 78\" title=\"Li, X. et al. BIGGER ORGANS and ELEPHANT EAR-LIKE LEAF1 control organ size and floral organ internal asymmetry in pea. J. Exp. Bot. 70, 179&#x2013;191 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR78\" id=\"ref-link-section-d261742149e4077\" target=\"_blank\" rel=\"noopener\">78<\/a>. These vectors were transferred into Agrobacterium tumefaciens (GV3101) (Shanghai Weidi Biotechnology) and VIGS assays carried out following the protocol described by Constantin et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 79\" title=\"Constantin, G. D. et al. Virus&#x2010;induced gene silencing as a tool for functional genomics in a legume species. Plant J. 40, 622&#x2013;631 (2004).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR79\" id=\"ref-link-section-d261742149e4084\" target=\"_blank\" rel=\"noopener\">79<\/a>. In brief, Agrobacterium strains carrying these vectors were shaken separately until OD600\u2009=\u20091.2, followed by the collection and resuspension of the bacteria in injection buffer (NaCl: 10\u2009mM, CaCl2: 10\u2009mM, acetosyringone: 0.1\u2009mM) to a concentration of OD600\u2009=\u20091.2. After resting for 2\u20133\u2009h, the solution of pCAPE2-target gene, pCAPE2-PDS (positive control), and pCAPE2-Con (negative control) was mixed with pCAPE1, separately, in equal proportions, and injected into 10-day-old compound leaves of the acceptant lines (Yunnan2070 or JI1995). Specifically, pCAPE1 and pCAPE2 are plasmid vectors used to induce gene silencing in plants such as M. truncatula and P. sativum. After 24\u2009h of darkness, they were transferred to long day conditions. New leaves of positive control plants bleached in about 10 days, indicating successful silencing of PDS. VIGS was employed with the same procedure for PsMYB16 gene within the D locus and VIGS-MYB26 for the V candidate gene. For PsOs1, which is described in detail below, all the gene-specific primers used for VIGS constructs are listed in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">47<\/a>.<\/p>\n<p>Transformation, gene overexpression and silencing of PsOs1<\/p>\n<p>The PsOs1 coding sequence of JI0074 was amplified (primers listed in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">47<\/a>) and integrated into the pCAMBIA1305 vector, resulting in the pCAMBIA1305-PsOs1JI0074 construct. The plasmid was then introduced into A. tumefaciens GV3101, which was subsequently employed to transform A. thaliana (Col-0) via the floral dip technique. T3 generation homozygous transgenic Arabidopsis lines were selected for measurement of thousand-seed weight and the dimensions of elongated siliques.<\/p>\n<p>GUS staining, GFP fluorescence observations and flow cytometry<\/p>\n<p>The pCAMBIA1305-PsOs1JI1995 vector was constructed using the same methodology, with primers detailed in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">47<\/a>. Both vectors, pCAMBIA1305-PsOs1JI1995 and pCAMBIA1305-PsOs1JI0074, were introduced into the A. tumefaciens strain GV3101. In these experiments, H2B-mCherry served as a nucleus marker. The agrobacteria were resuspended and infiltrated into Nicotiana benthamiana leaf epidermal cells using an infiltration buffer consisting of 10\u2009mM MES (pH 5.6), 10\u2009mM MgCl2, and 150\u2009\u03bcM acetosyringone, at an OD600 of 0.8. Fluorescence was observed 48\u2009h after infiltration using a confocal laser-scanning microscope.<\/p>\n<p>To compare the promoter activities of JI0074 and JI1995 alleles of PsOs1, we cloned sequences 3,000-bp upstream of the coding region and inserted them into pCAMBIA1300-GUS, resulting in the constructs ProJI0074-GUS and ProJI1995-GUS. These were expressed in Nicotiana tabacum leaves and subsequently stained using a GUS Staining Kit (Coolaber Biotech). GUS activity was quantified using the GUS Gene Quantitative Detection Kit (Coolaber Biotech). For a detailed examination of PsOs1 expression patterns in Arabidopsis, various Arabidopsis tissues were sampled from ProJI0074-GUS transgenic plants. After\u00a0ethanol decolourization, observations and photographs were taken under a microscope. Details of the primers used are provided in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">47<\/a>.<\/p>\n<p>Intact nuclei from pea pods were isolated using LB01 lysis buffer (Coolaber Biotech), followed by RNA removal and subsequent propidium iodide staining. The nuclei were then quantified using a CytoFLEX flow cytometer. A minimum of 20,000 nuclei were counted for each sample, and each experiment was replicated at least three times. Data analysis was conducted using FLOWJO software, and representative images were presented. The endoreduplication index (EI) was calculated using the formula: EI\u2009=\u2009[(0\u2009\u00d7\u2009percentage of 2C nuclei)\u2009+\u2009(1\u2009\u00d7\u2009percentage of 4C nuclei)\u2009+\u2009(2\u2009\u00d7\u2009percentage of 8C muclei)\u2009+\u2009(3\u2009\u00d7\u2009percentage of 16C nuclei)\u2009+\u2009(4\u2009\u00d7\u2009percentage of 32C nuclei)]\/100.<\/p>\n<p>Yeast two-hybrid experiment<\/p>\n<p>Yeast two-hybrid assays were conducted according to the protocols outlined in the Yeast Protocols Handbook (Clontech). The CDS of PsCIK2\/3 was cloned into the bait plasmid pBT3-SUC, while the CDS of PsCLV1 or PsCLV2 was cloned into the prey plasmid pPR3-N. The primer sequences used for cloning are provided in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">47<\/a>. These plasmids were co-transformed into the yeast strain NMY51 in different combinations. Transformants were initially screened on SD\/-Trp\/-Leu medium to confirm successful co-transformation. Interaction assays were then performed on SD\/-Trp\/-Leu\/-His\/-Ade medium containing the chromogenic substrate X-\u03b1-Gal at 30\u2009\u00b0C to detect protein-protein interactions.<\/p>\n<p>Anatomical studies and TEM<\/p>\n<p>Confocal images were collected with Leica TCS SP8 confocal laser-scanning microscope (Leica). After sampling, the shoot apices of Cam\u00e9or and fa mutant line JI0814, the young leaves (gp\/gp, JI2366; Gp\/Gp JI0817), and the pod walls of JI0074 and JI1995 were immediately preserved in formaldehyde\/alcohol\/acetic acid fixative. Paraffin sectioning was performed following established methodologies. Staining was conducted using safranin and fast green (JI0074 and JI1995) and toluidine blue (Cam\u00e9or and JI0814). Prepared slides were scanned using a NanoZoomer, and cell quantification was carried out using NDP.view2 software. For pods, the resin block was sliced at 60\u201380\u2009nm on an ultrathin slicer, and the slices were picked up on a 150-mesh copper mesh. The copper mesh was stained with a 2% uranyl acetate saturated alcohol solution in the dark for 8\u2009min; washed 3 times with 70% alcohol; washed 3 times with ultrapure water; stained with a 2.6% lead citrate solution in the dark for 8\u2009min; washed 3 times with ultrapure water, and slightly dried with filter paper. The copper mesh sections were placed in a copper mesh box and dried overnight at room temperature. The observation was under a transmission electron microscope and images collected for analysis.<\/p>\n<p>For TEM studies, pea leaflets and pods (18 days after flowering) were removed from BC3 S2 gp\/gp and Gp\/Gp plants, after 9\u2009h of daylight. Tissue (1\u2009mm2) pieces were placed in a solution of 2.5% (v\/v) glutaraldehyde in 0.05\u2009M sodium cacodylate, pH 7.3 for fixation. Samples were left overnight at room temperature and then processed for embedding (Leica EM TP embedding machine) by washing out the fixative with three successive 15-min washes in 0.05\u2009M sodium cacodylate, followed by fixation in 1% (w\/v) OsO4 in 0.05\u2009M sodium cacodylate for 2\u2009h at room temperature. After three 15\u2009min washes in distilled water, samples were dehydrated in an ethanol series (30%, 50%, 70%, 95% and two changes of 100% ethanol), then infiltrated with LR White resin (London Resin Company) by successive changes of resin:ethanol mixes at room temperature (1:1 for 1\u2009h, 2:1 for 1\u2009h, 3:1 for 1\u2009h, 100% resin for 1\u2009h, then 100% resin for 16\u2009h, and 100% resin for a further 8\u2009h). Samples were polymerized in LR White resin at 60 \u00b0C for 16\u2009h, then sectioned with a diamond knife (Leica UC7 ultramicrotome). Ultrathin sections (approximately 90\u2009nm) were placed on 200 mesh formvar and carbon-coated copper grids (Agar Scientific). Sections were stained with 2% (w\/v) uranyl acetate for 1\u2009h and 1% (w\/v) lead citrate for 1\u2009min, washed in distilled water and air dried. Grids were viewed in a FEI Talos 200\u2009C transmission electron microscope (FEI) at 200\u2009kV and imaged using a Gatan OneView 4K\u2009\u00d7\u20094K digital camera (Gatan) to record DM4 files.<\/p>\n<p>In situ hybridization<\/p>\n<p>Tissues were rinsed with PBS and immediately placed in the in situ hybridization fixative solution for more than 12\u2009h. Paraffin section preparation and in situ hybridization of the probes were performed according to standard protocols<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 80\" title=\"Brewer, P. B., Heisler, M. G., Hej&#xE1;tko, J., Friml, J. &amp; Benkov&#xE1;, E. In situ hybridization for mRNA detection in Arabidopsis tissue sections. Nat. Protoc. 1, 6 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR80\" id=\"ref-link-section-d261742149e4297\" target=\"_blank\" rel=\"noopener\">80<\/a>. The sequences of the digoxigenin (DIG)-labelled antisense riboprobes used for in situ hybridization are provided in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">47<\/a>. The hybridization signal was detected with an alkaline phosphatase-conjugated anti-DIG antibody (200-052-156, Jackson ImmunoResearch). Finally, images were obtained and analysed using a Pannoramic MIDI digital slide scanner (3DHISTECH).<\/p>\n<p>RNA-seq, iso-seq and gene expressionRNA extraction and pea transcriptome<\/p>\n<p>We built a transcriptome atlas from the reference line Cam\u00e9or (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">30<\/a>), and selected various accessions that display the contrasting pairs of traits studied here (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM5\" target=\"_blank\" rel=\"noopener\">31<\/a>). In China, plant tissues (seed, root, nodule, leaflet, stem, flower, pod, stipule, tendril and apical bud) at different development stages (seedling, flowering and podding) were collected and fixed in Trizol before RNA extraction. Tissues were ground in liquid nitrogen and the FastPure Universal Plant Total RNA Isolation Kit (Vazyme Biotech) was used to extract total RNA, the quality of which was assessed by gel electrophoresis. For each sample, we performed short-read RNA-sequencing using the DNBSEQ Platform at BGI Group to generate 6\u20138\u2009Gb raw RNA reads for each accession.<\/p>\n<p>At JIC, RNA was prepared from young developing pods (flat pod stage, ~60\u201370\u2009mm in length) of each of the parental and RI lines derived from the cross between JI0015 (gp\/gp) and JI0399 (Gp\/Gp). Developing seeds were removed from the pods, which were then rapidly frozen in liquid nitrogen. High-quality RNA lacking genomic DNA was extracted from 97 individual pod samples, using a Spectrum Plant Total RNA Kit (Sigma-Aldrich), and used for PCR with reverse transcription and RNA-seq experiments focussed on the identification and characterisation of gene candidates for gp. For the latter analysis, green-podded and yellow-podded RILs (95 in total) were assigned to three groups for each phenotype, ensuring that lines with contrasting plant phenotypes (e.g. plant height) were randomly distributed among the replicate groups (G1, G2 and G3 for green-podded RILs; Y1, Y2 and Y3 for yellow-podded RILs, with 15\u201317 RILs per pool). Equal amounts of RNA from every line within a group were pooled. RNA-seq (Illumina HiSeq4000) and initial bioinformatic analyses were carried out by the Earlham Institute.<\/p>\n<p>We performed Iso-seq sequencing for a subset of accessions for the target organ at specific developmental stages. We used the Iso-Seq (v.4.0.0) pipeline to process PacBio SMRT Cell subreads and generate high-quality, full-length transcripts. First, subreads from each SMRT Cell were processed with ccs (v.3.4.1) to produce one circular consensus sequence per zero-mode waveguide, applying a minimum read quality of 0.9. Next, primer removal and demultiplexing were performed with lima (v.2.9.0) in Iso-Seq mode, removing unwanted primer combinations and orienting reads from 5\u2032 to 3\u2032. Full-length reads were then refined by trimming poly(A) tails and removing concatemers. When multiple SMRT cells were sequenced, the resulting full-length non-concatemer (FLNC) BAM files were merged before clustering. Iso-Seq cluster (v.4.0.0) was applied to produce polished consensus transcripts, partitioned into high-quality (HQ) and low-quality (LQ) sets on the basis of\u00a0predicted accuracy. The final consensus transcripts were aligned to the ZW6 and JI0074 reference genomes using pbmm2 (v.1.14.99). Last, iso-seq collapse was used to collapse redundant isoforms and generate GFF files, which were converted to GFF3 with gffread (0.12.7) for downstream analyses.<\/p>\n<p>RT\u2013qPCR<\/p>\n<p>RT\u2013qPCR was conducted to analyse gene expression. Total RNA was extracted using the FastPure Plant Total RNA Isolation Kit (Vazyme Biotech) following the manufacturer\u2019s instructions, including an on-column DNase I digestion step to remove genomic DNA. Subsequently, 1\u2009\u03bcg of RNA was used for cDNA synthesis with the All-in-One First-Strand cDNA Synthesis SuperMix for qPCR kit (TransGen Biotech). Green qPCR SuperMix kit (TransGen Biotech) was used for amplification on a CFX384TM Real-Time System (Bio-Rad). The method of collecting plant material for the detection of P\/PsCLE41 and V\/PsMYB26 was as follows: (1) pods were collected 10 days after flowering. After removing the seeds, the pods were cut into pieces and frozen in liquid nitrogen; (2) apical buds were collected 14 days after emergence and, after removing the extra young leaves under a microscope, the buds were cut into pieces and frozen in liquid nitrogen; (3) stems were collected 14 days after emergence at the third node from the top of the stem, cut into pieces and frozen in liquid nitrogen; (4) seed cotyledons were collected 12 days after flowering; after removing the testa, the cotyledons were cut into pieces and frozen in liquid nitrogen.<\/p>\n<p>To validate the Gp transcriptional fusion and aberrant transcripts, the total RNA was reverse transcribed to cDNA using HiScript III First Strand cDNA Synthesis Kit (+gDNA wiper, Vazyme Biotech). RT\u2013qPCR analysis was conducted using Taq Pro Universal SYBR qPCR Master Mix (Vazyme Biotech), employing specific primers, with PsACTIN serving as the internal standard. Expression levels of genes were quantified relative to the control based using the 2\u2212\u0394\u0394CT method. All RT\u2013qPCR results represent the mean\u2009\u00b1\u2009s.d. from three separate biological experiments. The primers used for RT\u2013qPCR primers used are provided in Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM4\" target=\"_blank\" rel=\"noopener\">47<\/a>.<\/p>\n<p>DNA methylation sequencing<\/p>\n<p>Bisulfite treatment and libraries were prepared accordingly to the standard protocol of methylation library construction. PE150 sequencing was performed using Illumina NovaSeq X Plus sequencing platform 25B chip. Quality control and adapter trimming of the raw whole-genome bisulfite sequencing (WGBS) data was then performed using Trimmomatic (v.0.39). The resulting WGBS reads were mapped to the JI0074 reference genome using Bismark (v.0.23.0) and PCR duplicates were removed from the aligned reads.<\/p>\n<p>Statistical methodsGeneral statistical analysis<\/p>\n<p>Statistical analyses were conducted in R software suite (v.4.2; <a href=\"https:\/\/www.r-project.org\/\" target=\"_blank\" rel=\"noopener\">https:\/\/www.r-project.org\/<\/a>) unless otherwise stated. The correlation between different traits was tested by calculating the coefficients of Pearson correlation, as well as the P values, using the cor.test package, with the method set to \u2018Pearson\u2019 for the correlation analyses between quantitative traits. Traits collected at different locations and in different years were analysed by calculating their rank correlations by setting the option \u2018method\u2019 to \u2018Spearman\u2019. The correlation between qualitative traits was assessed using the chi-square test using the \u2018chisq.test\u2019 package in R. Gene expression levels in different lines or organs under different treatments or at different developmental stages (at least three biological replicates for each sample) were analysed using DESeq2\u00a0(ref.\u00a0<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 81\" title=\"Love, M. I., Huber, W. &amp; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR81\" id=\"ref-link-section-d261742149e4397\" target=\"_blank\" rel=\"noopener\">81<\/a>), in which the genes with a false discovery rate (Bonferroni) lower than 0.01 were defined as significantly regulated genes, unless there is an alternative explanation in a specific legend. The min\u2013max scaling (normalization) approach was used to calculate the expression level for comparison of each gene across stages and organs by the formula: Xscaled\u2009=\u2009(X\u2009\u2212\u2009Xmin)\/(Xmax\u2009\u2212\u2009Xmin).<\/p>\n<p>PCA (main text Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#Fig1\" target=\"_blank\" rel=\"noopener\">1<\/a>) was performed on the PLINK distance matrix using an Excel add-in downloaded from RIKEN, now available at <a href=\"https:\/\/systemsomicslab.github.io\/compms\/others\/main.html#Statistics\" target=\"_blank\" rel=\"noopener\">https:\/\/systemsomicslab.github.io\/compms\/others\/main.html#Statistics<\/a>.<\/p>\n<p>Statistics and reproducibility<\/p>\n<p>All experiments were designed with explicit consideration of statistical power and reproducibility. Each experiment was independently repeated at least three times (biological replicates) with consistent results across repetitions, including microscopy analyses used at least three samples or technical replicates per experimental condition (such as Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#Fig3\" target=\"_blank\" rel=\"noopener\">3b,c,g<\/a> and Extended Data Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#Fig8\" target=\"_blank\" rel=\"noopener\">4a\u2013c,e<\/a> and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#Fig10\" target=\"_blank\" rel=\"noopener\">6b,h,i<\/a>); phenotypic observations were validated across \u22653 independent growth cycles (Norwich, Shenzhen, Harbin, multiple sites and multiple years). Micrographs shown are representative of at least three independent experiments. Complete protocols and raw data supporting the conclusions are available in the Data availability and <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM1\" target=\"_blank\" rel=\"noopener\">Supplementary Information<\/a>. Data are presented as mean\u2009\u00b1\u2009s.e.m., and statistical significance was determined using two-tailed Students\u2019 t-tests (\u2018t.test\u2019 package in R software v.4.2) in the analyses of the phenotypes, such as seed weight and pod width, between different accessions.<\/p>\n<p>Population structure analysis<\/p>\n<p>The core high-quality SNP dataset was used for population structural analyses. PCA and t-distributed stochastic neighbour embedding analyses were first performed using beta Python modules sklearn.decomposition and sklearn.manifold. ADMIXTURE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 82\" title=\"Alexander, D. H., Novembre, J. &amp; Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655&#x2013;1664 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR82\" id=\"ref-link-section-d261742149e4468\" target=\"_blank\" rel=\"noopener\">82<\/a> (version 1.3.0) was employed to analyse the population structure, with K increasing from 2 to 16. The Q value represents the estimated proportion of an individual\u2019s genome that originates from each inferred ancestral population cluster.<\/p>\n<p>Genetic differentiation (Fst) and nucleotide diversity (\u03c0) were calculated with VCFtools (version 0.1.13). Fst scores were calculated within nonoverlapping 100-kb windows and \u03c0 was calculated for each individual site and averaged across the genome for each group. LD was calculated on SNP pairs within a 500-kb window using PopLDdecay<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 83\" title=\"Zhang, C., Dong, S.-S., Xu, J.-Y., He, W.-M. &amp; Yang, T.-L. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 35, 1786&#x2013;1788 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR83\" id=\"ref-link-section-d261742149e4489\" target=\"_blank\" rel=\"noopener\">83<\/a> (version 3.31; <a href=\"https:\/\/github.com\/BGI-shenzhen\/PopLDdecay\" target=\"_blank\" rel=\"noopener\">https:\/\/github.com\/BGI-shenzhen\/PopLDdecay<\/a>) and the decay was measured by the distance at which the Pearson\u2019s correlation efficient (r2) dropped to half of the maximum. Splits tree analysis of the PLINK distance matrix was performed using SplitsTree4\u00a0(ref.\u00a0<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 61\" title=\"Huson, D. H. &amp; Bryant, D. Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 23, 254&#x2013;267 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#ref-CR61\" id=\"ref-link-section-d261742149e4505\" target=\"_blank\" rel=\"noopener\">61<\/a>).<\/p>\n<p>Germplasm availability<\/p>\n<p>All the germplasm described and used in this work is available to order from the John Innes Centre Germplasm Resources Unit (<a href=\"https:\/\/www.seedstor.ac.uk\/\" target=\"_blank\" rel=\"noopener\">https:\/\/www.seedstor.ac.uk\/<\/a>). The 697 sequenced single seed descent lines derived from John Innes Pisum Germplasm accessions, were imported in 2019 to the Agricultural Genomics Institute at Shenzhen (AGIS), Chinese Academy of Agricultural Sciences (CAAS), where they have been grown and phenotyped annually since then.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41586-025-08891-6#MOESM2\" target=\"_blank\" rel=\"noopener\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"Plant materials and methodsGermplasm panel A total of 697 accessions, maximizing genetic diversity, were selected from the JI&hellip;\n","protected":false},"author":2,"featured_media":53775,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3846],"tags":[12369,267,7189,3965,3966,28662,28663,70,16,15],"class_list":{"0":"post-53774","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-agricultural-genetics","9":"tag-genetics","10":"tag-genome-wide-association-studies","11":"tag-humanities-and-social-sciences","12":"tag-multidisciplinary","13":"tag-natural-variation-in-plants","14":"tag-plant-genetics","15":"tag-science","16":"tag-uk","17":"tag-united-kingdom"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/114407870519293166","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/53774","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=53774"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/53774\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/53775"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=53774"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=53774"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=53774"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}