{"id":409614,"date":"2025-09-09T05:13:15","date_gmt":"2025-09-09T05:13:15","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/409614\/"},"modified":"2025-09-09T05:13:15","modified_gmt":"2025-09-09T05:13:15","slug":"multiancestry-brain-pqtl-fine-mapping-and-integration-with-genome-wide-association-studies-of-21-neurologic-and-psychiatric-conditions","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/409614\/","title":{"rendered":"Multiancestry brain pQTL fine-mapping and integration with genome-wide association studies of 21 neurologic and psychiatric conditions"},"content":{"rendered":"<p>Ethics<\/p>\n<p>Our study complies with all relevant ethical regulations and was approved by the institutional review boards at University of California, Davis; Rush University; the Mayo Clinic; Mount Sinai University; Emory University and Banner Sun Health Research Institute.<\/p>\n<p>Genetic data<\/p>\n<p>Genetic data were generated from blood or brain tissue using either array-based genotyping or whole-genome sequencing (WGS) as described previously<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"De Jager, P. L. et al. A multi-omic atlas of the human frontal cortex for aging and Alzheimer&#x2019;s disease research. Sci. Data 5, 180142 (2018).\" href=\"#ref-CR19\" id=\"ref-link-section-d105321419e5506\">19<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Wang, M. et al. The Mount Sinai cohort of large-scale genomic, transcriptomic and proteomic data in Alzheimer&#x2019;s disease. Sci. Data 5, 180185 (2018).\" href=\"#ref-CR20\" id=\"ref-link-section-d105321419e5506_1\">20<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" title=\"Wingo, A. P. et al. 
Integrating human brain proteomes with genome-wide association data implicates new proteins in Alzheimer&#x2019;s disease pathogenesis. Nat. Genet. 53, 143&#x2013;146 (2021).\" href=\"#ref-CR21\" id=\"ref-link-section-d105321419e5506_2\">21<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Joseph, S. R. et al. Bridging the Gap: multi-omics profiling of brain tissue in Alzheimer&#x2019;s disease and older controls in multi-ethnic populations. Alzheimers Dement. 20, 7174&#x2013;7192 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR22\" id=\"ref-link-section-d105321419e5509\" target=\"_blank\" rel=\"noopener\">22<\/a>. New to this study was WGS from 181 AA, 168 Hispanic and 292 NHW individuals whose raw Illumina 150-bp reads were mapped to GRCh38 to a median depth of 30\u00d7 with PEMapper<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Johnston, H. R. et al. PEMapper and PECaller provide a simplified approach to whole-genome sequencing. Proc. Natl Acad. Sci. USA 114, E1923 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR23\" id=\"ref-link-section-d105321419e5513\" target=\"_blank\" rel=\"noopener\">23<\/a>. Genetic variants were jointly called across samples with PECaller (v.2.0.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Johnston, H. R. et al. PEMapper and PECaller provide a simplified approach to whole-genome sequencing. Proc. Natl Acad. Sci. 
USA 114, E1923 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR23\" id=\"ref-link-section-d105321419e5517\" target=\"_blank\" rel=\"noopener\">23<\/a> using default parameters.<\/p>\n<p>Quality control of genetic data<\/p>\n<p>We prioritized use of WGS over genotyping where available and performed quality control of WGS and genotyping data separately using PLINK<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Purcell, S. et al. PLINK: a toolset for whole-genome association and population-based linkage analysis. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR24\" id=\"ref-link-section-d105321419e5528\" target=\"_blank\" rel=\"noopener\">24<\/a>. Participants with overall genotyping missingness &gt;10% or sex mismatch were excluded. Variants with evidence of deviation from Hardy\u2013Weinberg equilibrium (P\u2009&lt;\u200910<sup>\u22127<\/sup>) or missing genotype rate &gt;5% were removed. Genotype data in GRCh37 were converted to GRCh38. We followed the quality-control steps provided by the TOPMed pipeline<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 25\" title=\"Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284&#x2013;1287 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR25\" id=\"ref-link-section-d105321419e5537\" target=\"_blank\" rel=\"noopener\">25<\/a> for genetic data before imputation. Imputation was performed using the TOPMed imputation panel and server with default parameters<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 25\" title=\"Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 
48, 1284&#x2013;1287 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR25\" id=\"ref-link-section-d105321419e5541\" target=\"_blank\" rel=\"noopener\">25<\/a>. SNPs with imputation quality R2\u2009&gt;\u20090.3 were retained.<\/p>\n<p>For WGS, we assessed the coverage, missingness, ratio between the number of transition mutations and the number of transversion mutations, and the silent\/replacement ratio (the ratio between synonymous and nonsynonymous mutations). We merged the imputed genotyping data and WGS data and performed further quality control in each population (AA, Hispanic and NHW) separately by excluding variants with any of the following criteria: genotype missing rate &gt;5%, MAF\u2009P\u2009\u22127.<\/p>\n<p>Related individuals were identified using KING (v.2.2.2)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 26\" title=\"Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867&#x2013;2873 (2010).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR26\" id=\"ref-link-section-d105321419e5561\" target=\"_blank\" rel=\"noopener\">26<\/a>, and one in each pair of individuals who were second-degree or closer relatives was randomly removed. Individuals who were population outliers were identified and removed using EIGENSTRAT from EIGENSOFT (v.8.0.0)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 27\" title=\"Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 
38, 904&#x2013;909 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR27\" id=\"ref-link-section-d105321419e5565\" target=\"_blank\" rel=\"noopener\">27<\/a>.<\/p>\n<p>Benchmarking ancestry and\/or ethnicity<\/p>\n<p>All participants with individual-level genetic data were divided into three populations based on self-report: AA, Hispanic and NHW. Within each self-report population, we removed outliers based on genetic ancestry by performing multidimensional scaling analysis using individual-level genetic data and phase 3 1000 Genomes data as in ref. <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 28\" title=\"Auton, A. et al. A global reference for human genetic variation. Nature 526, 68&#x2013;74 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR28\" id=\"ref-link-section-d105321419e5577\" target=\"_blank\" rel=\"noopener\">28<\/a>. Self-defined Hispanic study participants were combined with the MXL (Mexican ancestry in Los Angeles, CA), PUR (Puerto Rican), CLM (Colombian) and PEL (Peruvian) reference panel populations. Self-defined NHW study participants were combined with the CEU (Utah residents with northern and western European ancestry), TSI (Toscani in Italy), IBS (Iberian population in Spain) and FIN (Finnish) reference panel populations. Self-reported AA participants were combined with the ASW (African ancestry in southwest USA) population. Within each of the three populations, SNPs were filtered to meet the criteria MAF\u2009&gt;\u20095%, data missingness P\u2009\u22127 using an independent pairwise linkage filter window of 50\u2009kb at 5\u2009kb steps and an r2 threshold of 0.15. We removed AT, TA, GC and CG markers. 
Then, for each group, multidimensional scaling analysis was performed using PLINK (v.1.90b53)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Purcell, S. et al. PLINK: a toolset for whole-genome association and population-based linkage analysis. Am. J. Hum. Genet. 81, 559&#x2013;575 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR24\" id=\"ref-link-section-d105321419e5590\" target=\"_blank\" rel=\"noopener\">24<\/a>, and results were visualized along principal components 1 and 2 using R. For each population, samples that were six or more standard deviations from the mean of any reference panel population were considered outliers and removed (42 in the NHW population, 4 in the AA population and 4 in the Hispanic population).<\/p>\n<p>LD panel for each population for PWASs and PMR-Egger<\/p>\n<p>We generated an LD panel for integration of GWASs and brain proteomic and genetic data for each population separately. For NHW, we used the LD reference panel from the 1000 Genomes European data provided in the FUSION pipeline<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\" title=\"Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 48, 245&#x2013;252 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR3\" id=\"ref-link-section-d105321419e5602\" target=\"_blank\" rel=\"noopener\">3<\/a>. For African ancestry, we constructed an LD panel using the AFR HapMap SNP sites in the AFR 1000 Genomes data<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 28\" title=\"Auton, A. et al. A global reference for human genetic variation. 
Nature 526, 68&#x2013;74 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR28\" id=\"ref-link-section-d105321419e5606\" target=\"_blank\" rel=\"noopener\">28<\/a>. Likewise, for the Hispanic population, we constructed an LD panel using the AMR HapMap SNPs in the AMR 1000 Genomes data<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 28\" title=\"Auton, A. et al. A global reference for human genetic variation. Nature 526, 68&#x2013;74 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR28\" id=\"ref-link-section-d105321419e5610\" target=\"_blank\" rel=\"noopener\">28<\/a>.<\/p>\n<p>Proteomic data<\/p>\n<p>Proteomic sequencing and database search<\/p>\n<p>Proteomic data were generated by collaborative efforts of the Accelerating Medicines Partnership: Alzheimer\u2019s Disease (AMP-AD)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 29\" title=\"Hodes, R. J. &amp; Buckholtz, N. Accelerating Medicines Partnership: Alzheimer&#x2019;s Disease (AMP-AD) Knowledge Portal aids Alzheimer&#x2019;s drug discovery through open data sharing. Expert Opin. Ther. Targets 20, 389&#x2013;391 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR29\" id=\"ref-link-section-d105321419e5627\" target=\"_blank\" rel=\"noopener\">29<\/a> and AMP-AD Diversity<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Joseph, S. R. et al. Bridging the Gap: multi-omics profiling of brain tissue in Alzheimer&#x2019;s disease and older controls in multi-ethnic populations. Alzheimers Dement. 
20, 7174&#x2013;7192 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR22\" id=\"ref-link-section-d105321419e5631\" target=\"_blank\" rel=\"noopener\">22<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 30\" title=\"Seifar, F. et al. Large-scale deep proteomic analysis in Alzheimer&#x2019;s disease brain regions across race and ethnicity. Alzheimers Dement. 20, 8878&#x2013;8897 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR30\" id=\"ref-link-section-d105321419e5634\" target=\"_blank\" rel=\"noopener\">30<\/a> involving multiple research sites. Brain samples were collected by Rush Alzheimer\u2019s Disease Center (n\u2009=\u2009815), the Mayo Clinic (n\u2009=\u2009399), Mount Sinai University Hospital (n\u2009=\u2009205), Emory University (n\u2009=\u2009129) and the Brain and Body Donation Program at Banner Sun Health (n\u2009=\u2009200) for proteomic studies as previously reported<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Wingo, A. P. et al. Sex differences in brain protein expression and disease. Nat. Med. 29, 2224&#x2013;2232 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR7\" id=\"ref-link-section-d105321419e5654\" target=\"_blank\" rel=\"noopener\">7<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Wingo, A. P. et al. Integrating human brain proteomes with genome-wide association data implicates new proteins in Alzheimer&#x2019;s disease pathogenesis. Nat. Genet. 
53, 143&#x2013;146 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR21\" id=\"ref-link-section-d105321419e5657\" target=\"_blank\" rel=\"noopener\">21<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Joseph, S. R. et al. Bridging the Gap: multi-omics profiling of brain tissue in Alzheimer&#x2019;s disease and older controls in multi-ethnic populations. Alzheimers Dement. 20, 7174&#x2013;7192 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR22\" id=\"ref-link-section-d105321419e5660\" target=\"_blank\" rel=\"noopener\">22<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 30\" title=\"Seifar, F. et al. Large-scale deep proteomic analysis in Alzheimer&#x2019;s disease brain regions across race and ethnicity. Alzheimers Dement. 20, 8878&#x2013;8897 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR30\" id=\"ref-link-section-d105321419e5663\" target=\"_blank\" rel=\"noopener\">30<\/a>. All donors or their next of kin provided informed consent, and the study was approved by the respective institutional review boards at all sites. AMP-AD generated data from samples collected by Rush and Banner Sun Health. AMP-AD-Diversity generated data from samples provided by Rush, the Mayo Clinic, Mount Sinai and Emory. Proteomes were profiled using tandem mass tag mass spectrometry as described in detail previously<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Wingo, A. P. et al. Integrating human brain proteomes with genome-wide association data implicates new proteins in Alzheimer&#x2019;s disease pathogenesis. Nat. Genet. 
53, 143&#x2013;146 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR21\" id=\"ref-link-section-d105321419e5667\" target=\"_blank\" rel=\"noopener\">21<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 30\" title=\"Seifar, F. et al. Large-scale deep proteomic analysis in Alzheimer&#x2019;s disease brain regions across race and ethnicity. Alzheimers Dement. 20, 8878&#x2013;8897 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR30\" id=\"ref-link-section-d105321419e5670\" target=\"_blank\" rel=\"noopener\">30<\/a>. Briefly, postmortem brain samples were homogenized, and proteins were digested with trypsin. Samples were randomized for sex, age, diagnosis, and race and\/or ethnicity and labeled with isobaric tandem mass tag peptides. All proteomic sequencing batches included at least one global internal standard (GIS). The GIS were created by aliquoting equal amounts of protein from each sample within a batch; these standards were then digested in parallel with other samples within the batch<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 31\" title=\"Johnson, E. C. B. et al. Large-scale proteomic analysis of Alzheimer&#x2019;s disease brain and cerebrospinal fluid reveals early changes in energy metabolism associated with microglia and astrocyte activation. Nat. Med. 26, 769&#x2013;780 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR31\" id=\"ref-link-section-d105321419e5674\" target=\"_blank\" rel=\"noopener\">31<\/a>. Next, high-pH fractionation was performed, and the resulting samples were analyzed by liquid chromatography coupled to tandem mass spectrometry. 
Raw files were searched using Fragpipe (v.19.0)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 32\" title=\"Kong, A. T., Leprevost, F. V., Avtonomov, D. M., Mellacheruvu, D. &amp; Nesvizhskii, A. I. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat. Methods 14, 513&#x2013;520 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR32\" id=\"ref-link-section-d105321419e5678\" target=\"_blank\" rel=\"noopener\">32<\/a>, MSFragger<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 32\" title=\"Kong, A. T., Leprevost, F. V., Avtonomov, D. M., Mellacheruvu, D. &amp; Nesvizhskii, A. I. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat. Methods 14, 513&#x2013;520 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR32\" id=\"ref-link-section-d105321419e5683\" target=\"_blank\" rel=\"noopener\">32<\/a> (v.3.5) and human proteome database Swiss-Prot<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 33\" title=\"Khoury, G. A., Baliban, R. C. &amp; Floudas, C. A. Proteome-wide post-translational modification statistics: frequency analysis and curation of the swiss-prot database. Sci. Rep. 1, 90 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR33\" id=\"ref-link-section-d105321419e5687\" target=\"_blank\" rel=\"noopener\">33<\/a> containing 20,402 sequences (downloaded 11 February 2019). Subsequently, we used Post-MSFragger (v.3.6) and Percolator<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 34\" title=\"K&#xE4;ll, L., Canterbury, J. 
D., Weston, J., Noble, W. S. &amp; MacCoss, M. J. Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat. Methods 4, 923&#x2013;925 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR34\" id=\"ref-link-section-d105321419e5691\" target=\"_blank\" rel=\"noopener\">34<\/a> (v.3.0.5) for peptide\u2013spectrum match validation and Philosopher (v.4.6.0) for protein inference using ProteinProphet (v.4.6.0) and filtering using FDR. The database search yielded a total of 11,748 protein groups from the dorsolateral prefrontal cortex.<\/p>\n<p>Quality control and normalization of proteomic data<\/p>\n<p>We examined proteomic assay precision via coefficient of variation (CV) analysis, using the 7 batches with \u22652 GIS (a total of 15 within-batch GIS). Lower CVs reflect lower measurement errors and higher precision. Based on protein abundance before normalization, we calculated the CV for each gene within each batch using the formula CV\u2009=\u2009s.d.(x)\/mean(x), where x is the protein abundance for each gene in the GIS samples within each batch. We found the CVs to have a median of 2.2, interquartile range of 1.8 and range of [0.23\u201323.1] among 7,675 proteins without any missing values, suggesting very high reproducibility. We performed quality control of proteomic data in the AMP-AD-Rush (n\u2009=\u2009619), AMP-AD-Banner (n\u2009=\u2009198) and AMP-AD-Diversity (n\u2009=\u20091,105; Rush, Mayo, Mount Sinai, Emory; Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#MOESM3\" target=\"_blank\" rel=\"noopener\">25<\/a>) datasets separately following our prior approach<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Wingo, A. P. et al. 
Sex differences in brain protein expression and disease. Nat. Med. 29, 2224&#x2013;2232 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR7\" id=\"ref-link-section-d105321419e5725\" target=\"_blank\" rel=\"noopener\">7<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Wingo, A. P. et al. Integrating human brain proteomes with genome-wide association data implicates new proteins in Alzheimer&#x2019;s disease pathogenesis. Nat. Genet. 53, 143&#x2013;146 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR21\" id=\"ref-link-section-d105321419e5728\" target=\"_blank\" rel=\"noopener\">21<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 35\" title=\"Wingo, T. S. et al. Brain proteome-wide association study implicates novel proteins in depression pathogenesis. Nat. Neurosci. 24, 810&#x2013;817 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR35\" id=\"ref-link-section-d105321419e5731\" target=\"_blank\" rel=\"noopener\">35<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 36\" title=\"Wingo, T. S. et al. Shared mechanisms across the major psychiatric and neurodegenerative diseases. Nat. Commun. 13, 4314 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR36\" id=\"ref-link-section-d105321419e5734\" target=\"_blank\" rel=\"noopener\">36<\/a>. Specifically, in each of the three datasets, we performed the following steps. First, we removed duplicate samples (only AMP-AD-Diversity had duplicate samples, and one of the pair was removed). Second, we removed proteins with missing values in more than 50% of the samples. 
Third, the protein abundance for each gene per sample was normalized using the total abundance of all the proteins in that sample to account for protein loading differences and then log2-transformed. Fourth, we performed iterative principal component analysis to remove sample outliers that were more than four standard deviations from the mean of either the first or the second principal component. Fifth, we performed linear regression to estimate and remove the effects of protein sequencing batch, PMI, age at death, sex and clinical diagnosis of cognitive status from the proteomic profiles. In the AMP-AD-Diversity dataset, a subset of the samples did not have PMI (n\u2009=\u2009149). Thus, we performed regression to remove unwanted technical and biological effects in the subset with PMI separately from the subset without PMI. Last, to enable comparisons across the three datasets (AMP-AD-Rush, AMP-AD-Banner and AMP-AD-Diversity), a Z-score transformation of the protein abundance for each protein to a mean of 0 and standard deviation of 1 was applied within each dataset. For proteins with multiple isoforms, we selected the most abundant isoform for investigation. Variance partition plots showing the contributions of technical and biological effects to the proteomic profiles before and after quality control and normalization for these three datasets are provided in Supplementary Figs. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#MOESM1\" target=\"_blank\" rel=\"noopener\">1<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#MOESM1\" target=\"_blank\" rel=\"noopener\">3<\/a>.<\/p>\n<p>Next, we combined the proteomic profiles from the three datasets and retained samples having both proteomic and genetic data. 
Then we performed additional quality control of the proteomic data in each population (AA, Hispanic and NHW) separately as follows. First, we retained proteins with nonmissing data in at least 50 individuals. Second, we performed surrogate variable analysis (SVA v.3.20.0)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 37\" title=\"Leek, J. T. &amp; Storey, J. D. Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet. 3, 1724&#x2013;1735 (2007).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR37\" id=\"ref-link-section-d105321419e5756\" target=\"_blank\" rel=\"noopener\">37<\/a> on the proteomic profile to assess potentially hidden confounding variables and regressed out the significant surrogate variables from the normalized proteomic profile. Last, Z-score transformation of the surrogate-variable-adjusted protein abundance for each protein was applied to the proteomic profile in each population. After quality control of genetic and proteomic data as described above, a total of 1,362 individuals remained for pQTL mapping. They comprised 181 AA (9,304 proteins), 168 Hispanic (9,036 proteins) and 1,013 NHW (9,725 proteins) individuals (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#MOESM3\" target=\"_blank\" rel=\"noopener\">24<\/a>).<\/p>\n<p>Population-stratified pQTL mapping<\/p>\n<p>pQTL mapping in each population was performed for SNPs present at MAF\u2009\u2265\u20090.05 in a population. We defined cis as being within 500\u2009kb up or downstream of the gene. 
We fitted an LMM with SNP as the independent variable and normalized protein expression as the outcome, adjusting for sex using GEMMA (v.0.98.1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 4\" title=\"Zhou, X. &amp; Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 44, 821&#x2013;824 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR4\" id=\"ref-link-section-d105321419e5778\" target=\"_blank\" rel=\"noopener\">4<\/a>. We chose an LMM as it could account for sample relatedness and population substructure. We calculated the P value using the Wald test in GEMMA. As described above, we regressed out the surrogate variables from the proteomic profile of each population before performing pQTL mapping. The genomic inflation factor (\u03bb) was calculated by dividing the median of the resulting chi-squared test statistics by the expected median of the chi-squared distribution<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 38\" title=\"Yang, J. et al. Genomic inflation factors under polygenic inheritance. Eur. J. Hum. Genet. 19, 807&#x2013;812 (2011).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR38\" id=\"ref-link-section-d105321419e5788\" target=\"_blank\" rel=\"noopener\">38<\/a>.<\/p>\n<p>Influence of allele frequency differences on population-specific pQTLs<\/p>\n<p>We investigated whether differences in pQTL detection were related to differences in allele frequencies between populations. Here we defined AA-only candidate pQTLs as those identified as pQTLs in AA that were not pQTLs in NHW and were found to have \u22641 minor allele in European populations in the 1000 Genomes WGS data. 
Among the 144,654 candidate pQTLs in AA from the population-stratified pQTL mapping, we found 14,239 AA-only candidate pQTLs, corresponding to 1,578 independent AA-only candidate pQTLs (after clumping at r2\u2009r2\u2009<\/p>\n<p>Fine-mapping<\/p>\n<p>Multiancestry pQTL fine-mapping was performed with MESuSiE (v.1.0)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 8\" title=\"Gao, B. &amp; Zhou, X. MESuSiE enables scalable and powerful multi-ancestry fine-mapping of causal variants in genome-wide association studies. Nat. Genet. 56, 170&#x2013;179 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR8\" id=\"ref-link-section-d105321419e5817\" target=\"_blank\" rel=\"noopener\">8<\/a>, which considers only proteins and SNPs that are present in all examined populations. We considered SNPs present at MAF\u2009&gt;\u20090.05 in each population. Inputs to MESuSiE included the summary statistics of the population-stratified pQTL mapping and LD matrix for each population. The LD matrix for each population was generated using the same individual-level genotyping and\/or WGS data used in the population-stratified pQTL mapping to ensure there was no mismatch between the population-stratified pQTL summary statistics and the LD matrix. We performed an LD mismatch check following MESuSiE guidelines. We set the L parameter (that is, the maximum number of possible credible sets) to the default value of 10. As we performed multiancestry pQTL fine-mapping in three populations, there were seven possible pQTL categories: (1) shared across AA, Hispanic and NHW; (2) shared between AA and Hispanic; (3) shared between AA and NHW; (4) shared between Hispanic and NHW; (5) specific to AA; (6) specific to Hispanic; and (7) specific to NHW. The output from MESuSiE included the PIP for each of these categories, referred to as the category.PIP. 
We then applied additional filters to the MESuSiE results to remove causal pQTLs with category assignments discordant with the population-specific pQTL results. Specifically, among causal pQTLs categorized as shared across populations, we filtered out those that were not nominally significant in each population (that is, P\u2009&gt;\u20090.05) or had discordant beta values. Likewise, for the causal pQTLs categorized as population-specific, we filtered out those that had population-stratified pQTL P\u2009<\/p>\n<p>pQTL fine-mapping in NHW with SuSiE<\/p>\n<p>To compare the size of the 95% credible sets between multiancestry pQTL fine-mapping and NHW-only pQTL fine-mapping, we performed pQTL fine-mapping in NHW with susieR (v.0.12.35)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 9\" title=\"Wang, G., Sarkar, A., Carbonetto, P. &amp; Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Series B 82, 1273&#x2013;1300 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR9\" id=\"ref-link-section-d105321419e5837\" target=\"_blank\" rel=\"noopener\">9<\/a> following its default pipeline. SuSiE was designed to identify as many credible sets as the data would support, each with as few variants as possible. For a given gene and its corresponding variants in the cis-regulatory region, the output was the number of credible sets that had 95% probability of containing a variant with nonzero causal effect. We set the maximum number of credible sets for a gene to be ten (the default value).<\/p>\n<p>Influence of sample size on population-shared and -specific pQTLs<\/p>\n<p>We downsampled the NHW proteomic dataset from 1,013 to 191 to make it comparable to the sample sizes for the AA and Hispanic populations. 
The smaller NHW group (EUR-2) had similar distributions of age, sex and PMI as the larger NHW group. We performed population-stratified pQTL mapping for each group (EUR-2 n\u2009=\u2009191; AA n\u2009=\u2009181; Hispanic n\u2009=\u2009168) and used the pQTL summary statistics as an input for multiancestry pQTL fine-mapping with MESuSiE.<\/p>\n<p>Genomic-site-type enrichment<\/p>\n<p>Genomic-site-type annotations for the 3,643,245 SNPs tested in all three populations in the population-stratified pQTL analyses were obtained using Bystro (v.2.0.0-beta1)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 39\" title=\"Kotlar, A. V., Trevino, C. E., Zwick, M. E., Cutler, D. J. &amp; Wingo, T. S. Bystro: rapid online variant annotation and natural-language filtering at whole-genome scale. Genome Biol. 19, 14 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR39\" id=\"ref-link-section-d105321419e5870\" target=\"_blank\" rel=\"noopener\">39<\/a>. SNPs were annotated with the following site types: nonsynonymous, synonymous, UTR or intronic. In each case, the annotation was assigned to a SNP if and only if it pertained to one or more of the genes tested for the SNP. Promoter overlap for each SNP was determined using the promoter-like candidate cis-regulatory elements from ENCODE<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Moore, J. E. et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 583, 699&#x2013;710 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR40\" id=\"ref-link-section-d105321419e5877\" target=\"_blank\" rel=\"noopener\">40<\/a>. 
Fisher\u2019s exact test was used to test enrichment of each site type in two sets of pQTLs: the multiancestry causal pQTLs from MESuSiE and the shared candidate pQTLs from the population-stratified pQTL analyses. As the shared candidate pQTLs included many SNPs in LD, a set of approximately independent SNPs was selected by applying the --clump function from PLINK to greedily select pQTLs with pairwise LD\u2009<\/p>\n<p>Cross-ancestry prediction of genetically regulated protein abundance<\/p>\n<p>Focusing on the 858 multiancestry causal pQTLs, we included SNPs in their corresponding 95% credible sets. Then, we computed the effect of these SNPs on protein abundance (also referred to as protein \u2018weights\u2019) using NHW genetic and proteomic data and multiple predictive models (top1, blup, lasso, enet and bslmm) following the FUSION framework<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\" title=\"Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 48, 245&#x2013;252 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR3\" id=\"ref-link-section-d105321419e5889\" target=\"_blank\" rel=\"noopener\">3<\/a>. We extracted the protein weights from the most predictive models. Next, we estimated the genetically regulated protein abundance for the pGenes corresponding to the multiancestry causal pQTLs using AA genotyping and the above protein weights. Subsequently, we calculated the R2 and P value for regression of the predicted AA proteomic expression on the measured AA proteomic expression to provide estimates of cross-ancestry prediction accuracy.<\/p>\n<p>In addition, we performed same-ancestry prediction following the pipeline described above using AA genetic and proteomic data for the pGenes corresponding to the 858 multiancestry causal pQTLs. 
Following the FUSION framework, we used all the SNPs corresponding to the pGenes for the 858 multiancestry causal pQTLs. The output provided cross-validation R2 values for the best-performing models. The R2 metric reflects the percentage of variance in protein abundance explained by the genetically regulated protein level, with higher values corresponding to more variance explained. To compare the cross-ancestry to the same-ancestry prediction, we took the ratio of the cross-ancestry and same-ancestry cross-validation R2 values.<\/p>\n<p>\u03c01 replication rate for pQTLs<\/p>\n<p>To determine the \u03c01 rate for the multiancestry causal pQTLs in each of the populations, we first estimated \u03c00, which is the proportion of true null hypotheses among a set of tests, using R package qvalue (v.2.15.0). In the case of our application to P values from the LMM tests, the null hypothesis was that the SNP had zero effect on protein expression. \u03c01, the proportion of true alternative hypotheses, was defined as 1\u2009\u2212\u2009\\({\\pi }_{0}\\).<\/p>\n<p>To compare the pQTLs and pGenes identified here in the NHW population with those from our previously published brain pQTL study using a smaller dorsolateral prefrontal cortex dataset<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 7\" title=\"Wingo, A. P. et al. Sex differences in brain protein expression and disease. Nat. Med. 29, 2224&#x2013;2232 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR7\" id=\"ref-link-section-d105321419e5976\" target=\"_blank\" rel=\"noopener\">7<\/a>, we extracted significant pQTLs and pGenes from each study (FDR\u2009\u03c01 rates using the current brain pQTLs as the discovery set and published plasma pQTLs as replication sets, we extracted the population-stratified brain pQTLs (at FDR\u2009\u03c00. 
In addition, we estimated \u03c01 for the multiancestry causal pQTLs (n\u2009=\u2009858) in each of the two plasma pQTL datasets in African and European ancestry populations separately following a similar procedure.<\/p>\n<p>To determine which pGenes were shared between brain and plasma, we focused on European ancestry owing to the higher sample sizes and statistical power. We defined pGenes as proteins with cis-pQTLs at FDR\u2009<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 41\" title=\"Ferkingstad, E. et al. Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet. 53, 1712&#x2013;1721 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR41\" id=\"ref-link-section-d105321419e6002\" target=\"_blank\" rel=\"noopener\">41<\/a>, ARIC<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 5\" title=\"Zhang, J. et al. Plasma proteome analyses in individuals of European and African ancestry identify cis-pQTLs and models for proteome-wide association studies. Nat. Genet. 54, 593&#x2013;602 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR5\" id=\"ref-link-section-d105321419e6006\" target=\"_blank\" rel=\"noopener\">5<\/a> and UKB<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 6\" title=\"Sun, B. B. et al. Plasma proteomic associations with genetics and health in the UK Biobank. Nature 622, 329&#x2013;338 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR6\" id=\"ref-link-section-d105321419e6010\" target=\"_blank\" rel=\"noopener\">6<\/a>). For each comparison, we considered only genes available in both brain and plasma datasets. 
We examined the overlap between brain and plasma pGenes (\u2018shared\u2019 pGenes) and the pGenes found only in brain analysis or plasma analysis (\u2018unique\u2019 pGenes) for each plasma pQTL dataset (Supplementary Tables <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#MOESM3\" target=\"_blank\" rel=\"noopener\">25<\/a>\u2013<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#MOESM3\" target=\"_blank\" rel=\"noopener\">27<\/a>). Of our brain pGenes, 91% were shared with plasma pGenes from the Icelanders dataset, 84% with ARIC plasma pGenes and 98% with UKB plasma pGenes (Supplementary Table <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#MOESM3\" target=\"_blank\" rel=\"noopener\">28<\/a>).<\/p>\n<p>GWAS summary statistics<\/p>\n<p>We had access to GWAS summary statistics for 15 psychiatric traits: alcohol use disorder<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 42\" title=\"Zhou, H. et al. Multi-ancestry study of the genetics of problematic alcohol use in over 1 million individuals. Nat. Med. 29, 3184&#x2013;3192 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR42\" id=\"ref-link-section-d105321419e6032\" target=\"_blank\" rel=\"noopener\">42<\/a>, anorexia nervosa<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 43\" title=\"Watson, H. J. et al. Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nat. 
Genet. 51, 1207&#x2013;1214 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR43\" id=\"ref-link-section-d105321419e6036\" target=\"_blank\" rel=\"noopener\">43<\/a>, anxiety<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Levey, D. F. et al. Reproducible genetic risk loci for anxiety: results from approximately 200,000 participants in the Million Veteran Program. Am. J. Psychiatry 177, 223&#x2013;232 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR44\" id=\"ref-link-section-d105321419e6040\" target=\"_blank\" rel=\"noopener\">44<\/a>, ADHD<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"Demontis, D. et al. Genome-wide analyses of ADHD identify 27 risk loci, refine the genetic architecture and implicate several cognitive domains. Nat. Genet. 55, 198&#x2013;208 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR45\" id=\"ref-link-section-d105321419e6044\" target=\"_blank\" rel=\"noopener\">45<\/a>, autism spectrum disorder<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 46\" title=\"Grove, J. et al. Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 51, 431&#x2013;444 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR46\" id=\"ref-link-section-d105321419e6048\" target=\"_blank\" rel=\"noopener\">46<\/a>, bipolar disorder<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 47\" title=\"Mullins, N. et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat. Genet. 
53, 817&#x2013;829 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR47\" id=\"ref-link-section-d105321419e6053\" target=\"_blank\" rel=\"noopener\">47<\/a>, cannabis use disorder<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 48\" title=\"Johnson, E. C. et al. A large-scale genome-wide association study meta-analysis of cannabis use disorder. Lancet Psychiatry 7, 1032&#x2013;1045 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR48\" id=\"ref-link-section-d105321419e6057\" target=\"_blank\" rel=\"noopener\">48<\/a>, insomnia<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Jansen, P. R. et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nat. Genet. 51, 394&#x2013;403 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR49\" id=\"ref-link-section-d105321419e6061\" target=\"_blank\" rel=\"noopener\">49<\/a>, major depressive disorder<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 50\" title=\"Als, T. D. et al. Depression pathophysiology, risk prediction of recurrence and comorbid psychiatric disorders using genome-wide analyses. Nat. Med. 29, 1832&#x2013;1844 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR50\" id=\"ref-link-section-d105321419e6065\" target=\"_blank\" rel=\"noopener\">50<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Meng, X. et al. Multi-ancestry genome-wide association study of major depression aids locus discovery, fine mapping, gene prioritization and causal inference. Nat. Genet. 
56, 222&#x2013;233 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR51\" id=\"ref-link-section-d105321419e6068\" target=\"_blank\" rel=\"noopener\">51<\/a>, neuroticism<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 52\" title=\"Nagel, M. et al. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nat. Genet. 50, 920&#x2013;927 (2018).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR52\" id=\"ref-link-section-d105321419e6072\" target=\"_blank\" rel=\"noopener\">52<\/a>, opioid addiction<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 53\" title=\"Gaddis, N. et al. Multi-trait genome-wide association study of opioid addiction: OPRM1 and beyond. Sci. Rep. 12, 16873 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR53\" id=\"ref-link-section-d105321419e6076\" target=\"_blank\" rel=\"noopener\">53<\/a>, PTSD<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 54\" title=\"Stein, M. B. et al. Genome-wide association analyses of post-traumatic stress disorder and its symptom subdomains in the Million Veteran Program. Nat. Genet. 53, 174&#x2013;184 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR54\" id=\"ref-link-section-d105321419e6081\" target=\"_blank\" rel=\"noopener\">54<\/a>, schizophrenia<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 55\" title=\"Trubetskoy, V. et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. 
Nature 604, 502&#x2013;508 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR55\" id=\"ref-link-section-d105321419e6085\" target=\"_blank\" rel=\"noopener\">55<\/a>, suicide attempt<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 56\" title=\"Mullins, N. et al. Dissecting the shared genetic architecture of suicide attempt, psychiatric disorders, and known risk factors. Biol. Psychiatry 91, 313&#x2013;327 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR56\" id=\"ref-link-section-d105321419e6089\" target=\"_blank\" rel=\"noopener\">56<\/a> and tobacco use<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 57\" title=\"Saunders, G. R. B. et al. Genetic diversity fuels gene discovery for tobacco and alcohol use. Nature 612, 720&#x2013;724 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR57\" id=\"ref-link-section-d105321419e6093\" target=\"_blank\" rel=\"noopener\">57<\/a>. We also had summary statistics for six neurologic traits: Alzheimer\u2019s disease<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 58\" title=\"Bellenguez, C. et al. New insights into the genetic etiology of Alzheimer&#x2019;s disease and related dementias. Nat. Genet. 54, 412&#x2013;436 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR58\" id=\"ref-link-section-d105321419e6097\" target=\"_blank\" rel=\"noopener\">58<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 59\" title=\"Kunkle, B. W. et al. Novel Alzheimer disease risk loci and pathways in African American individuals using the African Genome Resources Panel: a meta-analysis. 
JAMA Neurol. 78, 102&#x2013;113 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR59\" id=\"ref-link-section-d105321419e6100\" target=\"_blank\" rel=\"noopener\">59<\/a>, amyotrophic lateral sclerosis<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 60\" title=\"van Rheenen, W. et al. Common and rare variant association analyses in amyotrophic lateral sclerosis identify 15 risk loci with distinct genetic architectures and neuron-specific biology. Nat. Genet. 53, 1636&#x2013;1648 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR60\" id=\"ref-link-section-d105321419e6104\" target=\"_blank\" rel=\"noopener\">60<\/a>, frontotemporal dementia<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 61\" title=\"Ferrari, R. et al. Frontotemporal dementia and its subtypes: a genome-wide association study. Lancet Neurol. 13, 686&#x2013;699 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR61\" id=\"ref-link-section-d105321419e6109\" target=\"_blank\" rel=\"noopener\">61<\/a>, Lewy body dementia<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 62\" title=\"Chia, R. et al. Genome sequencing analysis identifies new loci associated with Lewy body dementia and provides insights into its genetic architecture. Nat. Genet. 53, 294&#x2013;303 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR62\" id=\"ref-link-section-d105321419e6113\" target=\"_blank\" rel=\"noopener\">62<\/a>, Parkinson\u2019s disease<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 63\" title=\"Nalls, M. A. et al. 
Identification of novel risk loci, causal insights, and heritable risk for Parkinson&#x2019;s disease: a meta-analysis of genome-wide association studies. Lancet Neurol. 18, 1091&#x2013;1102 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR63\" id=\"ref-link-section-d105321419e6117\" target=\"_blank\" rel=\"noopener\">63<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 64\" title=\"Rizig, M. et al. Identification of genetic risk loci and causal insights associated with Parkinson&#x2019;s disease in African and African admixed populations: a genome-wide association study. Lancet Neurol. 22, 1015&#x2013;1025 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR64\" id=\"ref-link-section-d105321419e6120\" target=\"_blank\" rel=\"noopener\">64<\/a> and stroke<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 65\" title=\"Mishra, A. et al. Stroke genetics informs drug discovery and risk prediction across ancestries. Nature 611, 115&#x2013;123 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR65\" id=\"ref-link-section-d105321419e6124\" target=\"_blank\" rel=\"noopener\">65<\/a>. We used these results to integrate brain proteomes with GWASs in PWASs, SMR and PMR-Egger as described below.<\/p>\n<p>Integrating GWASs with brain proteomes using PWASs and SMR<\/p>\n<p>For each of the 21 psychiatric and neurologic conditions, we integrated GWAS summary statistics with brain proteomic and genetic data in each population separately using the population-matching GWAS, brain proteome and LD reference panel. All gene coordinates were based on GRCh38. We performed two independent but complementary integration approaches. 
First, we performed a PWAS of each trait using FUSION<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\" title=\"Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 48, 245&#x2013;252 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR3\" id=\"ref-link-section-d105321419e6136\" target=\"_blank\" rel=\"noopener\">3<\/a> (<a href=\"https:\/\/github.com\/gusevlab\/fusion_twas\" target=\"_blank\" rel=\"noopener\">https:\/\/github.com\/gusevlab\/fusion_twas<\/a>, commit e1ba5f7). We restricted the genotype data to the SNPs in the LD reference panel in each population. SNP-based heritability for each protein was estimated, and proteins with SNP-based heritability P\u2009cis-regulated proteins associated with the trait. We defined significant proteins as those with FDR\u2009<\/p>\n<p>In the second approach, among the significant cis-regulated proteins identified in the PWAS described above, we performed summary data-based Mendelian randomization with SMR<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481&#x2013;487 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR10\" id=\"ref-link-section-d105321419e6159\" target=\"_blank\" rel=\"noopener\">10<\/a> to test whether the brain protein mediated the association between the SNP and trait. We used the population-stratified pQTLs and GWAS summary statistics for each trait in each population separately. 
As mediation can arise from causality, pleiotropy or LD, we then used HEIDI<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481&#x2013;487 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR10\" id=\"ref-link-section-d105321419e6163\" target=\"_blank\" rel=\"noopener\">10<\/a> to test for and remove associations likely to be due to LD (that is, HEIDI P\u2009P\u2009P\u2009&gt;\u20090.05 and a consistent direction of association between protein and trait in the PWAS and SMR.<\/p>\n<p>PMR-Egger using population-stratified data<\/p>\n<p>As an alternative approach to the above PWAS and SMR framework, we used PMR-Egger as implemented in PMR (<a href=\"https:\/\/github.com\/yuanzhongshang\/PMR\" target=\"_blank\" rel=\"noopener\">https:\/\/github.com\/yuanzhongshang\/PMR<\/a>, commit 7e49f14) to perform SNP\u2013protein\u2013disease causal inference using pQTL and GWAS summary statistics. PMR-Egger is a probabilistic two-sample Mendelian randomization that also controls for horizontal pleiotropy<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 11\" title=\"Yuan, Z. et al. Testing and controlling for horizontal pleiotropy with probabilistic Mendelian randomization in transcriptome-wide association studies. Nat. Commun. 11, 3861 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR11\" id=\"ref-link-section-d105321419e6193\" target=\"_blank\" rel=\"noopener\">11<\/a>. 
Unlike traditional Mendelian randomization approaches that use independent instruments, PMR uses correlated instruments and has similar detection power to PWAS approaches such as FUSION<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 3\" title=\"Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 48, 245&#x2013;252 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR3\" id=\"ref-link-section-d105321419e6197\" target=\"_blank\" rel=\"noopener\">3<\/a>. It also accounts for horizontal pleiotropy and provides a P value for the pleiotropic effect. We ran PMR using the PMR_summary_Egger function for each gene in each population (NHW, AA and Hispanic) with our pQTL data and published GWAS results (the same data used in the PWAS and SMR analyses described above), following the instructions in the developer\u2019s example (<a href=\"https:\/\/github.com\/yuanzhongshang\/PMR\/tree\/master\/example\" target=\"_blank\" rel=\"noopener\">https:\/\/github.com\/yuanzhongshang\/PMR\/tree\/master\/example<\/a>). We ran PMR-Egger only for genes that passed the heritability test in the PWAS analysis and calculated an LD matrix for each population using individual-level genetic data from postmortem brain donors. 
To account for potential mismatches between GWAS data and the LD matrix, we performed an LD mismatch check per MESuSiE guidelines and removed mismatching SNPs before the PMR-Egger run.<\/p>\n<p>Identifying multiancestry causal pQTLs in psychiatric and neurologic conditions through Mendelian randomization<\/p>\n<p>In each population separately, we modeled the 858 multiancestry causal pQTLs as the instrumental variable in Mendelian randomization analysis with SMR<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 10\" title=\"Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481&#x2013;487 (2016).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR10\" id=\"ref-link-section-d105321419e6219\" target=\"_blank\" rel=\"noopener\">10<\/a> using the population-matching GWAS and pQTL summary statistics. We declared multiancestry causal pQTLs to be consistent with a causal role if they had SMR FDR\u2009P\u2009&gt;\u20090.05 for the trait of interest.<\/p>\n<p>Physical PPIs<\/p>\n<p>The BioGRID database (v.4.4.232; 28 April 2024)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 66\" title=\"Oughtred, R. et al. The BioGRID database: a comprehensive biomedical resource of curated protein, genetic, and chemical interactions. Protein Sci. 
30, 187&#x2013;200 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR66\" id=\"ref-link-section-d105321419e6234\" target=\"_blank\" rel=\"noopener\">66<\/a> was used to obtain pairwise protein interactions containing only human gene symbols, which were further filtered to include only physical PPIs such as physical associations, direct interactions, associations and colocalization.<\/p>\n<p>Gene set enrichment analysis<\/p>\n<p>For causal proteins for each trait, gene set enrichment analysis was performed using GO-Elite (v.1.2.5)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 67\" title=\"Zambon, A. C. et al. GO-Elite: a flexible solution for pathway and ontology over-representation. Bioinformatics 28, 2209&#x2013;2210 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR67\" id=\"ref-link-section-d105321419e6246\" target=\"_blank\" rel=\"noopener\">67<\/a> with the background gene set of 9,725 proteins that remained after comprehensive quality control. Causal proteins were subjected to Fisher\u2019s exact overlap test and Z-test in the Python command-line version of GO-Elite, setting the species as Homo sapiens and using the current annotation databases for Gene Ontology biological processes, molecular functions, cellular components, WikiPathways, KEGG, REACTOME and CORUM (downloaded April 2024)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 68\" title=\"Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. 
USA 102, 15545&#x2013;15550 (2005).\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#ref-CR68\" id=\"ref-link-section-d105321419e6256\" target=\"_blank\" rel=\"noopener\">68<\/a>.<\/p>\n<p>Reporting summary<\/p>\n<p>Further information on research design is available in the <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/www.nature.com\/articles\/s41588-025-02291-2#MOESM2\" target=\"_blank\" rel=\"noopener\">Nature Portfolio Reporting Summary<\/a> linked to this article.<\/p>\n","protected":false},"excerpt":{"rendered":"Ethics Our study complies with all relevant ethical regulations and was approved by the institutional review boards at&hellip;\n","protected":false},"author":2,"featured_media":409615,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3846],"tags":[3971,231,3973,3967,3970,1301,21798,3972,3968,267,3969,140980,70,16,15],"class_list":{"0":"post-409614","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-agriculture","9":"tag-alzheimers-disease","10":"tag-animal-genetics-and-genomics","11":"tag-biomedicine","12":"tag-cancer-research","13":"tag-depression","14":"tag-gene-expression-profiling","15":"tag-gene-function","16":"tag-general","17":"tag-genetics","18":"tag-human-genetics","19":"tag-proteome-informatics","20":"tag-science","21":"tag-uk","22":"tag-united-kingdom"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/115172632379173293","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/409614","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/
wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=409614"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/409614\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/409615"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=409614"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=409614"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=409614"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}