Skip to main content
eLife logoLink to eLife
. 2024 Dec 17;12:RP88768. doi: 10.7554/eLife.88768

Statistical examination of shared loci in neuropsychiatric diseases using genome-wide association study summary statistics

Thomas P Spargo 1,2,3,, Lachlan Gilchrist 1,4,5, Guy P Hunt 2,5,6, Richard JB Dobson 2,3,7,8, Petroula Proitsi 1, Ammar Al-Chalabi 1,9, Oliver Pain 1,, Alfredo Iacoangeli 1,2,3,†,
Editors: Joris Deelen10, Timothy E Behrens11
PMCID: PMC11651651  PMID: 39688956

Abstract

Continued methodological advances have enabled numerous statistical approaches for the analysis of summary statistics from genome-wide association studies. Genetic correlation analysis within specific regions enables a new strategy for identifying pleiotropy. Genomic regions with significant ‘local’ genetic correlations can be investigated further using state-of-the-art methodologies for statistical fine-mapping and variant colocalisation. We explored the utility of a genome-wide local genetic correlation analysis approach for identifying genetic overlaps between the candidate neuropsychiatric disorders, Alzheimer’s disease (AD), amyotrophic lateral sclerosis (ALS), frontotemporal dementia, Parkinson’s disease, and schizophrenia. The correlation analysis identified several associations between traits, the majority of which were loci in the human leukocyte antigen region. Colocalisation analysis suggested that disease-implicated variants in these loci often differ between traits and, in one locus, indicated a shared causal variant between ALS and AD. Our study identified candidate loci that might play a role in multiple neuropsychiatric diseases and suggested the role of distinct mechanisms across diseases despite shared loci. The fine-mapping and colocalisation analysis protocol designed for this study has been implemented in a flexible analysis pipeline that produces HTML reports and is available at: https://github.com/ThomasPSpargo/COLOC-reporter.

Research organism: Human

Introduction

The genetic spectrum of neuropsychiatric disease is diverse and various overlaps exist between traits. For instance, genetic pleiotropy between amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) is increasingly recognised, and ALS is genetically correlated with Alzheimer’s disease (AD), Parkinson’s disease (PD), and schizophrenia (Van Rheenen et al., 2021; Li et al., 2021; Ranganathan et al., 2020). Improving understanding of the genetic architecture underlying these complex diseases could facilitate future treatment discovery.

Advances in genomic research techniques have accelerated discovery of genetic variation associated with complex traits. Genome-wide association studies (GWAS), in particular, have enabled population-scale investigations of the genetic basis of human diseases and anthropometric measures (Abdellaoui et al., 2023). Summary-level results from GWAS are being shared alongside publications with increasing frequency over time (Reales and Wallace, 2023), and a breadth of approaches now exist for downstream analysis based on summary statistics which can enable their interpretation and provide further biological insight.

Genetic correlation analysis allows estimation of genetic overlap between traits (Bulik-Sullivan et al., 2015; Werme et al., 2022; Zhang et al., 2021; Shi et al., 2017). A ‘global’ genetic correlation approach gives a genome-wide average estimate of this overlap. However, genetic relationships between traits can be obscured when correlations in opposing directions cancel out genome-wide (Werme et al., 2022). Recent methods allow for a more nuanced analysis, of ‘local’ genetic correlations partitioned across the genome (Werme et al., 2022; Zhang et al., 2021). This stratified approach to genome-wide analysis could prove effective for identifying pleiotropic regions and designing subsequent analyses aiming to identify genetic variation shared between traits.

A number of methods aim to disentangle causality within associated regions. This is important because the focus on single-nucleotide polymorphisms (SNPs), which are markers of genetic variation, in GWAS produces results that can be difficult to interpret, and causal variants are typically unclear. More so, because of linkage disequilibrium (LD), GWAS associations often comprise large sets of highly correlated SNPs spanning large genomic regions. Statistical fine-mapping is a common approach for dissecting complex LD structures and finding variants with implications for a given trait among the tens or hundreds that might be associated in the region (Zou et al., 2022).

Interpretation of regions associated with multiple traits can also be challenging, since it is often unclear whether these overlaps are driven by the same causal variant. Statistical colocalisation analysis can disentangle association signals across traits to suggest whether the overlaps result from shared or distinct causal genetic factors (Wallace, 2021; Giambartolomei et al., 2018; Foley et al., 2021). Traditionally, this analysis was restricted by the assumption of at most one causal variant for each trait in the region. However, recent extensions to the method now permit analysis based on univariate fine-mapping results for the traits compared and, therefore, analysis of regions with multiple causal variants.

Accordingly, we conducted genome-wide local genetic correlation analysis across five neuropsychiatric traits with recognised phenotypic and genetic overlap (Li et al., 2021; Ranganathan et al., 2020; Ferrari et al., 2017; Weintraub and Mamikonyan, 2019 Beck et al., 2013): AD, ALS, FTD, PD, and schizophrenia. Although several previous studies have performed global genetic correlation analyses between various combinations of these traits (Van Rheenen et al., 2021; Li et al., 2021; Wainberg et al., 2023; McLaughlin et al., 2017), we believe that this is the first to compare them at a genome-wide scale using a local genetic correlation approach. Loci highly correlated between trait pairs were further investigated with univariate fine-mapping and bivariate colocalisation techniques to examine variants driving these associations.

Materials and methods

Sampled GWAS summary statistics

We leveraged publicly accessible summary statistics from European ancestry GWAS meta-analyses of risk for AD (Kunkle et al., 2019), ALS (Van Rheenen et al., 2021), FTD (Ferrari et al., 2014), PD (Nalls et al., 2019), and schizophrenia (Trubetskoy et al., 2022). European ancestry data were selected to avoid LD mismatch between the GWAS sample and reference data from an external European population.

Procedure

Figure 1 summarises the analysis protocol for this study; further details are provided below.

Figure 1. Overview of the analysis procedure for this study.

Figure 1.

SuSiE (sum of single effects) is a univariate fine-mapping approach implemented within the R package susieR. ‘coloc’ is an R package for bivariate colocalisation analysis between pairs of traits. h2 = heritability, rg = bivariate genetic correlation. The analysis steps shaded in blue have been implemented within a readily applied analysis pipeline available on GitHub, copy archived at Spargo, 2023.

Processing of GWAS summary statistics

A standard data cleaning protocol was applied to each set of summary statistics (Pain et al., 2021). We retained only SNPs, excluding any non-SNP or strand-ambiguous variants. SNPs were filtered to those present within the 1000 Genomes phase 3 (1KG) European ancestry population reference dataset (Auton et al., 2015) (N = 503). They were matched to the 1KG reference panel by GRCh37 chromosomal position using bigsnpr (version 1.11.6) (Privé et al., 2018), harmonising allele order with the reference and assigning SNP IDs.

If not reported, and where possible, effective sample size (Neff) was calculated from per-SNP case and control sample sizes. When this could not be determined per-SNP, all variants were assigned a single Neff, calculated as a sum of Neff values for each cohort within the GWAS meta-analysis (Grotzinger et al., 2023).

Further processing was performed where possible, excluding SNPs with p-values ≤0 or >1 and Neff >3 standard deviations from the median Neff. Imputation INFO scores indicate the probability of each genotype given the available data and reference panel - we removed variants with INFO <0.9 if indicated. We filtered to include only variants with minor allele frequency (MAF) ≥0.005 in both the reference and GWAS samples and excluded SNPs with an absolute MAF difference of >0.2 between the two.

Genome-wide analyses

Global heritability and genetic correlations

LDSC (version 1.0.1) (Bulik-Sullivan et al., 2015; Bulik-Sullivan et al., 2015) was applied to estimate genome-wide univariate heritability (h2) for each trait on the liability scale. The software was also applied to derive ‘global’ (i.e., genome-wide) genetic correlation estimates between trait pairs and estimate sample overlap from the bivariate intercept. The latter of these outputs was taken forward as an input for the local genetic correlation analysis using LAVA (see 2.2.2.2). Since global genetic correlation analysis across the traits studied here is not novel and associations reported in past studies are congruent across different tools (Wainberg et al., 2023), the compatibility between LDSC and LAVA motivated our use of LDSC for this analysis.

These analyses were performed using the HapMap3 (Altshuler et al., 2010) SNPs and the LD score files provided with the software, calculated in the 1KG European population. No further MAF filter was applied (therefore variants with MAF ≥0.005 were included) and the other settings were left to their defaults.

Local genetic correlation analysis

LAVA (version 0.1.0) (Werme et al., 2022) was applied to obtain local genetic correlation estimates across 2495 approximately independent blocks delineating the genome, based on patterns in LD. We used the blocks provided alongside the LAVA software which were derived from the 1KG European cohort. Bivariate intercepts from LDSC were provided to LAVA to estimate sample overlap between trait pairs.

LAVA was the most appropriate local genetic correlation approach for this study for several reasons (Werme et al., 2022). First, unlike SUPERGNOVA (Zhang et al., 2021) and rho-HESS (Shi et al., 2017), LAVA makes specific accommodations for analysis of binary traits. Second, other tools focus on bivariate correlation between traits while LAVA offers this alongside multivariate tests such as multiple regression and partial correlation, enabling rigorous testing of pleiotropic effects. Lastly, LAVA is shown to provide results which are less biased than those from other tools.

In accordance with prior studies, genetic correlation analysis was performed following an initial filtering step. Univariate heritability was estimated for each genomic block across SNPs in-common between a pair of traits, and only loci with local h2 p-values below a threshold of 2.004 × 10−5 (0.05/2495) in both traits continued to the bivariate analysis. This step ensures that univariate heritability is sufficient in both traits for a robust correlation estimate.

Targeted genetic analyses

Fine-mapping and colocalisation analysis

Statistical fine-mapping and colocalisation techniques were applied to further analyse associations between trait pairs in regions where the false discovery rate (FDR) adjusted p-value of local genetic correlation analysis was below 0.05 (after adjusting for all bivariate comparisons performed). Additional analysis was conducted at loci where significant correlations occurred between two trait pairs but not between the final pairwise comparison across the three implicated traits.

Fine-mapping was performed with susieR (v0.12.27) (Zou et al., 2022; Wang et al., 2020a), which implements the ‘sum of single effects’ (SuSiE) model to represent statistical evidence of causal genetic variation within ‘credible sets’ and per-SNP posterior inclusion probabilities (PIPs). A 95% credible set indicates 95% certainty that at least one SNP included within the set has a causal association with the phenotype and higher PIPs indicate a greater posterior probability of being a causal variant within a credible set. Multiple credible sets are identified when the data suggest more than one independent causal signal.

Colocalisation analysis was implemented with coloc (v5.1.0.1) (Wallace, 2021; Giambartolomei et al., 2018; Giambartolomei et al., 2014), which calculates posterior probabilities that a causal variant exists for neither, one, or both of two compared traits, testing also whether evidence for a causal variant in both traits suggests a shared variant (i.e., hypothesis 4 (H4); colocalisation) or independent signals (hypothesis 3 (H3)). Colocalisation analyses can be performed across all variants sampled in a region, under an assumption of at most one variant implicated per trait. It can also be performed using variants attributed to pairs of credible sets from SuSiE, relaxing the single variant assumption (Wallace, 2021). When evidence of a shared variant is found, the individual SNPs with the highest posterior probability of being that variant can be assessed. With a 95% confidence threshold, these are termed 95% credible SNPs.

Analysis pipeline

We conducted colocalisation and fine-mapping analysis within an open-access pipeline developed for this study using R (v4.2.2) (R Development Core Team, 2021): https://github.com/ThomasPSpargo/COLOC-reporter, copy archived at Spargo, 2023.

Briefly, in this workflow (see Figure 1), GWAS summary statistics are harmonised across analysed traits for a specified genomic region, including only variants in common between them and available within a reference population. An LD correlation matrix across sampled variants is derived from a reference population using PLINK (v1.90) (Purcell et al., 2007; Purcell, 2009).

Quality control is performed per-dataset prior to univariate fine-mapping analysis. Diagnostic tools provided with susieR are applied to test for consistency between the LD matrix and Z-scores from the GWAS and identify variants with a potential ‘allele flip’ (reversed effect estimate encoding) that can impact fine-mapping.

Fine-mapping is performed for each dataset with the coloc package runsusie function, which wraps around susie_rss from susieR and is configured to facilitate subsequent colocalisation analysis. Sample size (Neff for binary traits) is specified as the median for SNPs analysed. Colocalisation analysis can be performed with the coloc functions coloc.abf and coloc.susie when fine-mapping yields at least one credible set for both traits and otherwise using coloc.abf only. Genes located near credible sets from fine-mapping and credible SNPs from colocalisation analyses are identified via Ensembl and biomaRt (v2.54.0) (Durinck et al., 2005; Durinck et al., 2009; Cunningham et al., 2022).

Analysis parameters can be adjusted by the user in accordance with their needs. Various utilities are included to help interpretation of fine-mapping and colocalisation results, including identification of genes nearby to putatively causal signals, HTML reports to summarise completed analyses, and figures to visualise the results and compare the examined traits.

Current implementation

In this study, LD correlation matrices were derived from the 1KG European cohort. SNPs flagged for potential allele flip issues in either of the compared traits were removed from the analysis. Fine-mapping was performed with the susie_rss refine=TRUE option to avoid local maxima during convergence of the algorithm, leaving the other settings to the runsusie defaults. Colocalisation analysis was performed using the default priors for coloc.susie (p1 = 1 × 10−4, p2 = 1 × 10−4, p12 = 5 × 10−6).

Colocalisation and fine-mapping analyses were performed initially using the genomic blocks defined by LAVA, since these aim to define relatively independent LD partitions across the genome (Werme et al., 2022). If a 95% credible set could not be identified in one or both traits, we inspected local Manhattan plots for the region to determine whether potentially relevant signals occurred around the region boundaries. The analysis was repeated with a ±10 kb window around the LAVA-defined genomic region if p-values for SNPs at the edge of the block were p < 1 × 10−4 for both traits and the Manhattan plots were suggestive of a ‘peak’ not represented within the original boundaries.

Results

Genome-wide analyses

Descriptive information and heritability estimates for the sampled traits and GWAS are presented in Table 1. ALS had nominally significant global genetic correlations with schizophrenia (p = 0.045), PD (p = 0.013), and AD (p = 0.006); no other bivariate genome-wide correlations were statistically significant (see Figure 2).

Table 1. Genome-wide association studies (GWAS) sampled.

Each GWAS is a GWAS meta-analysis of disease risk across people of European ancestry.

Trait Estimated lifetime risk in population GWAS Liability scale h2 (standard error)
Reference N cases N controls
Alzheimer’s disease 1/10, Chêne et al., 2015 Kunkle et al., 2019 21,982 41,944 0.093 (0.0155)
Amyotrophic lateral sclerosis 1/350, Alonso et al., 2009; Johnston et al., 2006 Van Rheenen et al., 2021 27,205 110,881 0.0277 (0.003)
Frontotemporal dementia 1/742, Coyle-Gilchrist et al., 2016 Ferrari et al., 2014 2154 4308 0.0329 (0.0283)
Parkinson’s disease 1/37, Parkinson’s, 2017 Nalls et al., 2019 15,056
(+18,618 proxies*)
449,056 0.0506 (0.0046)
Schizophrenia 1/250, Saha et al., 2005 Trubetskoy et al., 2022 53,386 77,258 0.1761 (0.0061)
*

Proxy cases from the UK Biobank Cohort.

Estimated from cumulative risk after age 45 after correcting for competing risk of mortality and assuming a lifespan of ~85 years. h2 = heritability.

Figure 2. Genome-wide genetic correlation estimates between all trait pairs.

Figure 2.

The heatmap displays genetic correlations (rg) each tile is labelled with the rg estimate and with its standard error in parentheses, alongside the p-value. AD = Alzheimer’s disease, ALS = amyotrophic lateral sclerosis, FTD = frontotemporal dementia, PD = Parkinson’s disease, SZ = schizophrenia.

A total of 605 local genetic correlation analyses were performed across all trait pairs in genomic regions where both traits passed the univariate heritability filtering step after restricting to SNPs sampled in both GWAS (see Table 2; Figure 3; Supplementary file 1a). The number of loci passing to bivariate analysis varied greatly across trait pairs and was congruent with the genome-wide heritability estimates (and their uncertainty) for each trait, reflecting differences in phenotypic variance explained by measured genetic variants and statistical power for each GWAS (see Table 1).

Table 2. Comparison of genome-wide SNP significance against local genetic correlation significance thresholds in all trait pairs and loci analysed.

All loci analysed showed sufficient local univariate heritability across compared traits to allow bivariate correlation analysis. Subsequent fine-mapping and colocalisation analyses were performed in this study for regions with at least a false discovery rate (FDR) adjusted significance for the local genetic correlation. SNP = single-nucleotide polymorphism.

Number of traits in pair with genome-wide significant (p < 5 × 10−8) SNP in locus Smallest significance threshold for local genetic correlation
Bonferroni (p < 8.26 × 10−5; 0.05/605) FDR (pfdr < 0.05) Nominal (p < 0.05) Non-significant(p ≥ 0.05)
0 1 17 77 394
1 1 4 18 80
2 0 3 2 8

Figure 3. Local genetic correlation analyses between trait pairs.

The lower panel displays a heatmap of genetic correlations (rg) across genomic regions where any bivariate analyses were performed; white colouring indicates that the region was not analysed for a given trait pair owing to insufficient univariate heritability in one or both traits. The upper panel shows a Manhattan plot of p-values from each correlation analysis, denoting trait pairs by colour and comparisons passing defined significance thresholds by shape (square for a strict Bonferroni threshold and triangle for a false discovery rate [FDR] adjusted threshold); the hatched line indicates the threshold p-value above which pfdr < 0.05. The panels are both ordered by relative genomic position, with bars above and below indicating each chromosome. AD = Alzheimer’s disease, ALS = amyotrophic lateral sclerosis, FTD = frontotemporal dementia, PD = Parkinson’s disease, SZ = schizophrenia. Supplementary file 1a provides a complete summary of local genetic correlation analyses performed. Twenty-six bivariate comparisons were significant following FDR adjustment (pfdr < 0.05), two of which also passed the stringent Bonferroni threshold (p < 8.26 × 10−5; 0.05/605). While some regions included genome-wide significant single-nucleotide polymorphisms (SNPs) (p < 5 × 10−8) for one or both traits, others occurred in regions where genome-wide association studies (GWAS) associations were weaker (see Table 2). Five of these associations occurred at loci within the human leukocyte antigen (HLA) region (GRCh37: Chr6:28.48–33.45 Mb; 6p22.1–21.3, Genome Reference Consortium, 2023), and all five traits were implicated in at least one of these.

Figure 3.

Figure 3—figure supplement 1. Comparison of positively and negatively correlated genetic loci.

Figure 3—figure supplement 1.

(A) Distribution of genetic correlation estimates (rg) and p-values across trait pairs in local genetic correlation analysis. Trait pairs are denoted by colour and comparisons passing defined significance thresholds are denoted by shape (square for a strict Bonferroni threshold and triangle for a false discovery rate [FDR] adjusted threshold); the hatched line indicates the threshold p-value above which pfdr < 0.05. (B) Boxplots comparing FDR-significant loci with positive and negative local genetic correlations. Sub-panels are split row-wise by comparison measure: ‘−log10 p(A)’ and ‘−log10 p(B)’ show the p-value for the most significant variant in a given locus for each of the two analysed traits, with ‘A’ being the trait with more-significant and ‘B’ less-significant top variant; ‘Absolute rg’ shows the absolute correlation coefficient for the local genetic correlation between trait pairs in a given locus. The leftmost column of sub-panels displays the results across all significant loci and remaining columns split data by the number of traits in the pair with at least one fine-mapping credible set identified at that locus.

Targeted genetic analyses

Univariate fine-mapping and bivariate colocalisation analyses were subsequently performed to test for variants jointly implicated between trait pairs in regions with local genetic correlation pfdr < 0.05. The ALS and schizophrenia trait pair was additionally examined at Chr6:32.22–32.45 Mb because significant genetic correlations were found between ALS and FTD and between schizophrenia and FTD at this locus. The correlation between ALS and schizophrenia at this locus had not been analysed owing to insufficient univariate heritability for ALS after restricting to SNPs in common with the schizophrenia GWAS.

Fine-mapping identified at least one 95% credible set for each of the compared traits for 7 of the 27 comparisons performed (see Table 3), and for one trait only in a further 5 (see Supplementary file 1b; Supplementary file 1c). This analysis suggested two credible sets for schizophrenia in the Chr12:56.99–58.75 Mb locus, for AD in Chr6:32.45–32.54 Mb, and (only when harmonised to SNPs in common with the ALS GWAS) for FTD in Chr6:32.22–32.45 Mb (see Supplementary file 1c). Although both positive and negative local genetic correlations passed the FDR-adjusted significance threshold, we observed only positive local genetic correlations in loci where fine-mapping credible sets were identified for both traits in the pair. This reflects that the absolute correlation coefficients and variant associations from the analysed GWAS studies were generally stronger in the positively correlated loci (see Figure 3—figure supplement 1).

Table 3. Colocalisation analysis conducted across 95% credible sets identified during univariate fine-mapping of trait pairs.

N SNPs refer to the number of SNPs present for both traits and the 1000 Genomes reference panel in the region within colocalisation and fine-mapping analysis.

Trait Genomic position (GRCh37) Local genetic correlation estimate (95% confidence interval) Fine-mapping credible set for trait N SNPs SNP with highest PIP for fine-mapping credible set (nearest gene; sense-strand base pair distance) Posterior probability for hypothesis
1 2 1 2 Trait 1 Trait 2 H0 H1 H2 H3 H4
AD PD Chr6:32576785–32639239 0.406 (0.197, 0.648) 1 1 958 rs9271247
(HLA-DQA1; +15,844)
rs3129751
(HLA-DQA1; +13,767)
<0.01 <0.01 <0.01 0.95 0.05
ALS AD Chr6:32629240–32682213* 0.974 (0.717, 1.000) 1 1 475 rs9275477
(MTCO3P1; +1260)
rs9275207 (MTCO3P1; +16,191) <0.01 <0.01 <0.01 0.10 0.90
ALS FTD Chr6:32208902–32454577§ 0.723 (0.370, 1.000) 1 1 1709 rs9268833
(HLA-DRB9; 0)
rs1980493 (BTNL2; 0) <0.01 <0.01 0.01 0.99 <0.01
2 rs9767620
(HLA-DRB9; +1498)
<0.01 <0.01 0.01 0.99 <0.01
ALS SZ Chr6:32208902–32454577§ - 1 1 1711 rs9268833 (HLA-DRB9; 0) rs9268219 (C6orf10; 0) <0.01 <0.01 <0.01 0.98 <0.01
Chr12:56987106–58748139 0.506 (0.218, 0.807) 1 1 2260 rs113247976 (KIF5A; 0) rs12814239
(LRP1; 0)
<0.01 <0.01 <0.01 1.00 <0.01
2 rs324017 (NAB2; 0) <0.01 <0.01 <0.01 1.00 <0.01
PD SZ Chr17:43460501–44865832 0.595 (0.266, 0.950) 1 1 2453 rs58879558 (MAPT; 0) rs62062288 (MAPT; 0) <0.01 <0.01 <0.01 0.81 0.19
SZ FTD Chr6:32208902–32454577§ 0.669 (0.379, 0.990) 1 1 1657 rs9268219 (C6orf10; 0) rs9268877 (HLA-DRB9; 0) <0.01 <0.01 <0.01 1.00 <0.01
*

Indicates comparisons with genetic correlation analysis p < 8.26 × 10−5 (0.05/605).

Denotes locus extended by ±10 kb for fine-mapping and colocalisation analysis.

Variant identified in colocalisation as having the highest posterior probability of being shared variant assuming hypothesis 4 is true (see Figure 4).

§

Differences in fine-mapping solutions across trait pairs in the Chr6:32.21–32.45 Mb locus reflect differences in the SNPs retained after restricting to those in common between the compared genome-wide association studies (GWAS).

H0 = no causal variant for either trait, H1 = variant causal for trait 1, H2 = variant causal for trait 2, H3 = distinct causal variants for each trait, H4 = a shared causal variant between traits. PIP = posterior inclusion probability, AD = Alzheimer’s disease, ALS = amyotrophic lateral sclerosis, FTD = frontotemporal dementia, PD = Parkinson’s disease, SZ = schizophrenia.

Colocalisation analyses performed across fine-mapping credible sets and across all SNPs in a region generally gave support to the equivalent hypothesis (Table 3; Supplementary file 1b). Moreover, comparisons suggesting a signal was present in one trait only were largely concordant with the identification of fine-mapping credible sets in only that trait (Supplementary file 1b). Figure 4—figure supplement 1 compares per-SNP p-values across trait pairs for comparisons with evidence of a relevant signal in both traits. Figure 4—figure supplement 2 shows patterns of LD across SNPs assigned to credible sets for these analyses.

Strong evidence was found for a shared variant between ALS and AD within the Human leukocyte antigen (HLA) region (posterior probability of shared variant = 0.9; see Figure 4). The 95% credible SNPs for this association were distributed around the MTCO3P1 pseudogene and rs9275477, the lead genome-wide significant SNP from the ALS GWAS in this region, had the highest posterior probability of being implicated in both traits. Figure 4—figure supplement 3 presents sensitivity analysis showing that the result is robust to a range of values for the shared-variant hypothesis prior probability.

Figure 4. Evidence for colocalisation between amyotrophic lateral sclerosis (ALS) and Alzheimer’s disease (AD) in the Chr6:32.63–32.68 Mb region.

(A) Single-nucleotide polymorphism (SNP)-wise p-value distribution between ALS and AD across Chr6:32.63–32.68 Mb, in which colocalisation analysis found 0.90 posterior probability of the shared-variant hypothesis (see Table 3). (B) (upper) Per-SNP posterior probabilities for being a shared variant between ALS and AD, (lower) positions of HGNC gene symbols nearby to the 95% credible SNPs. Posterior probabilities for being a shared-variant sum to 1 across all SNPs analysed and are predicated on the assumption that a shared variant exists; 95% credible SNPs are those spanned by the top 0.95 of posterior probabilities. The x-axis for Panel B is truncated by the base pair range of the credible SNPs and genomic positions are based on GRCh37.

Figure 4.

Figure 4—figure supplement 1. Single-nucleotide polymorphism (SNP)-wise p-value distribution between trait pairs in comparisons where colocalisation analysis suggested a causal variant in both traits.

Figure 4—figure supplement 1.

Colocalisation analysis supported the shared-variant hypothesis for the comparison in (A), and the presence of distinct variants for each trait in (B-G) (see Table 3). Colouring indicates fine-mapping credible sets assigned to SNPs across the traits compared; the legend above each panel is in the format 'Trait: credible set number'. The genomic position range shown above each panel is in Mb. AD = Alzheimer’s disease, ALS = amyotrophic lateral sclerosis, FTD = frontotemporal dementia, PD = Parkinson’s disease, SZ = schizophrenia.
Figure 4—figure supplement 2. Heatmaps of linkage disequilibrium (LD) in the 1000 Genomes European reference population across variants assigned to any credible set during univariate fine-mapping of trait pairs (A-G).

Figure 4—figure supplement 2.

LD is shown relative to the single-nucleotide polymorphisms (SNPs) with the highest posterior inclusion probability (PIP) for each credible set, displaying all top PIP SNPs when ties occur. The y-axis splits by top PIP SNPs and the x-axis displays SNPs ordered by genomic position, marking only the positions of the top PIP SNPs. Credible set assignments for each variant are shown in the colour bar at the top of each panel and for the top PIP SNPs in the y-axis label; these are annotated in the format: ‘trait: credible set number’. The genomic range indicated at the top right of each panel refers to the positions spanned across all SNPs analysed and is in Mb. AD = Alzheimer’s disease, ALS = amyotrophic lateral sclerosis, FTD = frontotemporal dementia, PD = Parkinson’s disease, SZ = schizophrenia.
Figure 4—figure supplement 3. Sensitivity of colocalisation analysis to the prior probability of a shared variant between traits.

Figure 4—figure supplement 3.

The upper panels display analysis at Chr6:32629240–32682213 between amyotrophic lateral sclerosis (ALS) and Alzheimer’s disease (AD). The lower panels are for Chr17:43460501–44865832 between Parkinson’s disease (PD) and schizophrenia (SZ). Panels labelled ‘coloc.abf’ display analysis across all single-nucleotide polymorphisms (SNPs) in the region and ‘coloc.susie’ indicates analysis across the SNPs within the pair of fine-mapping credible sets identified across trait pairs. Plot points indicate posterior probability of each hypothesis (H0 = no causal variant for either trait, H1 = variant causal for the trait one, H2 = variant causal for trait two, H3 = distinct causal variants for each trait, H4 = a shared causal variant between traits), according to the prior probability of H4. The vertical hatched line indicates the prior H4 probability defined for the reported analysis; the black horizontal line indicates the defined threshold for acceptance of H4: posterior H4 probability >0.8. Cream shading of the plot area indicates prior H4 probabilities which yield a posterior probability of H4 above the threshold.

The other comparisons that found fine-mapping credible sets in both traits suggested that overlaps from the correlation analysis were driven by distinct causal variants (see Table 3; Supplementary file 1b).

Univariate fine-mapping of PD and schizophrenia at Chr17:43.46–44.87 Mb found large credible sets spanning many genes, including MAPT (Allen et al., 2014; Snowden et al., 2015; Origone et al., 2018; Nakayama et al., 2019; Cheng et al., 2020) and CRHR1 (Cheng et al., 2020; Bigdeli et al., 2021) which have been previously implicated in the traits we have analysed. These expansive credible sets reflect the strong LD in the region and indicate a signal that is difficult to localise (see Figure 4—figure supplement 2F; Supplementary file 1c). The colocalisation analysis suggested independent variants for each trait despite many SNPs overlapping across their respective credible sets (see Figure 4—figure supplement 2F). Sensitivity analysis showed robust support for the two independent variants hypothesis across shared-variant hypothesis priors (Figure 4—figure supplement 3). However, the colocalisation analysis will increasingly favour the two independent variants hypothesis as the number of analysed variants increases (Wallace, 2020). Hence, the wide-spanning LD of this region may have obstructed identification of variants and mechanisms shared between the traits.

Discussion

We examined genetic overlaps between the neuropsychiatric conditions AD, ALS, FTD, PD, and schizophrenia. Through genetic correlation analysis, we replicated genome-wide correlations previously described between the studied traits (Van Rheenen et al., 2021; Li et al., 2021; Wainberg et al., 2023; McLaughlin et al., 2017). Leveraging a more recent local genetic correlation approach, we identified specific genomic loci jointly implicated between pairs of traits which were further investigated using statistical fine-mapping and colocalisation techniques.

Significant local genetic correlations were most frequent across genomic blocks within the HLA region, implicating each of the studied traits in at least one comparison. Several associated regions contained genes with known relevance for the traits studied, such as KIF5A, MAPT, and CRHR1. Colocalisation analysis found strong evidence for a shared genetic variant between ALS and AD in the Chr6:32.62–32.68 Mb locus within HLA, while the other colocalisation analyses suggested causal signals distinct across traits, for one trait only, or for neither trait.

The tendency for association between traits around the HLA region is reasonable, since this is a known hotspot for pleiotropy (Werme et al., 2022; Watanabe et al., 2019). The human leukocyte antigen (HLA) system is particularly known for its role in immune response and it is implicated in various types of disease (Dendrou et al., 2018; Trowsdale and Knight, 2013). Mounting evidence has linked HLA and associated genetic variation to the traits we have analysed, and mechanisms underlying these associations are beginning to be understood (Dendrou et al., 2018; Trowsdale and Knight, 2013; Wang et al., 2020b; Song et al., 2016; Yu et al., 2021; Broce et al., 2018; Ferrari et al., 2016; Al-Diwani et al., 2017; Mokhtari and Lachman, 2016; Aliseychik et al., 2018; Zhang et al., 2022). For instance, AD is associated with variants around the HLA-DQA1 and HLA-DRB1 genes and several SNPs in the non-coding region between them have been shown to modulate their expression (Zhang et al., 2022). Notably, one of the SNPs with a demonstrated regulatory role, rs9271247, had the highest probability of being causal for AD across the 95% credible set identified in the fine-mapping of the region.

Variants showing evidence for colocalisation between AD and ALS were distributed around the MTCO3P1 pseudogene in the HLA class II non-coding region between HLA-DQB1 and HLA-DQB2. MTCO3P1 has been previously identified as one of the most pleiotropic genes in the GWAS catalog (Chesmore et al., 2018; Sollis et al., 2023). Previous studies have suggested the relevance of this region in both traits. HLA-DQB1 and HLA-DQB2 are both upregulated in the spinal cord of people with ALS, alongside other genes implicated in various immunological processes for antigen processing and inflammatory response (Andrés-Benito et al., 2017). HLA class II complexes, and their subcomponents, have been identified as upregulated in multiple brain regions of people with AD, using both gene and protein expression techniques (Aliseychik et al., 2018; Hopperton et al., 2018; Pain et al., 2023).

Our analysis of this region gave stronger support for colocalisation between the ALS and AD GWAS than a previous study (Van Rheenen et al., 2021). The previous study analysed a 200 kb window of over 2000 SNPs around the lead genome-wide significant SNP from the ALS GWAS, rs9275477, and found ~0.50 posterior probability for each of the shared and two independent variant(s) hypotheses. The current analysis used 475 SNPs occurring within a semi-independent LD block of ~50 kb in this locus. Since the posterior probability of the two independent variants hypothesis (H3) increases exponentially with the number of variants in the region while the shared-variant hypothesis (H4) scales linearly, it is expected that our analysis would give stronger support for the latter (Wallace, 2020). Given that the previous study defined regions for analysis based on an arbitrary window of ±100 kb around each lead genome-wide significant SNP from the ALS GWAS and we defined each analysis region based on patterns of LD in European ancestry populations, it is reasonable to favour the current finding.

More broadly, our analyses suggest that regions with a strong genetic correlation between the five traits studied often result from adjacent but trait-specific signals, likely reflecting overlaps between LD blocks (Watanabe et al., 2019). Correlations also occurred in regions with weaker overall GWAS associations (see Table 2), where fine-mapping and colocalisation analyses did not suggest causal associations in one or either trait. Such patterns likely reflect a shared polygenic trend across the region, rather than associations attributable to discrete variants. Accordingly, other approaches may be better suited for identifying regions containing genetic variation jointly causal across diseases, including the traditional approach of testing regions around overlapping genome-wide significant variants.

This study has used gold-standard statistical tools to examine genetic relationships between traits. The local genetic correlation analysis approach enabled targeted investigation of genomic regions which appear to overlap between traits. The application of colocalisation analysis alongside a prior univariate fine-mapping step allowed for associations to be tested without conflating independent but nearby signals under the single-variant assumption of colocalisation analysis across all variants sampled in a region.

The study is not without limitation. We necessarily used the 1KG European reference population to estimate LD between SNPs. Fine-mapping is ideally performed with an LD matrix from the GWAS sample and is sensitive to misspecification when inconsistencies in LD occur between the reference and GWAS cohorts. Use of a reference population is not uncommon, and diagnostic tools available within the susieR package allow testing for inconsistencies between the reference and GWAS samples (Zou et al., 2022) . We accordingly implemented these tools centrally into our workflow and determined that the LD matrices from the 1KG reference were suitable for the data (estimates of Z-score and LD consistency are available in Supplementary file 1c). Nevertheless, repeating this study in under-represented populations would be an important future step to validate our findings.

We employed statistical methods to identify and analyse genomic regions containing variants which might be jointly implicated across traits. These approaches provide useful associations between traits identified from large-scale genomic datasets. However, they alone are not sufficient for translation into clinical practice. Future studies should aim to extend any associations found by integrating functional and multi-omics datasets to gain mechanistic insights into observed trends and facilitate treatment discovery (Zhang et al., 2022; Pain et al., 2023).

The fine-mapping and colocalisation analysis pipeline we have used is available as an open-access resource on GitHub to facilitate the application of these methods in future studies. Specified genomic regions can be readily analysed by providing GWAS summary statistics for binary or quantitative traits of interest and a population-appropriate reference dataset for estimation of LD. The pipeline returns resources including detailed reports that overview the analyses performed.

Acknowledgements

The authors acknowledge the use of the CREATE research computing facility at King’s College London, 2022. We also acknowledge Health Data Research UK, which is funded by the UK Medical Research Council, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (United Kingdom), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation and Wellcome Trust. This project was part funded by the MND Association and the Wellcome Trust. This is an EU Joint Programme-Neurodegenerative Disease Research (JPND) project. The project is supported through the following funding organisations under the aegis of JPND–http://www.neurodegenerationresearch.eu/ [United Kingdom, Medical Research Council (MR/L501529/1 and MR/R024804/1) and Economic and Social Research Council (ES/L008238/1)]. AAC is a NIHR Senior Investigator. AAC received salary support from the National Institute for Health Research (NIHR) Dementia Biomedical Research Unit at South London and Maudsley NHS Foundation Trust and King’s College London. The work leading up to this publication was funded by the European Community’s Health Seventh Framework Program (FP7/2007–2013; grant agreement number 259867) and Horizon 2020 Program (H2020-PHC-2014-two-stage; grant agreement number 633413). This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme grant agreement no. 772376–EScORIAL. This study represents independent research part funded by the NIHR Maudsley Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR, King’s College London, or the Department of Health and Social Care. Funding was also provided by the King’s College London DRIVE-Health Centre for Doctoral Training and the Perron Institute for Neurological and Translational Science. AI is funded by South London and Maudsley NHS Foundation Trust, MND Scotland, Motor Neurone Disease Association, National Institute for Health and Care Research, Spastic Paraplegia Foundation, Rosetrees Trust, Darby Rimmer MND Foundation, UK Research and Innovation (UKRI), Medical Research Council, LifeArc and Alzheimer’s Research UK. OP is supported by a Sir Henry Wellcome Postdoctoral Fellowship [222811/Z/21/Z]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. This research was funded in whole or in part by the Wellcome Trust [222811/Z/21/Z]. For the purpose of open access, the author has applied a CC-BY public copyright licence to any author accepted manuscript version arising from this submission.

Funding Statement

The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication. For the purpose of Open Access, the authors have applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission.

Contributor Information

Thomas P Spargo, Email: thomas.spargo@kcl.ac.uk.

Alfredo Iacoangeli, Email: alfredo.iacoangeli@kcl.ac.uk.

Joris Deelen, Max Planck Institute for Biology of Ageing, Germany.

Timothy E Behrens, University of Oxford, United Kingdom.

Funding Information

This paper was supported by the following grants:

  • Motor Neurone Disease Association to Thomas P Spargo, Ammar Al-Chalabi, Alfredo Iacoangeli.

  • Wellcome Trust to Ammar Al-Chalabi, Oliver Pain.

  • Medical Research Council to Ammar Al-Chalabi, Alfredo Iacoangeli.

  • Rosetrees Trust to Alfredo Iacoangeli.

  • MND Scotland to Ammar Al-Chalabi, Alfredo Iacoangeli.

  • Spastic Paraplegia Foundation to Ammar Al-Chalabi, Alfredo Iacoangeli.

  • Alzheimer’s Research UK to Ammar Al-Chalabi, Alfredo Iacoangeli.

  • Darby Rimmer MND Foundation to Alfredo Iacoangeli.

  • NIHR Maudsley Biomedical Research Centre to Thomas P Spargo, Richard JB Dobson, Ammar Al-Chalabi, Alfredo Iacoangeli.

  • LifeArc to Ammar Al-Chalabi, Alfredo Iacoangeli.

  • Medical Research Council MR/L501529/1 to Ammar Al-Chalabi.

  • Medical Research Council MR/R024804/1 to Ammar Al-Chalabi.

  • Economic and Social Research Council ES/L008238/1 to Ammar Al-Chalabi.

  • European Community’s Health Seventh Framework Programme 259867 to Ammar Al-Chalabi.

  • Horizon 2020 633413 to Ammar Al-Chalabi.

  • Horizon 2020 772376–EScORIAL to Ammar Al-Chalabi.

  • Wellcome Trust 10.35802/222811 to Oliver Pain.

  • King's College London DRIVE-Health Centre for Doctoral Training to Lachlan Gilchrist.

  • Perron Institute for Neurological and Translational Science to Alfredo Iacoangeli.

  • UK Research and Innovation to Alfredo Iacoangeli.

  • National Institute for Health and Care Research to Alfredo Iacoangeli.

  • South London and Maudsley NHS Foundation Trust to Alfredo Iacoangeli.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Software, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review and editing.

Investigation, Writing – review and editing.

Conceptualization, Software, Formal analysis, Writing – review and editing.

Conceptualization, Writing – review and editing.

Conceptualization, Writing – review and editing.

Conceptualization, Writing – review and editing.

Conceptualization, Supervision, Writing – review and editing.

Conceptualization, Supervision, Funding acquisition, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Additional files

Supplementary file 1. Extended results from local genetic correlation analysis and subsequent fine-mapping and colocalisation analyses across loci correlated between trait pairs.

(a) Results of all bivariate local genetic correlation analyses. (b) Results of colocalisation analyses performed across all single-nucleotide polymorphisms (SNPs) sampled in the region. Number of SNPs refers to the number of SNPs in common between the two traits analysed and present within the 1000 genomes reference panel. Comparisons where univariate fine-mapping identified at least one credible set in each trait were also performed on the basis of these credible sets (see Table 3). Annotations column: * denotes comparisons with genetic correlation analysis p-values below the strict Bonferroni correction threshold; all others passed false discovery rate (FDR) correction. ∆ denotes locus with boundaries extended by ±10 kb compared to the region partition defined in genetic correlation analysis. (c) Overview of credible sets identified across fine-mapping analyses in summary statistics harmonised across trait pairs.

elife-88768-supp1.xlsx (115.8KB, xlsx)
MDAR checklist

Data availability

All data used in this study are publicly available, including summary statistics from genome-wide association studies (GWAS) across each of the five analysed traits (Van Rheenen et al., 2021; Kunkle et al., 2019; Ferrari et al., 2014; Nalls et al., 2019; Trubetskoy et al., 2022; Pain et al., 2021) and the 1000 Genomes Phase 3 reference dataset (Auton et al., 2015). Code used to conduct the analyses described in this manuscript are available on GitHub, copy archived at Spargo, 2023.

References

  1. Abdellaoui A, Yengo L, Verweij KJH, Visscher PM. 15 years of GWAS discovery: realizing the promise. American Journal of Human Genetics. 2023;110:179–194. doi: 10.1016/j.ajhg.2022.12.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Al-Diwani AAJ, Pollak TA, Irani SR, Lennox BR. Psychosis: an autoimmune disease? Immunology. 2017;152:388–401. doi: 10.1111/imm.12795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Aliseychik MP, Andreeva TV, Rogaev EI. Immunogenetic factors of neurodegenerative diseases: the role of HLA Class II. Biochemistry. Biokhimiia. 2018;83:1104–1116. doi: 10.1134/S0006297918090122. [DOI] [PubMed] [Google Scholar]
  4. Allen M, Kachadoorian M, Quicksall Z, Zou F, Chai HS, Younkin C, Crook JE, Pankratz VS, Carrasquillo MM, Krishnan S, Nguyen T, Ma L, Malphrus K, Lincoln S, Bisceglio G, Kolbert CP, Jen J, Mukherjee S, Kauwe JK, Crane PK, Haines JL, Mayeux R, Pericak-Vance MA, Farrer LA, Schellenberg GD, Parisi JE, Petersen RC, Graff-Radford NR, Dickson DW, Younkin SG, Ertekin-Taner N. Association of MAPT haplotypes with Alzheimer’s disease risk and MAPT brain gene expression levels. Alzheimer’s Research & Therapy. 2014;6:39. doi: 10.1186/alzrt268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Alonso A, Logroscino G, Jick SS, Hernán MA. Incidence and lifetime risk of motor neuron disease in the United Kingdom: a population-based study. European Journal of Neurology. 2009;16:745–751. doi: 10.1111/j.1468-1331.2009.02586.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Altshuler DM, Gibbs RA, Peltonen L, Altshuler DM, Gibbs RA, Peltonen L, Dermitzakis E, Schaffner SF, Yu F, Peltonen L, Dermitzakis E, Bonnen PE, Altshuler DM, Gibbs RA, de Bakker PIW, Deloukas P, Gabriel SB, Gwilliam R, Hunt S, Inouye M, Jia X, Palotie A, Parkin M, Whittaker P, Yu F, Chang K, Hawes A, Lewis LR, Ren Y, Wheeler D, Gibbs RA, Muzny DM, Barnes C, Darvishi K, Hurles M, Korn JM, Kristiansson K, Lee C, McCarrol SA, Nemesh J, Dermitzakis E, Keinan A, Montgomery SB, Pollack S, Price AL, Soranzo N, Bonnen PE, Gibbs RA, Gonzaga-Jauregui C, Keinan A, Price AL, Yu F, Anttila V, Brodeur W, Daly MJ, Leslie S, McVean G, Moutsianas L, Nguyen H, Schaffner SF, Zhang Q, Ghori MJR, McGinnis R, McLaren W, Pollack S, Price AL, Schaffner SF, Takeuchi F, Grossman SR, Shlyakhter I, Hostetter EB, Sabeti PC, Adebamowo CA, Foster MW, Gordon DR, Licinio J, Manca MC, Marshall PA, Matsuda I, Ngare D, Wang VO, Reddy D, Rotimi CN, Royal CD, Sharp RR, Zeng C, Brooks LD, McEwen JE. Integrating common and rare genetic variation in diverse human populations. Nature. 2010;467:52–58. doi: 10.1038/nature09298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Andrés-Benito P, Moreno J, Aso E, Povedano M, Ferrer I. Amyotrophic lateral sclerosis, gene deregulation in the anterior horn of the spinal cord and frontal cortex area 8: implications in frontotemporal lobar degeneration. Aging. 2017;9:823–851. doi: 10.18632/aging.101195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Beck J, Poulter M, Hensman D, Rohrer JD, Mahoney CJ, Adamson G, Campbell T, Uphill J, Borg A, Fratta P, Orrell RW, Malaspina A, Rowe J, Brown J, Hodges J, Sidle K, Polke JM, Houlden H, Schott JM, Fox NC, Rossor MN, Tabrizi SJ, Isaacs AM, Hardy J, Warren JD, Collinge J, Mead S. Large C9orf72 hexanucleotide repeat expansions are seen in multiple neurodegenerative syndromes and are more frequent than expected in the UK population. American Journal of Human Genetics. 2013;92:345–353. doi: 10.1016/j.ajhg.2013.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bigdeli TB, Fanous AH, Li Y, Rajeevan N, Sayward F, Genovese G, Gupta R, Radhakrishnan K, Malhotra AK, Sun N, Lu Q, Hu Y, Li B, Chen Q, Mane S, Miller P, Cheung KH, Gur RE, Greenwood TA, Braff DL, Achtyes ED, Buckley PF, Escamilla MA, Lehrer D, Malaspina DP, McCarroll SA, Rapaport MH, Vawter MP, Pato MT, Pato CN, Zhao H, Kosten TR, Brophy M, Pyarajan S, Shi Y, O’Leary TJ, Gleason T, Przygodzki R, Muralidhar S, Gaziano JM, Huang GD, Concato J, Siever LJ, Aslan M, Harvey PD. Genome-wide association studies of Schizophrenia and bipolar disorder in a diverse cohort of US veterans. Schizophrenia Bulletin. 2021;47:517–529. doi: 10.1093/schbul/sbaa133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Broce I, Karch CM, Wen N, Fan CC, Wang Y, Tan CH, Kouri N, Ross OA, Höglinger GU, Muller U, Hardy J, International FTD-Genomics Consortium. Momeni P, Hess CP, Dillon WP, Miller ZA, Bonham LW, Rabinovici GD, Rosen HJ, Schellenberg GD, Franke A, Karlsen TH, Veldink JH, Ferrari R, Yokoyama JS, Miller BL, Andreassen OA, Dale AM, Desikan RS, Sugrue LP. Immune-related genetic enrichment in frontotemporal dementia: an analysis of genome-wide association studies. PLOS Medicine. 2018;15:e1002487. doi: 10.1371/journal.pmed.1002487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, Duncan L, Perry JRB, Patterson N, Robinson EB, Daly MJ, Price AL, Neale BM. An atlas of genetic correlations across human diseases and traits. Nature Genetics. 2015;47:1236–1241. doi: 10.1038/ng.3406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chêne G, Beiser A, Au R, Preis SR, Wolf PA, Dufouil C, Seshadri S. Gender and incidence of dementia in the Framingham Heart Study from mid-adult life. Alzheimer’s & Dementia. 2015;11:310–320. doi: 10.1016/j.jalz.2013.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cheng WW, Zhu Q, Zhang HY. Identifying risk genes and interpreting pathogenesis for Parkinson’s Disease by a multiomics analysis. Genes. 2020;11:1100. doi: 10.3390/genes11091100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chesmore K, Bartlett J, Williams SM. The ubiquity of pleiotropy in human disease. Human Genetics. 2018;137:39–44. doi: 10.1007/s00439-017-1854-z. [DOI] [PubMed] [Google Scholar]
  16. Coyle-Gilchrist ITS, Dick KM, Patterson K, Vázquez Rodríquez P, Wehmann E, Wilcox A, Lansdall CJ, Dawson KE, Wiggins J, Mead S, Brayne C, Rowe JB. Prevalence, characteristics, and survival of frontotemporal lobar degeneration syndromes. Neurology. 2016;86:1736–1743. doi: 10.1212/WNL.0000000000002638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Cunningham F, Allen JE, Allen J, Alvarez-Jarreta J, Amode MR, Armean IM, Austine-Orimoloye O, Azov AG, Barnes I, Bennett R, Berry A, Bhai J, Bignell A, Billis K, Boddu S, Brooks L, Charkhchi M, Cummins C, Da Rin Fioretto L, Davidson C, Dodiya K, Donaldson S, El Houdaigui B, El Naboulsi T, Fatima R, Giron CG, Genez T, Martinez JG, Guijarro-Clarke C, Gymer A, Hardy M, Hollis Z, Hourlier T, Hunt T, Juettemann T, Kaikala V, Kay M, Lavidas I, Le T, Lemos D, Marugán JC, Mohanan S, Mushtaq A, Naven M, Ogeh DN, Parker A, Parton A, Perry M, Piližota I, Prosovetskaia I, Sakthivel MP, Salam AIA, Schmitt BM, Schuilenburg H, Sheppard D, Pérez-Silva JG, Stark W, Steed E, Sutinen K, Sukumaran R, Sumathipala D, Suner MM, Szpak M, Thormann A, Tricomi FF, Urbina-Gómez D, Veidenberg A, Walsh TA, Walts B, Willhoft N, Winterbottom A, Wass E, Chakiachvili M, Flint B, Frankish A, Giorgetti S, Haggerty L, Hunt SE, IIsley GR, Loveland JE, Martin FJ, Moore B, Mudge JM, Muffato M, Perry E, Ruffier M, Tate J, Thybert D, Trevanion SJ, Dyer S, Harrison PW, Howe KL, Yates AD, Zerbino DR, Flicek P. Ensembl 2022. Nucleic Acids Research. 2022;50:D988–D995. doi: 10.1093/nar/gkab1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Dendrou CA, Petersen J, Rossjohn J, Fugger L. HLA variation and disease. Nature Reviews. Immunology. 2018;18:325–339. doi: 10.1038/nri.2017.143. [DOI] [PubMed] [Google Scholar]
  19. Durinck S, Moreau Y, Kasprzyk A, Davis S, De Moor B, Brazma A, Huber W. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics. 2005;21:3439–3440. doi: 10.1093/bioinformatics/bti525. [DOI] [PubMed] [Google Scholar]
  20. Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nature Protocols. 2009;4:1184–1191. doi: 10.1038/nprot.2009.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Ferrari R, Hernandez DG, Nalls MA, Rohrer JD, Ramasamy A, Kwok JBJ, Dobson-Stone C, Brooks WS, Schofield PR, Halliday GM, Hodges JR, Piguet O, Bartley L, Thompson E, Haan E, Hernández I, Ruiz A, Boada M, Borroni B, Padovani A, Cruchaga C, Cairns NJ, Benussi L, Binetti G, Ghidoni R, Forloni G, Galimberti D, Fenoglio C, Serpente M, Scarpini E, Clarimón J, Lleó A, Blesa R, Waldö ML, Nilsson K, Nilsson C, Mackenzie IRA, Hsiung G-YR, Mann DMA, Grafman J, Morris CM, Attems J, Griffiths TD, McKeith IG, Thomas AJ, Pietrini P, Huey ED, Wassermann EM, Baborie A, Jaros E, Tierney MC, Pastor P, Razquin C, Ortega-Cubero S, Alonso E, Perneczky R, Diehl-Schmid J, Alexopoulos P, Kurz A, Rainero I, Rubino E, Pinessi L, Rogaeva E, St George-Hyslop P, Rossi G, Tagliavini F, Giaccone G, Rowe JB, Schlachetzki JCM, Uphill J, Collinge J, Mead S, Danek A, Van Deerlin VM, Grossman M, Trojanowski JQ, van der Zee J, Deschamps W, Van Langenhove T, Cruts M, Van Broeckhoven C, Cappa SF, Le Ber I, Hannequin D, Golfier V, Vercelletto M, Brice A, Nacmias B, Sorbi S, Bagnoli S, Piaceri I, Nielsen JE, Hjermind LE, Riemenschneider M, Mayhaus M, Ibach B, Gasparoni G, Pichler S, Gu W, Rossor MN, Fox NC, Warren JD, Spillantini MG, Morris HR, Rizzu P, Heutink P, Snowden JS, Rollinson S, Richardson A, Gerhard A, Bruni AC, Maletta R, Frangipane F, Cupidi C, Bernardi L, Anfossi M, Gallo M, Conidi ME, Smirne N, Rademakers R, Baker M, Dickson DW, Graff-Radford NR, Petersen RC, Knopman D, Josephs KA, Boeve BF, Parisi JE, Seeley WW, Miller BL, Karydas AM, Rosen H, van Swieten JC, Dopper EGP, Seelaar H, Pijnenburg YAL, Scheltens P, Logroscino G, Capozzo R, Novelli V, Puca AA, Franceschi M, Postiglione A, Milan G, Sorrentino P, Kristiansen M, Chiang H-H, Graff C, Pasquier F, Rollin A, Deramecourt V, Lebert F, Kapogiannis D, Ferrucci L, Pickering-Brown S, Singleton AB, Hardy J, Momeni P. Frontotemporal dementia and its subtypes: a genome-wide association study. The Lancet. Neurology. 2014;13:686–699. doi: 10.1016/S1474-4422(14)70065-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Ferrari R, Forabosco P, Vandrovcova J, Botía JA, Guelfi S, Warren JD, Momeni P, Weale ME, Ryten M, Hardy J. Frontotemporal dementia: insights into the biological underpinnings of disease through gene co-expression network analysis. Molecular Neurodegeneration. 2016;11:21. doi: 10.1186/s13024-016-0085-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ferrari R, Wang Y, Vandrovcova J, Guelfi S, Witeolar A, Karch CM, Schork AJ, Fan CC, Brewer JB, Momeni P, Schellenberg GD, Dillon WP, Sugrue LP, Hess CP, Yokoyama JS, Bonham LW, Rabinovici GD, Miller BL, Andreassen OA, Dale AM, Hardy J, Desikan RS. Genetic architecture of sporadic frontotemporal dementia and overlap with Alzheimer’s and Parkinson’s diseases. Journal of Neurology, Neurosurgery, and Psychiatry. 2017;88:152–164. doi: 10.1136/jnnp-2016-314411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Foley CN, Staley JR, Breen PG, Sun BB, Kirk PDW, Burgess S, Howson JMM. A fast and efficient colocalization algorithm for identifying shared genetic risk factors across multiple traits. Nature Communications. 2021;12:764. doi: 10.1038/s41467-020-20885-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Genome Reference Consortium Human Genome Region MHC. 2023. [March 7, 2023]. https://www.ncbi.nlm.nih.gov/grc/human/regions/MHC?asm=GRCh37
  26. Giambartolomei C, Vukcevic D, Schadt EE, Franke L, Hingorani AD, Wallace C, Plagnol V. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLOS Genetics. 2014;10:e1004383. doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Giambartolomei C, Zhenli Liu J, Zhang W, Hauberg M, Shi H, Boocock J, Pickrell J, Jaffe AE, Pasaniuc B, Roussos P. A Bayesian framework for multiple trait colocalization from summary association statistics. Bioinformatics. 2018;34:2538–2545. doi: 10.1093/bioinformatics/bty147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Grotzinger AD, De la Fuente J, Privé F, Nivard MG, Tucker-Drob EM. Pervasive downward bias in estimates of liability-scale heritability in genome-wide association study meta-analysis: a simple solution. Biological Psychiatry. 2023;93:29–36. doi: 10.1016/j.biopsych.2022.05.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Hopperton KE, Mohammad D, Trépanier MO, Giuliano V, Bazinet RP. Markers of microglia in post-mortem brain samples from patients with Alzheimer’s disease: a systematic review. Molecular Psychiatry. 2018;23:177–198. doi: 10.1038/mp.2017.246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Johnston CA, Stanton BR, Turner MR, Gray R, Blunt AH-M, Butt D, Ampong M-A, Shaw CE, Leigh PN, Al-Chalabi A. Amyotrophic lateral sclerosis in an urban setting: a population based study of inner city London. Journal of Neurology. 2006;253:1642–1643. doi: 10.1007/s00415-006-0195-y. [DOI] [PubMed] [Google Scholar]
  31. King’s College London King’s Computational Research, Engineering and Technology Environment (CREATE) 2022. [October 12, 2022]. [DOI]
  32. Kunkle BW, Grenier-Boley B, Sims R, Bis JC, Damotte V, Naj AC, Boland A, Vronskaya M, van der Lee SJ, Amlie-Wolf A, Bellenguez C, Frizatti A, Chouraki V, Martin ER, Sleegers K, Badarinarayan N, Jakobsdottir J, Hamilton-Nelson KL, Moreno-Grau S, Olaso R, Raybould R, Chen Y, Kuzma AB, Hiltunen M, Morgan T, Ahmad S, Vardarajan BN, Epelbaum J, Hoffmann P, Boada M, Beecham GW, Garnier JG, Harold D, Fitzpatrick AL, Valladares O, Moutet ML, Gerrish A, Smith AV, Qu L, Bacq D, Denning N, Jian X, Zhao Y, Del Zompo M, Fox NC, Choi SH, Mateo I, Hughes JT, Adams HH, Malamon J, Sanchez-Garcia F, Patel Y, Brody JA, Dombroski BA, Naranjo MCD, Daniilidou M, Eiriksdottir G, Mukherjee S, Wallon D, Uphill J, Aspelund T, Cantwell LB, Garzia F, Galimberti D, Hofer E, Butkiewicz M, Fin B, Scarpini E, Sarnowski C, Bush WS, Meslage S, Kornhuber J, White CC, Song Y, Barber RC, Engelborghs S, Sordon S, Voijnovic D, Adams PM, Vandenberghe R, Mayhaus M, Cupples LA, Albert MS, De Deyn PP, Gu W, Himali JJ, Beekly D, Squassina A, Hartmann AM, Orellana A, Blacker D, Rodriguez-Rodriguez E, Lovestone S, Garcia ME, Doody RS, Munoz-Fernadez C, Sussams R, Lin H, Fairchild TJ, Benito YA, Holmes C, Karamujić-Čomić H, Frosch MP, Thonberg H, Maier W, Roshchupkin G, Ghetti B, Giedraitis V, Kawalia A, Li S, Huebinger RM, Kilander L, Moebus S, Hernández I, Kamboh MI, Brundin R, Turton J, Yang Q, Katz MJ, Concari L, Lord J, Beiser AS, Keene CD, Helisalmi S, Kloszewska I, Kukull WA, Koivisto AM, Lynch A, Tarraga L, Larson EB, Haapasalo A, Lawlor B, Mosley TH, Lipton RB, Solfrizzi V, Gill M, Longstreth WT, Jr, Montine TJ, Frisardi V, Diez-Fairen M, Rivadeneira F, Petersen RC, Deramecourt V, Alvarez I, Salani F, Ciaramella A, Boerwinkle E, Reiman EM, Fievet N, Rotter JI, Reisch JS, Hanon O, Cupidi C, Andre Uitterlinden AG, Royall DR, Dufouil C, Maletta RG, de Rojas I, Sano M, Brice A, Cecchetti R, George-Hyslop PS, Ritchie K, Tsolaki M, Tsuang DW, Dubois B, Craig D, Wu CK, Soininen H, Avramidou D, Albin RL, Fratiglioni L, Germanou A, Apostolova LG, Keller L, Koutroumani M, Arnold SE, Panza F, Gkatzima O, Asthana S, Hannequin D, Whitehead P, Atwood CS, Caffarra P, Hampel H, Quintela I, Carracedo Á, Lannfelt L, Rubinsztein DC, Barnes LL, Pasquier F, Frölich L, Barral S, McGuinness B, Beach TG, Johnston JA, Becker JT, Passmore P, Bigio EH, Schott JM, Bird TD, Warren JD, Boeve BF, Lupton MK, Bowen JD, Proitsi P, Boxer A, Powell JF, Burke JR, Kauwe JSK, Burns JM, Mancuso M, Buxbaum JD, Bonuccelli U, Cairns NJ, McQuillin A, Cao C, Livingston G, Carlson CS, Bass NJ, Carlsson CM, Hardy J, Carney RM, Bras J, Carrasquillo MM, Guerreiro R, Allen M, Chui HC, Fisher E, Masullo C, Crocco EA, DeCarli C, Bisceglio G, Dick M, Ma L, Duara R, Graff-Radford NR, Evans DA, Hodges A, Faber KM, Scherer M, Fallon KB, Riemenschneider M, Fardo DW, Heun R, Farlow MR, Kölsch H, Ferris S, Leber M, Foroud TM, Heuser I, Galasko DR, Giegling I, Gearing M, Hüll M, Geschwind DH, Gilbert JR, Morris J, Green RC, Mayo K, Growdon JH, Feulner T, Hamilton RL, Harrell LE, Drichel D, Honig LS, Cushion TD, Huentelman MJ, Hollingworth P, Hulette CM, Hyman BT, Marshall R, Jarvik GP, Meggy A, Abner E, Menzies GE, Jin LW, Leonenko G, Real LM, Jun GR, Baldwin CT, Grozeva D, Karydas A, Russo G, Kaye JA, Kim R, Jessen F, Kowall NW, Vellas B, Kramer JH, Vardy E, LaFerla FM, Jöckel KH, Lah JJ, Dichgans M, Leverenz JB, Mann D, Levey AI, Pickering-Brown S, Lieberman AP, Klopp N, Lunetta KL, Wichmann HE, Lyketsos CG, Morgan K, Marson DC, Brown K, Martiniuk F, Medway C, Mash DC, Nöthen MM, Masliah E, Hooper NM, McCormick WC, Daniele A, McCurry SM, Bayer A, McDavid AN, Gallacher J, McKee AC, van den Bussche H, Mesulam M, Brayne C, Miller BL, Riedel-Heller S, Miller CA, Miller JW, Al-Chalabi A, Morris JC, Shaw CE, Myers AJ, Wiltfang J, O’Bryant S, Olichney JM, Alvarez V, Parisi JE, Singleton AB, Paulson HL, Collinge J, Perry WR, Mead S, Peskind E, Cribbs DH, Rossor M, Pierce A, Ryan NS, Poon WW, Nacmias B, Potter H, Sorbi S, Quinn JF, Sacchinelli E, Raj A, Spalletta G, Raskind M, Caltagirone C, Bossù P, Orfei MD, Reisberg B, Clarke R, Reitz C, Smith AD, Ringman JM, Warden D, Roberson ED, Wilcock G, Rogaeva E, Bruni AC, Rosen HJ, Gallo M, Rosenberg RN, Ben-Shlomo Y, Sager MA, Mecocci P, Saykin AJ, Pastor P, Cuccaro ML, Vance JM, Schneider JA, Schneider LS, Slifer S, Seeley WW, Smith AG, Sonnen JA, Spina S, Stern RA, Swerdlow RH, Tang M, Tanzi RE, Trojanowski JQ, Troncoso JC, Van Deerlin VM, Van Eldik LJ, Vinters HV, Vonsattel JP, Weintraub S, Welsh-Bohmer KA, Wilhelmsen KC, Williamson J, Wingo TS, Woltjer RL, Wright CB, Yu CE, Yu L, Saba Y, Pilotto A, Bullido MJ, Peters O, Crane PK, Bennett D, Bosco P, Coto E, Boccardi V, De Jager PL, Lleo A, Warner N, Lopez OL, Ingelsson M, Deloukas P, Cruchaga C, Graff C, Gwilliam R, Fornage M, Goate AM, Sanchez-Juan P, Kehoe PG, Amin N, Ertekin-Taner N, Berr C, Debette S, Love S, Launer LJ, Younkin SG, Dartigues JF, Corcoran C, Ikram MA, Dickson DW, Nicolas G, Campion D, Tschanz J, Schmidt H, Hakonarson H, Clarimon J, Munger R, Schmidt R, Farrer LA, Van Broeckhoven C, C O’Donovan M, DeStefano AL, Jones L, Haines JL, Deleuze JF, Owen MJ, Gudnason V, Mayeux R, Escott-Price V, Psaty BM, Ramirez A, Wang LS, Ruiz A, van Duijn CM, Holmans PA, Seshadri S, Williams J, Amouyel P, Schellenberg GD, Lambert JC, Pericak-Vance MA. Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing. Nature Genetics. 2019;51:414–430. doi: 10.1038/s41588-019-0358-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Li C, Yang T, Ou R, Shang H. Overlapping genetic architecture between schizophrenia and neurodegenerative disorders. Frontiers in Cell and Developmental Biology. 2021;9:797072. doi: 10.3389/fcell.2021.797072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. McLaughlin RL, Schijven D, van Rheenen W, van Eijk KR, O’Brien M, Kahn RS, Ophoff RA, Goris A, Bradley DG, Al-Chalabi A, van den Berg LH, Luykx JJ, Hardiman O, Veldink JH. Genetic correlation between amyotrophic lateral sclerosis and schizophrenia. Nature Communications. 2017;8:14774. doi: 10.1038/ncomms14774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Mokhtari R, Lachman HM. The major histocompatibility complex (MHC) in Schizophrenia: a review. Journal of Clinical & Cellular Immunology. 2016;7:479. doi: 10.4172/2155-9899.1000479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Nakayama S, Shimonaka S, Elahi M, Nishioka K, Oji Y, Matsumoto SE, Li Y, Yoshino H, Mogushi K, Hatano T, Sato T, Ikura T, Ito N, Motoi Y, Hattori N. Tau aggregation and seeding analyses of two novel MAPT variants found in patients with motor neuron disease and progressive parkinsonism. Neurobiology of Aging. 2019;84:240. doi: 10.1016/j.neurobiolaging.2019.02.016. [DOI] [PubMed] [Google Scholar]
  37. Nalls MA, Blauwendraat C, Vallerga CL, Heilbron K, Bandres-Ciga S, Chang D, Tan M, Kia DA, Noyce AJ, Xue A, Bras J, Young E, von Coelln R, Simón-Sánchez J, Schulte C, Sharma M, Krohn L, Pihlstrøm L, Siitonen A, Iwaki H, Leonard H, Faghri F, Gibbs JR, Hernandez DG, Scholz SW, Botia JA, Martinez M, Corvol JC, Lesage S, Jankovic J, Shulman LM, Sutherland M, Tienari P, Majamaa K, Toft M, Andreassen OA, Bangale T, Brice A, Yang J, Gan-Or Z, Gasser T, Heutink P, Shulman JM, Wood NW, Hinds DA, Hardy JA, Morris HR, Gratten J, Visscher PM, Graham RR, Singleton AB. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. The Lancet. Neurology. 2019;18:1091–1102. doi: 10.1016/S1474-4422(19)30320-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Origone P, Geroldi A, Lamp M, Sanguineri F, Caponnetto C, Cabona C, Gotta F, Trevisan L, Bellone E, Manganelli F, Devigili G, Mandich P. Role of MAPT in Pure motor neuron disease: Report of a recurrent mutation in italian patients. Neuro-Degenerative Diseases. 2018;18:310–314. doi: 10.1159/000497820. [DOI] [PubMed] [Google Scholar]
  39. Pain O, Glanville KP, Hagenaars SP, Selzam S, Fürtjes AE, Gaspar HA, Coleman JRI, Rimfeld K, Breen G, Plomin R, Folkersen L, Lewis CM. Evaluation of polygenic prediction methodology within a reference-standardized framework. PLOS Genetics. 2021;17:e1009021. doi: 10.1371/journal.pgen.1009021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Pain O, Jones A, Al Khleifat A, Agarwal D, Hramyka D, Karoui H, Kubica J, Llewellyn DJ, Ranson JM, Yao Z, Iacoangeli A, Al-Chalabi A. Harnessing Transcriptomic Signals for Amyotrophic Lateral Sclerosis to Identify Novel Drugs and Enhance Risk Prediction. medRxiv. 2023 doi: 10.1101/2023.01.18.23284589. [DOI] [PMC free article] [PubMed]
  41. Parkinson’s The Incidence and Prevalence of Parkinson’s in the UK: Results from the Clinical Practice Research Datalink Reference Report. 2017. [December 11, 2024]. https://www.parkinsons.org.uk/professionals/resources/incidence-and-prevalence-parkinsons-uk-report
  42. Privé F, Aschard H, Ziyatdinov A, Blum MGB. Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr. Bioinformatics. 2018;34:2781–2787. doi: 10.1093/bioinformatics/bty185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, Sklar P, de Bakker PIW, Daly MJ, Sham PC. PLINK: A tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Purcell S. Whole genome association analysis toolset. v1.07Plink. 2009 http://pngu.mgh.harvard.edu/purcell/plink/
  45. Ranganathan R, Haque S, Coley K, Shepheard S, Cooper-Knock J, Kirby J. Multifaceted genes in amyotrophic lateral sclerosis-frontotemporal dementia. Frontiers in Neuroscience. 2020;14:684. doi: 10.3389/fnins.2020.00684. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. R Development Core Team . Vienna, Austria: R Foundation for Statistical Computing; 2021. https://www.R-project.org/ [Google Scholar]
  47. Reales G, Wallace C. Sharing GWAS summary statistics results in more citations. Commun Biol. 2023;47:291–295. doi: 10.1038/ng.3211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Saha S, Chant D, Welham J, McGrath J. A systematic review of the prevalence of schizophrenia. PLOS Medicine. 2005;2:e141. doi: 10.1371/journal.pmed.0020141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Shi H, Mancuso N, Spendlove S, Pasaniuc B. Local genetic correlation gives insights into the shared genetic architecture of complex traits. American Journal of Human Genetics. 2017;101:737–751. doi: 10.1016/j.ajhg.2017.09.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Snowden JS, Adams J, Harris J, Thompson JC, Rollinson S, Richardson A, Jones M, Neary D, Mann DM, Pickering-Brown S. Distinct clinical and pathological phenotypes in frontotemporal dementia associated with MAPT, PGRN and C9orf72 mutations. Amyotrophic Lateral Sclerosis & Frontotemporal Degeneration. 2015;16:497–505. doi: 10.3109/21678421.2015.1074700. [DOI] [PubMed] [Google Scholar]
  51. Sollis E, Mosaku A, Abid A, Buniello A, Cerezo M, Gil L, Groza T, Güneş O, Hall P, Hayhurst J, Ibrahim A, Ji Y, John S, Lewis E, MacArthur JAL, McMahon A, Osumi-Sutherland D, Panoutsopoulou K, Pendlington Z, Ramachandran S, Stefancsik R, Stewart J, Whetzel P, Wilson R, Hindorff L, Cunningham F, Lambert SA, Inouye M, Parkinson H, Harris LW. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Research. 2023;51:D977–D985. doi: 10.1093/nar/gkac1010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Song S, Miranda CJ, Braun L, Meyer K, Frakes AE, Ferraiuolo L, Likhite S, Bevan AK, Foust KD, McConnell MJ, Walker CM, Kaspar BK. Major histocompatibility complex class I molecules protect motor neurons from astrocyte-induced toxicity in amyotrophic lateral sclerosis. Nature Medicine. 2016;22:397–403. doi: 10.1038/nm.4052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Spargo T. COLOC-reporter. swh:1:rev:7b9bfd445870ebbec17968431e7b30b1300ce8f6Software Heritage. 2023 https://archive.softwareheritage.org/swh:1:dir:9000b9ab65ea0d2bd883f454e4415e620782b335;origin=https://github.com/ThomasPSpargo/COLOC-reporter;visit=swh:1:snp:63a2788dd34c0d6635460c6aad4a5f7c8f289a2e;anchor=swh:1:rev:7b9bfd445870ebbec17968431e7b30b1300ce8f6
  54. Trowsdale J, Knight JC. Major histocompatibility complex genomics and human disease. Annual Review of Genomics and Human Genetics. 2013;14:301–323. doi: 10.1146/annurev-genom-091212-153455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Trubetskoy V, Pardiñas AF, Qi T, Panagiotaropoulou G, Awasthi S, Bigdeli TB, Bryois J, Chen CY, Dennison CA, Hall LS, Lam M, Watanabe K, Frei O, Ge T, Harwood JC, Koopmans F, Magnusson S, Richards AL, Sidorenko J, Wu Y, Zeng J, Grove J, Kim M, Li Z, Voloudakis G, Zhang W, Adams M, Agartz I, Atkinson EG, Agerbo E, Al Eissa M, Albus M, Alexander M, Alizadeh BZ, Alptekin K, Als TD, Amin F, Arolt V, Arrojo M, Athanasiu L, Azevedo MH, Bacanu SA, Bass NJ, Begemann M, Belliveau RA, Bene J, Benyamin B, Bergen SE, Blasi G, Bobes J, Bonassi S, Braun A, Bressan RA, Bromet EJ, Bruggeman R, Buckley PF, Buckner RL, Bybjerg-Grauholm J, Cahn W, Cairns MJ, Calkins ME, Carr VJ, Castle D, Catts SV, Chambert KD, Chan RCK, Chaumette B, Cheng W, Cheung EFC, Chong SA, Cohen D, Consoli A, Cordeiro Q, Costas J, Curtis C, Davidson M, Davis KL, de Haan L, Degenhardt F, DeLisi LE, Demontis D, Dickerson F, Dikeos D, Dinan T, Djurovic S, Duan J, Ducci G, Dudbridge F, Eriksson JG, Fañanás L, Faraone SV, Fiorentino A, Forstner A, Frank J, Freimer NB, Fromer M, Frustaci A, Gadelha A, Genovese G, Gershon ES, Giannitelli M, Giegling I, Giusti-Rodríguez P, Godard S, Goldstein JI, González Peñas J, González-Pinto A, Gopal S, Gratten J, Green MF, Greenwood TA, Guillin O, Gülöksüz S, Gur RE, Gur RC, Gutiérrez B, Hahn E, Hakonarson H, Haroutunian V, Hartmann AM, Harvey C, Hayward C, Henskens FA, Herms S, Hoffmann P, Howrigan DP, Ikeda M, Iyegbe C, Joa I, Julià A, Kähler AK, Kam-Thong T, Kamatani Y, Karachanak-Yankova S, Kebir O, Keller MC, Kelly BJ, Khrunin A, Kim SW, Klovins J, Kondratiev N, Konte B, Kraft J, Kubo M, Kučinskas V, Kučinskiene ZA, Kusumawardhani A, Kuzelova-Ptackova H, Landi S, Lazzeroni LC, Lee PH, Legge SE, Lehrer DS, Lencer R, Lerer B, Li M, Lieberman J, Light GA, Limborska S, Liu CM, Lönnqvist J, Loughland CM, Lubinski J, Luykx JJ, Lynham A, Macek M, Jr, Mackinnon A, Magnusson PKE, Maher BS, Maier W, Malaspina D, Mallet J, Marder SR, Marsal S, Martin AR, Martorell L, Mattheisen M, McCarley RW, McDonald C, McGrath JJ, Medeiros H, Meier S, Melegh B, Melle I, Mesholam-Gately RI, Metspalu A, Michie PT, Milani L, Milanova V, Mitjans M, Molden E, Molina E, Molto MD, Mondelli V, Moreno C, Morley CP, Muntané G, Murphy KC, Myin-Germeys I, Nenadić I, Nestadt G, Nikitina-Zake L, Noto C, Nuechterlein KH, O’Brien NL, O’Neill FA, Oh SY, Olincy A, Ota VK, Pantelis C, Papadimitriou GN, Parellada M, Paunio T, Pellegrino R, Periyasamy S, Perkins DO, Pfuhlmann B, Pietiläinen O, Pimm J, Porteous D, Powell J, Quattrone D, Quested D, Radant AD, Rampino A, Rapaport MH, Rautanen A, Reichenberg A, Roe C, Roffman JL, Roth J, Rothermundt M, Rutten BPF, Saker-Delye S, Salomaa V, Sanjuan J, Santoro ML, Savitz A, Schall U, Scott RJ, Seidman LJ, Sharp SI, Shi J, Siever LJ, Sigurdsson E, Sim K, Skarabis N, Slominsky P, So HC, Sobell JL, Söderman E, Stain HJ, Steen NE, Steixner-Kumar AA, Stögmann E, Stone WS, Straub RE, Streit F, Strengman E, Stroup TS, Subramaniam M, Sugar CA, Suvisaari J, Svrakic DM, Swerdlow NR, Szatkiewicz JP, Ta TMT, Takahashi A, Terao C, Thibaut F, Toncheva D, Tooney PA, Torretta S, Tosato S, Tura GB, Turetsky BI, Üçok A, Vaaler A, van Amelsvoort T, van Winkel R, Veijola J, Waddington J, Walter H, Waterreus A, Webb BT, Weiser M, Williams NM, Witt SH, Wormley BK, Wu JQ, Xu Z, Yolken R, Zai CC, Zhou W, Zhu F, Zimprich F, Atbaşoğlu EC, Ayub M, Benner C, Bertolino A, Black DW, Bray NJ, Breen G, Buccola NG, Byerley WF, Chen WJ, Cloninger CR, Crespo-Facorro B, Donohoe G, Freedman R, Galletly C, Gandal MJ, Gennarelli M, Hougaard DM, Hwu HG, Jablensky AV, McCarroll SA, Moran JL, Mors O, Mortensen PB, Müller-Myhsok B, Neil AL, Nordentoft M, Pato MT, Petryshen TL, Pirinen M, Pulver AE, Schulze TG, Silverman JM, Smoller JW, Stahl EA, Tsuang DW, Vilella E, Wang SH, Xu S, Adolfsson R, Arango C, Baune BT, Belangero SI, Børglum AD, Braff D, Bramon E, Buxbaum JD, Campion D, Cervilla JA, Cichon S, Collier DA, Corvin A, Curtis D, Forti MD, Domenici E, Ehrenreich H, Escott-Price V, Esko T, Fanous AH, Gareeva A, Gawlik M, Gejman PV, Gill M, Glatt SJ, Golimbet V, Hong KS, Hultman CM, Hyman SE, Iwata N, Jönsson EG, Kahn RS, Kennedy JL, Khusnutdinova E, Kirov G, Knowles JA, Krebs MO, Laurent-Levinson C, Lee J, Lencz T, Levinson DF, Li QS, Liu J, Malhotra AK, Malhotra D, McIntosh A, McQuillin A, Menezes PR, Morgan VA, Morris DW, Mowry BJ, Murray RM, Nimgaonkar V, Nöthen MM, Ophoff RA, Paciga SA, Palotie A, Pato CN, Qin S, Rietschel M, Riley BP, Rivera M, Rujescu D, Saka MC, Sanders AR, Schwab SG, Serretti A, Sham PC, Shi Y, St Clair D, Stefánsson H, Stefansson K, Tsuang MT, van Os J, Vawter MP, Weinberger DR, Werge T, Wildenauer DB, Yu X, Yue W, Holmans PA, Pocklington AJ, Roussos P, Vassos E, Verhage M, Visscher PM, Yang J, Posthuma D, Andreassen OA, Kendler KS, Owen MJ, Wray NR, Daly MJ, Huang H, Neale BM, Sullivan PF, Ripke S, Walters JTR, O’Donovan MC. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature. 2022;604:502–508. doi: 10.1038/s41586-022-04434-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Van Rheenen W, Van der Spek RAA, Bakker MK, Van Vugt JJFA, Hop PJ, Zwamborn RAJ, De Klein N, Westra H-J, Bakker OB, Deelen P, Shireby G, Hannon E, Moisse M, Baird D, Restuadi R, Dolzhenko E, Dekker AM, Gawor K, Westeneng H-J, Tazelaar GHP, Van Eijk KR, Kooyman M, Byrne RP, Doherty M, Heverin M, Al Khleifat A, Iacoangeli A, Shatunov A, Ticozzi N, Cooper-Knock J, Smith BN, Gromicho M, Chandran S, Pal S, Morrison KE, Shaw PJ, Hardy J, Orrell RW, Sendtner M, Meyer T, Başak N, van der Kooi AJ, Ratti A, Fogh I, Gellera C, Lauria G, Corti S, Cereda C, Sproviero D, D’Alfonso S, Sorarù G, Siciliano G, Filosto M, Padovani A, Chiò A, Calvo A, Moglia C, Brunetti M, Canosa A, Grassano M, Beghi E, Pupillo E, Logroscino G, Nefussy B, Osmanovic A, Nordin A, Lerner Y, Zabari M, Gotkine M, Baloh RH, Bell S, Vourc’h P, Corcia P, Couratier P, Millecamps S, Meininger V, Salachas F, Mora Pardina JS, Assialioui A, Rojas-García R, Dion PA, Ross JP, Ludolph AC, Weishaupt JH, Brenner D, Freischmidt A, Bensimon G, Brice A, Durr A, Payan CAM, Saker-Delye S, Wood NW, Topp S, Rademakers R, Tittmann L, Lieb W, Franke A, Ripke S, Braun A, Kraft J, Whiteman DC, Olsen CM, Uitterlinden AG, Hofman A, Rietschel M, Cichon S, Nöthen MM, Amouyel P, SLALOM Consortium. PARALS Consortium. SLAGEN Consortium. SLAP Consortium. Traynor BJ, Singleton AB, Mitne Neto M, Cauchi RJ, Ophoff RA, Wiedau-Pazos M, Lomen-Hoerth C, van Deerlin VM, Grosskreutz J, Roediger A, Gaur N, Jörk A, Barthel T, Theele E, Ilse B, Stubendorff B, Witte OW, Steinbach R, Hübner CA, Graff C, Brylev L, Fominykh V, Demeshonok V, Ataulina A, Rogelj B, Koritnik B, Zidar J, Ravnik-Glavač M, Glavač D, Stević Z, Drory V, Povedano M, Blair IP, Kiernan MC, Benyamin B, Henderson RD, Furlong S, Mathers S, McCombe PA, Needham M, Ngo ST, Nicholson GA, Pamphlett R, Rowe DB, Steyn FJ, Williams KL, Mather KA, Sachdev PS, Henders AK, Wallace L, de Carvalho M, Pinto S, Petri S, Weber M, Rouleau GA, Silani V, Curtis CJ, Breen G, Glass JD, Brown RH, Jr, Landers JE, Shaw CE, Andersen PM, Groen EJN, van Es MA, Pasterkamp RJ, Fan D, Garton FC, McRae AF, Davey Smith G, Gaunt TR, Eberle MA, Mill J, McLaughlin RL, Hardiman O, Kenna KP, Wray NR, Tsai E, Runz H, Franke L, Al-Chalabi A, Van Damme P, Van den Berg LH, Veldink JH. Common and rare variant association analyses in amyotrophic lateral sclerosis identify 15 risk loci with distinct genetic architectures and neuron-specific biology. Nature Genetics. 2021;53:1636–1648. doi: 10.1038/s41588-021-00973-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Wainberg M, Andrews SJ, Tripathy SJ. Shared genetic risk loci between Alzheimer’s disease and related dementias, Parkinson’s disease, and amyotrophic lateral sclerosis. Alzheimer’s Research & Therapy. 2023;15:113. doi: 10.1186/s13195-023-01244-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Wallace C. Eliciting priors and relaxing the single causal variant assumption in colocalisation analyses. PLOS Genetics. 2020;16:e1008720. doi: 10.1371/journal.pgen.1008720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Wallace C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLOS Genetics. 2021;17:e1009440. doi: 10.1371/journal.pgen.1009440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Wang G, Sarkar A, Carbonetto P, Stephens M. A simple new approach to variable selection in regression, with application to genetic fine mapping. Journal of the Royal Statistical Society. Series B, Statistical Methodology. 2020a;82:1273–1300. doi: 10.1111/rssb.12388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Wang ZX, Wan Q, Xing A. HLA in alzheimer’s disease: genetic association and possible pathogenic roles. Neuromolecular Medicine. 2020b;22:464–473. doi: 10.1007/s12017-020-08612-4. [DOI] [PubMed] [Google Scholar]
  62. Watanabe K, Stringer S, Frei O, Umićević Mirkov M, de Leeuw C, Polderman TJC, van der Sluis S, Andreassen OA, Neale BM, Posthuma D. A global overview of pleiotropy and genetic architecture in complex traits. Nature Genetics. 2019;51:1339–1348. doi: 10.1038/s41588-019-0481-0. [DOI] [PubMed] [Google Scholar]
  63. Weintraub D, Mamikonyan E. The neuropsychiatry of parkinson disease: a perfect storm. The American Journal of Geriatric Psychiatry. 2019;27:998–1018. doi: 10.1016/j.jagp.2019.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Werme J, van der Sluis S, Posthuma D, de Leeuw CA. An integrated framework for local genetic correlation analysis. Nature Genetics. 2022;54:274–282. doi: 10.1038/s41588-022-01017-y. [DOI] [PubMed] [Google Scholar]
  65. Yu E, Ambati A, Andersen MS, Krohn L, Estiar MA, Saini P, Senkevich K, Sosero YL, Sreelatha AAK, Ruskey JA, Asayesh F, Spiegelman D, Toft M, Viken MK, Sharma M, Blauwendraat C, Pihlstrøm L, Mignot E, Gan-Or Z. Fine mapping of the HLA locus in Parkinson’s disease in Europeans. NPJ Parkinson’s Disease. 2021;7:84. doi: 10.1038/s41531-021-00231-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Zhang Y, Lu Q, Ye Y, Huang K, Liu W, Wu Y, Zhong X, Li B, Yu Z, Travers BG, Werling DM, Li JJ, Zhao H. SUPERGNOVA: local genetic correlation analysis reveals heterogeneous etiologic sharing of complex traits. Genome Biology. 2021;22:262. doi: 10.1186/s13059-021-02478-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Zhang X, Zou M, Wu Y, Jiang D, Wu T, Zhao Y, Wu D, Cui J, Li G. Regulation of the late onset alzheimer’s disease associated HLA-DQA1/DRB1 Expression. American Journal of Alzheimer’s Disease and Other Dementias. 2022;37:15333175221085066. doi: 10.1177/15333175221085066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Zou Y, Carbonetto P, Wang G, Stephens M. Fine-mapping from summary data with the “Sum of Single Effects” model. PLOS Genetics. 2022;18:e1010299. doi: 10.1371/journal.pgen.1010299. [DOI] [PMC free article] [PubMed] [Google Scholar]

eLife assessment

Joris Deelen 1

This paper presents a valuable pipeline based on state-of-the-art analytical software that was used to study genetic pleiotropy between neuropsychiatric disorders. The presented evidence supporting the claims is convincing and now includes an appropriate comparison to previously published methods as well as a detailed exploration of the findings. The created pipeline can thus be used by researchers from diverse fields to study different combinations of diseases and traits.

Reviewer #1 (Public Review):

Anonymous

The authors investigate pleiotropy in the genetic loci previously associated to a range of neuropsychiatric disorders: Alzheimer's disease, amyotrophic lateral sclerosis (ALS), frontotemporal dementia, Parkinson's disease, and schizophrenia. The local statistical fine-mapping and variant colocalisation approaches they use have the potential to uncover not only shared loci but also shared causal variants between these disorders. There is existing literature describing the pleiotropy between ALS and these other disorders but here the authors apply state-of-the-art, local genetic correlation approaches to further refine any relationships.

Complex disease and GWAS is not my area of expertise but the authors managed to present their methods and results in a clear, easy-to-follow manner. Their results statistically support several correlations between the disorders and, for ALS and AD, a shared variant in the vicinity of the lead SNP from the original ALS GWAS. Such findings could have important implications for our understanding of the mechanisms of such disorders and eventually the possibility of managing and treating them.

The authors have built a useful pipeline that plugs together all the gold-standard, existing software to perform this analysis and made it openly available which is commendable. However, there is little discussion of what software is available to perform global and local correlation analysis and, if there are multiple tools available, why they consider the ones they selected to be the gold-standard.

There is some mention of previous findings of genetic pleiotropy between ALS and these other disorders in the introduction, and discussion of their improved ALS-AD evidence relative to previous work. However, detailed comparisons of their other correlations to what was described before for the same pairs of disorders (if any) is missing. Adding this would strengthen the impact of this paper.

Finally, being new to this approach I found the abstract a little confusing. Initially, the shared causal variant between ALS and AD is mentioned but immediately in the following sentence they describe how their study "suggested that disease- implicated variants in these loci often differ between traits". After reading the whole paper I understood that the ALS-AD shared variant was the exception but it may be best to restructure this part of the abstract. Additionally, in the abstract the authors state that different variants "suggests the role of distinct mechanisms across diseases despite shared loci". Is it not possible that different variants in the same regulatory region or protein-coding parts of a gene could be having the same effect and mechanism? Or does the methodology to establish that different variants are involved automatically mean that the variants are too distant for this to be possible?

These concerns were addressed in the revised version of this manuscript.

Reviewer #2 (Public Review):

Anonymous

Summary:

Spargo and colleagues present an analysis of the shared genetic architectures of Schizoprehnia and several late-onset neurological disorders. In contrast to many polygenic traits for which global genetic correlation estimates are substantial, global genetic correlation estimates for neurological conditions are relatively small, likely for several reasons. One is that assortative mating, which will spuriously inflate genetic correlation estimates, is likely to be less salient for late-onset conditions. Another, which the authors explore in the current manuscript, is that some loci affecting two or more conditions (i.e., pleiotropic loci) may have effects in opposite directions, or shared loci are sparse, such that the global genetic correlation signal washes out.

The authors apply a local genetic correlation approach that assesses the presence and direction of pleiotropy in much smaller spatial windows across the genome. Then, within regions evidencing local genetic correlations for a given trait pair, they apply fine-mapping and colocalization methods to attempt to differentiate between two scenarios: that the two traits share the same causal variant in the region or that distinct loci within the region influence the traits. Interestingly, the authors only discover one instance of the former: an SNP in the HLA region appearing to confer risk for both AD and ALS. This is in contrast to six regions with distinct causal loci, and twenty regions with no clear shared loci.

Finally, the authors have published their analysis pipeline such that other researchers might easily apply the same techniques to other collections of traits.

Strengths:

- All such analysis pipelines involve many decision points where there is often no clear correct option. Nonetheless, the authors clearly present their reasoning behind each such decision.

- The authors have published their analytic pipeline such that future researchers might easily replicate and extend their findings.

Weaknesses:

- The majority of regions display no clear candidate causal variants for the traits, whether shared or distinct. Further, despite the potential of local genetic correlation analysis to identify regions with effects in opposing directions, all of the regions for causal variants were identified for both traits evidenced positive correlations. The reasons for this aren't clear and the authors would do well to explore this in greater detail.

- The authors very briefly discuss how their findings differ from previous analyses because of their strict inclusion for "high-quality" variants. This might be the case, but the authors do not attempt to demonstrate this via simulation or otherwise, making it difficult to evaluate their explanation.

These concerns were addressed in the revised version of this manuscript.

eLife. 2024 Dec 17;12:RP88768. doi: 10.7554/eLife.88768.3.sa3

Author response

Thomas P Spargo 1, Lachlan Gilchrist 2, Guy P Hunt 3, Richard JB Dobson 4, Petroula Proitsi 5, Ammar Al-Chalabi 6, Oliver Pain 7, Alfredo Lacoangeli 8

The following is the authors’ response to the original reviews.

Public Reviews:

Reviewer #1 (Public Review):

The authors investigate pleiotropy in the genetic loci previously associated to a range of neuropsychiatric disorders: Alzheimer's disease, amyotrophic lateral sclerosis (ALS), frontotemporal dementia, Parkinson's disease, and schizophrenia. The local statistical fine-mapping and variant colocalisation approaches they use have the potential to uncover not only shared loci but also shared causal variants between these disorders. There is existing literature describing the pleiotropy between ALS and these other disorders but here the authors apply state of the art, local genetic correlation approaches to further refine any relationships.

Complex disease and GWAS is not my area of expertise but the authors managed to present their methods and results in a clear, easy to follow manner. Their results statistically support several correlations between the disorders and, for ALS and AD, a shared variant in the vicinity of the lead SNP from the original ALS GWAS. Such findings could have important implications for our understanding of the mechanisms of such disorders and eventually the possibility of managing and treating them.

The authors have built a useful pipeline that plugs together all the gold-standard, existing software to perform this analysis and made it openly available which is commendable. However, there is little discussion of what software is available to perform global and local correlation analysis and, if there are multiple tools available, why they consider the ones they selected to be the gold-standard.

There is some mention of previous findings of genetic pleiotropy between ALS and these other disorders in the introduction, and discussion of their improved ALS-AD evidence relative to previous work. However, detailed comparisons of their other correlations to what was described before for the same pairs of disorders (if any) is missing. Adding this would strengthen the impact of this paper.

Finally, being new to this approach I found the abstract a little confusing. Initially, the shared causal variant between ALS and AD is mentioned but immediately in the following sentence they describe how their study "suggested that disease- implicated variants in these loci often differ between traits". After reading the whole paper I understood that the ALS-AD shared variant was the exception but it may be best to restructure this part of the abstract. Additionally, in the abstract the authors state that different variants "suggests the role of distinct mechanisms across diseases despite shared loci". Is it not possible that different variants in the same regulatory region or protein-coding parts of a gene could be having the same effect and mechanism? Or does the methodology to establish that different variants are involved automatically mean that the variants are too distant for this to be possible?

We thank reviewer one for their considered review of this manuscript and for highlighting points that would benefit from further exploration. Itemised responses are provided below.

(1) The reviewer noted that we did not adequately explain our choice of software for global and local genetic correlation analysis, and why we consider the techniques chosen as gold standard. We agree that the paper would benefit from clarification around this aspect of the study.

Briefly, we firstly selected LAVA for the local genetic correlation analysis because it offers several advantages above competing software and was developed by a reputable team previously known for developing MAGMA, which is well-established in the statistical genetics field. In the manuscript (page 8), we added the following clarification: “LAVA was the most appropriate local genetic correlation approach for this study for several reasons. First, unlike SUPERGNOVA and rho-HESS, LAVA makes specific accommodations for analysis of binary traits. Second, other tools focus on bivariate correlation between traits whilst LAVA offers this alongside multivariate tests such as multiple regression and partial correlation, enabling rigorous testing of pleiotropic effects. Lastly, LAVA is shown to provide results which are less biased than those from other tools.”

LDSC was selected for the global genetic correlation analysis because the software is well-established and likely the most widely adopted global genetic correlation tool. Reflecting its prevalence, the software is also compatible with LAVA, which adjusts for sample overlap based on the bivariate intercept estimate returned by LDSC. Since global genetic correlations were not the primary focus of this study, having been tested across several previous investigations (see response 2), we did not prioritise comparison of correlation estimates from LDSC against other available software. In the manuscript (pages 7-8) we now include the following statement: “[LDSC] was also applied to derive ‘global’ (i.e., genome-wide) genetic correlation estimates between trait pairs and estimate sample overlap from the bivariate intercept. The latter of these outputs was taken forward as an input for the local genetic correlation analysis using LAVA (see 2.2.2.2). Since global genetic correlation analysis across the traits studied here is not novel and associations reported in past studies are congruent across different tools, the compatibility between LDSC and LAVA motivated our use of LDSC for this analysis”.

(2) The second comment was that the paper would be strengthened by contextualising our study with detail around what is previously known about associations between the studied traits. Accordingly, we have added clarifying text at the end of the introduction, stating: “although previous studies have performed global genetic correlation analyses between various combinations of these traits {references}, this is the first to compare them at a genome-wide scale using a local genetic correlation approach“. In the discussion, we link back to these studies, stating that “Through genetic correlation analysis, we replicated genome-wide correlations previously described between the studied traits {references}”.

(3) The reviewer highlighted that the abstract as originally written may mislead or confuse the reader and we agree that clarity could be improved with some restructuring. This has now been revised and should read more logically.

(4) They also enquired about our reasons for suggesting that the implication of distinct variants for each trait from a colocalisation analysis suggests a distinct causal mechanism. We thank them for this question as it encouraged us to reconsider how best to present the results of this analysis. To answer their question:

It is certainly true that nearby but distinct variants can confer the same effect. In a scenario where multiple distinct variants result in the same effect and thus increase susceptibility towards two or more related phenotypes, you would expect to find evidence of association to each relevant variant in GWAS across these related traits (even if the magnitude of the associations differ). Where biological mechanisms are shared, post-GWAS finemapping analysis would be expected to yield credible sets overlapping across the traits, and likewise, colocalisation analysis should converge on a set of credible SNPs that are candidates for the shared effect. Where multiple distinct variants confer the same effect, you would expect to see separate fine-mapping credible sets for these distinct variants that colocalise pairwise between the jointly-affected traits. Generally, therefore, evidence supporting the two distinct variants hypothesis would suggest the role of two distinct mechanisms except when certain credible sets identified through fine-mapping converge on a colocalised effect.

There is a further caveat which we also explored in response to Reviewer two: if a region includes long-spanning LD (and hence a larger number of variants are considered in the analysis), then the colocalisation analysis is more likely to favour the two distinct variants hypothesis since the probability of the variants implicated in both traits being shared decreases. It is likely that support for the two independent variants hypothesis is correct in most of the comparisons from this study that favour this conclusion. This is because, generally, the fine-mapping credible sets do not overlap across trait pairs (Figure S4) and consequently the colocalisation analysis does not find any support for the shared variant hypothesis. An exception is the analysis of PD and schizophrenia at the MAPT locus on chromosome 17. We have accordingly added the following clarification to the (page 18): “However, the colocalisation analysis will increasingly favour the two independent variants hypothesis as the number of analysed variants increases. Hence, the wide-spanning LD of this region may have obstructed identification of variants and mechanisms shared between the traits.”

Reviewer #2 (Public Review):

Summary:

Spargo and colleagues present an analysis of the shared genetic architectures of Schizoprehnia and several late-onset neurological disorders. In contrast to many polygenic traits for which global genetic correlation estimates are substantial, global genetic correlation estimates for neurological conditions are relatively small, likely for several reasons. One is that assortative mating, which will spuriously inflate genetic correlation estimates, is likely to be less salient for late-onset conditions. Another, which the authors explore in the current manuscript, is that some loci affecting two or more conditions (i.e., pleiotropic loci) may have effects in opposite directions, or shared loci are sparse, such that the global genetic correlation signal washes out.

The authors apply a local genetic correlation approach that assesses the presence and direction of pleiotropy in much smaller spatial windows across the genome. Then, within regions evidencing local genetic correlations for a given trait pair, they apply fine-mapping and colocalization methods to attempt to differentiate between two scenarios: that the two traits share the same causal variant in the region or that distinct loci within the region influence the traits. Interestingly, the authors only discover one instance of the former: an SNP in the HLA region appearing to confer risk for both AD and ALS. This is in contrast to six regions with distinct causal loci, and twenty regions with no clear shared loci.

Finally, the authors have published their analysis pipeline such that other researchers might easily apply the same techniques to other collections of traits.

Strengths:

- All such analysis pipelines involve many decision points where there is often no clear correct option. Nonetheless, the authors clearly present their reasoning behind each such decision.

- The authors have published their analytic pipeline such that future researchers might easily replicate and extend their findings.

Weaknesses:

- The majority of regions display no clear candidate causal variants for the traits, whether shared or distinct. Further, despite the potential of local genetic correlation analysis to identify regions with effects in opposing directions, all of the regions for causal variants were identified for both traits evidenced positive correlations. The reasons for this aren't clear and the authors would do well to explore this in greater detail.

- The authors very briefly discuss how their findings differ from previous analyses because of their strict inclusion for "high-quality" variants. This might be the case, but the authors do not attempt to demonstrate this via simulation or otherwise, making it difficult to evaluate their explanation.

We thank Reviewer two for their appraisal of this manuscript and kind comments regarding its strengths. We will now aim to address the identified weaknesses.

(1) The reviewer comments that we did not adequately investigate why loci with causal variants identified in both traits all had positive local genetic correlations. We agree that it would be helpful to better understand the underlying reasons. To address this issue, we have added a new supplementary figure to compare the positive and negative local genetic correlation results (see Figure S2). In the main-text we add the following clarification. ”Although both positive and negative local genetic correlations passed the FDR-adjusted significance threshold, we observed only positive local genetic correlations in loci where fine-mapping credible sets were identified for both traits in the pair. This reflects that the correlation coefficients and variant associations from the analysed GWAS studies were generally stronger in the positively correlated loci (see Figure S2).”

(2) The reviewer rightly suggests that the manuscript would benefit from an improved explanation of the somewhat inconsistent results for the colocalisation analysis of ALS and AD at the locus around the rs9275477 SNP from this work and a previous study. We have now further investigated this and believe that the discrepancy results partly from an inherent empirical characteristic of the colocalisation analysis. We have explained this in the manuscript (page 22) as follows: “The previous study analysed a 200Kb window of over 2,000 SNPs around the lead genome-wide significant SNP from the ALS GWAS, rs9275477, and found ~0.50 posterior probability for each of the shared and two independent variant(s) hypotheses. The current analysis used 475 SNPs occurring within a semi-independent LD block of ~50kb in this locus. Since the posterior probability of the two independent variants hypothesis (H3) increases exponentially with the number of variants in the region whilst the shared variant hypothesis (H4) scales linearly, it is expected that our analysis would give stronger support for the latter. Given that the previous study defined regions for analysis based on an arbitrary window of ±100kb around each lead genome-wide significant SNP from the ALS GWAS and we defined each analysis region based on patterns of LD in European ancestry populations, it is reasonable to favour the current finding.”

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Supplementary file 1. Extended results from local genetic correlation analysis and subsequent fine-mapping and colocalisation analyses across loci correlated between trait pairs.

    (a) Results of all bivariate local genetic correlation analyses. (b) Results of colocalisation analyses performed across all single-nucleotide polymorphisms (SNPs) sampled in the region. Number of SNPs refers to the number of SNPs in common between the two traits analysed and present within the 1000 genomes reference panel. Comparisons where univariate fine-mapping identified at least one credible set in each trait were also performed on the basis of these credible sets (see Table 3). Annotations column: * denotes comparisons with genetic correlation analysis p-values below the strict Bonferroni correction threshold; all others passed false discovery rate (FDR) correction. ∆ denotes locus with boundaries extended by ±10 kb compared to the region partition defined in genetic correlation analysis. (c) Overview of credible sets identified across fine-mapping analyses in summary statistics harmonised across trait pairs.

    elife-88768-supp1.xlsx (115.8KB, xlsx)
    MDAR checklist

    Data Availability Statement

    All data used in this study are publicly available, including summary statistics from genome-wide association studies (GWAS) across each of the five analysed traits (Van Rheenen et al., 2021; Kunkle et al., 2019; Ferrari et al., 2014; Nalls et al., 2019; Trubetskoy et al., 2022; Pain et al., 2021) and the 1000 Genomes Phase 3 reference dataset (Auton et al., 2015). Code used to conduct the analyses described in this manuscript are available on GitHub, copy archived at Spargo, 2023.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES