Skip to main content
Nature Portfolio logoLink to Nature Portfolio
letter
. 2023 Sep 6;621(7977):E7–E26. doi: 10.1038/s41586-023-06355-3

A second update on mapping the human genetic architecture of COVID-19

The COVID-19 Host Genetics Initiative1
PMCID: PMC10482689  PMID: 37674002

arising from: COVID-19 Host Genetics Initiative. Nature 10.1038/s41586-021-03767-x (2021)

Investigating the role of host genetic factors in COVID-19 severity and susceptibility can inform our understanding of the underlying biological mechanisms that influence adverse outcomes and drug development1,2. Here we present a second updated genome-wide association study (GWAS) on COVID-19 severity and infection susceptibility to SARS-CoV-2 from the COVID-19 Host Genetic Initiative (data release 7). We performed a meta-analysis of up to 219,692 cases and over 3 million controls, identifying 51 distinct genome-wide significant loci—adding 28 loci from the previous data release2. The increased number of candidate genes at the identified loci helped to map three major biological pathways that are involved in susceptibility and severity: viral entry, airway defence in mucus and type I interferon.

We conducted a meta-analysis for 3 phenotypes across 82 studies from 35 countries, including 36 studies of individuals with non-European ancestry (Fig. 1, Supplementary Figs. 1 and 2 and Supplementary Table 1): critical illness (respiratory support or death; 21,194 cases), hospitalization (49,033 cases) and SARS-CoV-2 infection (219,692 cases). Most of the studies were collected before the widespread introduction of COVID-19 vaccination. We found 30, 40 and 21 loci that are associated with critical illness, hospitalization and infection due to SARS-CoV-2, respectively, for a total of 51 distinct genome-wide significant loci across all three phenotypes (P < 5 × 10−8; Fig. 2, Supplementary Fig. 3 and Supplementary Table 2), adding 28 genome-wide significant loci to the 23 previously identified by the COVID-19 Host Genomics Initiative (HGI; data release 6)1,2. We observed a median increase of 2.9-fold in statistical power across lead variants owing to a median increase of 1.6-fold in effective sample sizes from the previous release (Supplementary Table 3). After correcting for the number of phenotypes examined, 46 loci remained significant (P < 1.67 × 10−8). Of the 28 additional loci, 6 loci were originally reported by the GenOMICC study3, which also contributed to the current meta-analysis, and 9 other loci were identified by the new GenOMICC meta-analysis4 during the preparation of this paper. We found nine more loci that reached genome-wide significance, but we excluded them as they were probably false positives, as determined using a leave-most-significant-biobank-out analysis (Supplementary Table 4 and Supplementary Note). Comparing the effect sizes and statistical significance between the previous2 and current analysis indicated that all of the previously identified loci were replicated and showed an increase in statistical significance (Supplementary Fig. 4). Using our previously developed two-class Bayesian model for classifying loci as being more likely involved in infection susceptibility or severity2, we determined that 36 loci are substantially more likely (higher than 99% posterior probability) to affect disease severity (hospitalization) and 9 loci are substantially more likely to influence susceptibility to SARS-CoV-2 infection, while the remaining 6 loci could not be classified (Supplementary Fig. 5, Supplementary Table 5 and Supplementary Note). We observed that the 1q22 locus (lead variant: rs12752585:G>A) showed significant effect-size heterogeneity across ancestries (Phet < 9.80 × 10−4 = 0.05/51), whereas the previously reported heterogenous locus (FOXP4) remained at the same level of significance as before2, despite an increase in sample size (Phet = 2.01 × 10−3; Supplementary Fig. 6 and Supplementary Table 6). We found significant observed-scale single-nucleotide polymorphism heritabilities of all the three phenotypes (1.2–8.2%, P < 0.0001). We also estimated liability-scale heritabilities for a range of population prevalences (Supplementary Fig. 7, Supplementary Table 7 and Supplementary Note).

Fig. 1. Overview of the contributing studies in the HGI data release 7.

Fig. 1

a, Geographical overview of the studies contributing to the COVID-19 HGI and the composition by major ancestry groups. Populations are defined as Middle Eastern (MID), South Asian (SAS), East Asian (EAS), African (AFR), admixed American (AMR), European (EUR) and other (OTH). b, A principal component analysis (PCA) highlights the population structure and the sample ancestry of the individuals participating in the COVID-19 HGI. Per-cohort PCA results are provided in Supplementary Fig. 2. This figure was reproduced from the original publication by the COVID-19 HGI1 with modifications reflecting the updated analysis from data release 7.

Fig. 2. GWAS results for COVID-19.

Fig. 2

a, The results of a GWAS analysis of hospitalized individuals with COVID-19 (n = 49,033 cases and n = 3,393,109 controls) (top), and the results for individuals with reported SARS-CoV-2 infection (n = 219,692 cases and n = 3,001,905 controls) (bottom). The loci highlighted in yellow (top) represent regions that are associated with severity of COVID-19. The loci highlighted in green (bottom) are regions associated with susceptibility to SARS-CoV-2 infection. Lead variants for the loci that were identified in this data release are annotated with their respective rsID. The y axis is on the −log10 (P) scale up to 10, after which it switches to a 10 × log10[−log10(P)] scale to aid presentation. b, Results of gene prioritization using different evidence measures of gene annotation. For the genes in a linkage-disequilibrium (LD) region, genes with coding variants and eGenes (fine-mapped cis-expression quantitative trait locus (cis-eQTL) variant with posterior inclusion probability (PIP) >0.1 in GTEx Lung) are annotated as such if they are in linkage disequilibrium with a COVID-19 lead variant (r2 > 0.6). V2G, the highest gene prioritized by OpenTargetGenetics V2G score. The pink circle indicates SARS-CoV-2 infection susceptibility, the green triangle indicates COVID-19 severity and the blue cross indicates unclassified. This figure was reproduced from the original publication by the COVID-19 HGI1 with modifications reflecting the updated analysis from data release 7.

To better understand the biological mechanisms underlying COVID-19 susceptibility and severity, we further characterized candidate causal genes by mapping them onto biological pathways and performing a phenome-wide association analysis (Extended Data Fig. 1, Supplementary Fig. 8 and Supplementary Tables 2, 8 and 9). In total, 15 out of 51 loci could be linked to three major pathways involved in susceptibility and severity defined by expert-driven classification (Supplementary Note): (1) viral entry; (2) entry defence in airway mucus; and (3) type I interferon response. Moreover, the phenome-wide association analysis identified nine loci involved in the upkeep of healthy lung tissue.

Extended Data Fig. 1. Major COVID-19 biological pathways mapped with susceptibility and severity GWAS loci.

Extended Data Fig. 1

Genome-wide significant variants associated with COVID-19 (white boxes) and the annotated genes (peach boxes) are mapped on to pathways known to be involved in (a) viral entry and innate immunity, (b) entry defence in airway mucus, and (c) type I interferon. The suggested phenotypic impact of the significant variants using the Bayesian approach2 are denoted with shapes; COVID-19 susceptibility (pink circles), COVID-19 disease severity (green triangles), and unclassified variants (blue cross). Other genes known to be involved in the aforementioned pathways are shown using grey boxes. Detailed list of references to studies used to design this pathway can be found in Supplementary Note.

First, five loci include candidate causal genes involved in the viral entry pathway (Extended Data Fig. 1a), such as previously reported SLC6A20 (3p21.31), ABO (9q34.2), SFTPD (10q22.3) and ACE2 (Xp22.2), as well as TMPRSS2 (21q22.3), which was also identified in the data release 7. We found that the lead variant rs9305744:G>A, an intronic variant of TMPRSS2, is protective against critical illness (odds ratio (OR) = 0.92, 95% CI = 0.89–0.95, P = 1.4 × 10−8) and is in linkage disequilibrium with the missense variant rs12329760:C>T (p.Val197Met; r2 = 0.68). SARS-CoV-2 uses the serine protease TMPRSS2 for viral spike protein priming, as well as the previously reported ACE2 for host cell entry which functionally interacts with SLC6A20 (refs. 5,6). Notably, the previously reported association between ABO blood groups and susceptibility could be attributed to the interference of anti-A and anti-B antibodies with the spike protein, potentially interfering with viral entry7. Furthermore, the previously reported SFTPD encodes pulmonary surfactant protein D (SP-D), which contributes dually to the lung’s innate immune molecules and viral-entry response in pulmonary epithelia8,9 along with other genes for airway defence.

Second, four loci contain candidate causal genes for entry defence in the airway mucus (Extended Data Fig. 1b), such as previously reported MUC1/THBS3 (1q22) and MUC5B (11p15.5) as well as novel MUC4 (3q29) and MUC16 (19p13.2). We found that the novel lead variants rs2260685:T>C in MUC4 (intronic variant; in linkage disequilibrium (r2 = 0.65) with a missense variant rs2259292:C>T (p.Gly4324Asp)) and rs73005873:G>A in MUC16 (intronic variant) increase the risk of SARS-CoV-2 infection (OR = 1.03 and 1.03, 95% CI = 1.02–1.04 and 1.02–1.04, P = 4.1 × 108 and 6.4 × 10−10, respectively). Moreover, the previously reported locus 1q22 contains an intergenic lead variant rs12752585:G>A that decreases the risk of infection (OR = 0.98, 95% CI = 0.97–0.98, P = 1.5 × 1011) and increases MUC1 expression in the oesophagus mucosa in GTEx v8 (P = 5.2 × 10−9). Notably, the 1q22 locus also contains an independent lead variant, rs35154152:T>C, a missense variant (p.Ser279Gly) of THBS3, that decreases the risk of hospitalization (OR = 0.88, 95% CI = 0.86–0.90, P = 5.6 × 1022) but not infection (P = 5.7 × 104), suggesting potential distinct mechanisms in the locus. Consistent with these association patterns, MUC1, MUC4 and MUC16 are three known major transmembrane mucins of the respiratory tracts that prevent microbial invasion, whereas previously reported MUC5B, together with nearby MUC5AC, are primary structural components of airways mucus that enable mucociliary clearance of pathogens10.

Third, six loci contain candidate causal genes that are linked to the type I interferon pathway (Extended Data Fig. 1c), such as previously reported IFNAR2 (21q22.11), OAS1 (12q24.13) and TYK2 (19p13.2), as well as additionally identified JAK1 (1p31.3), IRF1 (5q31.1) and IFNα-coding genes (9p21.3). Previous studies have reported additional genes in this pathway: TLR7 (refs. 11,12) and DOCK2 (ref. 13). Here we found that the lead variant rs28368148:C>G, a missense variant (p.Trp164Cys) of IFNA10 located within the IFNα gene cluster, increases the risk of critical illness (OR = 1.56, 95% CI = 1.38–1.77, P = 3.7 × 10−12). IFNα is one of the type I interferons that binds specifically to the IFNα receptor consisting of IFNAR1–IFNAR2 chains, in which mutations are also known to increase the risk of hospitalization and critical illness. In the genes that enable signalling downstream of IFNAR, we identified that the lead variant rs11208552:G>T, an intronic variant of JAK1, is protective against critical illness and hospitalization (OR = 0.92 and 0.95, 95% CI = 0.89–0.94 and 0.93–0.96, P = 5.5 × 10−10 and 2.2 × 109, respectively). This variant was previously reported to decrease lymphocyte counts14 (β = −0.016, P = 5.5 × 10−15) and increase the JAK1 expression in the thyroid in GTEx15 (P = 6.1 × 10−23). JAK1 and previously reported TYK2 are Janus kinases (JAKs) that are required for type I interferon-induced JAK–STAT signalling. JAK inhibitors are used to treat patients with severe COVID-19 (ref. 16). Furthermore, downstream of JAK–STAT signalling, we found that the lead variant rs10066378:T>C, located 67 kb upstream of IRF1, increases the risk of critical illness and hospitalization (OR = 1.09 and 1.07, 95% CI = 1.06–1.13 and 1.05–1.09, P = 2.7 × 10−9 and 3.74 × 10−10, respectively).

Furthermore, the phenome-wide association analysis identified nine loci previously associated with lung function and respiratory diseases. These loci contain genes involved in the upkeep of healthy lung tissue such as previously reported FOXP4 (6p21.1), SFTPD (10q22.3), MUC5B (11p15.5) and DPP9 (19p13.3), as well as additionally identified CIB4 (2p23.3), NPNT (4q24), ZKSCAN1 (7q22.1), ATP11A (13q34) and PSMD3 (17q21.1). For example, we found that three lead variants, rs1662979:G>T (intronic variant of CIB4), rs34712979:G>A (splice region variant of NPNT) and rs2897075:C>T (intronic variant of ZKSCAN1), are significantly associated with hospitalization (OR = 1.05, 0.94 and 1.05, 95% CI = 1.03–1.07, 0.92–0.96 and 1.03–1.07, P = 5.6 × 10−9, 3.8 × 10−8 and 8.9 × 10−9, respectively) and lung function (FEV1/FVC)17, similar to the previously reported lead variant rs3934643:G>A (intronic variant of SFTPD). Notably, whereas the alleles associated with increased risk of COVID-19 severity of rs1662979 and rs3934643 decrease lung function (β = −0.013 and −0.025, P = 5.3 × 10−8 and 6.3 × 10−10), those of rs34712979 and rs2897075 increase lung function (β = 0.068 and 0.023, P = 4.2 × 10−134 and 1.6 × 10−20, respectively). Likewise, we found lead variants that were significantly associated with hospitalization and idiopathic pulmonary fibrosis18,19, such as the aforementioned rs2897075 and rs12585036:C>T (intronic variant of ATP11A; OR = 1.10, 95% CI = 1.08–1.12, P = 3.2 × 10−21), in addition to the previously reported rs35705950:G>T (promoter variant of MUC5B). Whereas the COVID-19 severity risk-increasing alleles of rs2897075 and rs12585036 increase the risk of idiopathic pulmonary fibrosis (OR = 1.12 and 1.27, P = 3.0 × 10−14 and 7.0 × 10−9, respectively), those of rs35705950 decreases the risk (OR = 0.50, P = 3.9 × 10−80). These results highlight the complex pleiotropic relationships between COVID-19 severity, lung function and respiratory diseases.

We used genetic correlations and Mendelian randomization analyses to identify potential causal effects of risk factors on COVID-19 phenotypes (Supplementary Fig. 9 and Supplementary Tables 10 and 11). In total, 14 novel genetic correlations and 10 novel robust exposure-COVID-19 trait pairs showed evidence of causal associations (Supplementary Note). In particular, smoking initiation and the number of cigarettes per day were positively correlated with severity and susceptibility phenotypes; Mendelian randomization indicated that smoking was causally associated with increased risk of COVID-19, further highlighting the role of the healthy lung tissue in COVID-19 severity. Moreover, genetically instrumented higher glomerular filtration rate (indicative of better kidney function) was associated with a lower risk of COVID-19 critical illness, whereas genetically predicted chronic kidney disease was associated with an increased risk of COVID-19 critical illness, suggesting that better kidney function would be beneficial for a lower risk of COVID-19 severity.

In summary, we have substantially expanded the current knowledge of host genetics for COVID-19 susceptibility and severity by further doubling the case numbers from the previous data release2 and identifying 28 additional loci. The increased number of loci enables us to map genes to pathways that are involved in viral entry, airway defence and immune system response. Notably, we observed severity loci mapped to type I interferon pathway, while susceptibility loci mapped to viral entry and airway defence pathways, with notable exceptions for severity-classified TMPRSS2 and MUC5B loci. Further investigation of how such susceptibility and severity loci map to different pathways would provide mechanistic insights into the human genetic architecture of COVID-19.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Online content

Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41586-023-06355-3.

Supplementary information

Supplementary Information (5.9MB, pdf)

Supplementary Methods, Supplementary References, Supplementary Figs. 1–7 and 9, and the legend for Supplementary Fig. 8.

Reporting Summary (73.4KB, pdf)
Supplementary Fig. 8 (36MB, pdf)

Supplementary Fig. 8.

Supplementary Tables (1.1MB, xlsx)

Supplementary Tables 1–12.

Acknowledgements

We acknowledge M. O’Reilly and B. Cooley from the Pattern team at the Broad Institute of MIT and Harvard for designing the geographical map in Fig. 1. A full list of the members of the Genes & Health Research Team is available online (https://www.genesandhealth.org/research/scientific-publications-authorship-and-acknowledgments). A full list of acknowledgements is provided in Supplementary Table 12.

Extended data figures and tables

Author contributions

Detailed author contributions are integrated in the authorship list.

Data availability

Summary statistics generated by the COVID-19 HGI are available online, including per-ancestry summary statistics for African, admixed American, East Asian, European and South Asian ancestries (https://www.covid19hg.org/results/r7/). The analyses described here used the data release 7. If available, individual-level data can be requested directly from contributing studies, listed in Supplementary Table 1. We used publicly available data from GTEx (https://gtexportal.org/home/), the Neale laboratory (http://www.nealelab.is/uk-biobank/), the Finucane laboratory (https://www.finucanelab.org), the FinnGen Freeze 4 cohort (https://www.finngen.fi/en/access_results) and the eQTL catalogue release 3 (http://www.ebi.ac.uk/eqtl/).

Code availability

The code for summary statistics lift-over, the projection PCA pipeline including precomputed loadings and meta-analyses (https://github.com/covid19-hg/); for heritability estimation (https://github.com/AndrewsLabUCSF/COVID19_heritability); for Mendelian randomization and genetic correlation (https://github.com/marcoralab/MRcovid); and subtype analyses (https://github.com/mjpirinen/covid19-hgi_subtypes) are available at GitHub.

Competing interests

A full list of competing interests is provided in Supplementary Table 12.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Unaffiliated: Cindy Boer, Venceslas Douillard, Marie-Julie Fave, Rocío Gallego Durán, Yael Michaeli, Doaa Shaheen, Ilangumaran Shivanathan

A list of authors and their affiliations appears at the end of the paper

Contributor Information

The COVID-19 Host Genetics Initiative:

Masahiro Kanai, Shea J. Andrews, Mattia Cordioli, Christine Stevens, Benjamin M. Neale, Mark Daly, Andrea Ganna, Karolina Chwialkowska, Amy Trankiem, Mary K. Balaconis, and Matthew Solomonson

Extended data

is available for this paper at 10.1038/s41586-023-06355-3.

Supplementary information

The online version contains supplementary material available at 10.1038/s41586-023-06355-3.

References

  • 1.COVID-19 Host Genetics Initiative. Mapping the human genetic architecture of COVID-19. Nature. 2021;600:472–477. doi: 10.1038/s41586-021-03767-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.COVID-19 Host Genetics Initiative. A first update on mapping the human genetic architecture of COVID-19. Nature. 2022;608:E1–E10. doi: 10.1038/s41586-022-04826-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kousathanas A, et al. Whole-genome sequencing reveals host factors underlying critical COVID-19. Nature. 2022;607:97–103. doi: 10.1038/s41586-022-04576-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Pairo-Castineira, E. et al. GWAS and meta-analysis identifies 49 genetic variants underlying critical COVID-19. Nature617, 764–768 (2023). [DOI] [PMC free article] [PubMed]
  • 5.Hoffmann M, et al. SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor. Cell. 2020;181:271–280. doi: 10.1016/j.cell.2020.02.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Vuille-dit-Bille RN, et al. Human intestine luminal ACE2 and amino acid transporter expression increased by ACE-inhibitors. Amino Acids. 2015;47:693–705. doi: 10.1007/s00726-014-1889-6. [DOI] [PubMed] [Google Scholar]
  • 7.Tamayo-Velasco Á, et al. ABO blood system and COVID-19 susceptibility: anti-A and anti-B antibodies are the key points. Front. Med. 2022;9:882477. doi: 10.3389/fmed.2022.882477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sorensen GL. Surfactant protein D in respiratory and non-respiratory diseases. Front. Med. 2018;5:18. doi: 10.3389/fmed.2018.00018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Watson A, Madsen J, Clark HW. SP-A and SP-D: dual functioning immune molecules with antiviral and immunomodulatory properties. Front. Immunol. 2020;11:622598. doi: 10.3389/fimmu.2020.622598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Chatterjee M, van Putten JPM, Strijbis K. Defensive properties of mucin glycoproteins during respiratory infections-relevance for SARS-CoV-2. mBio. 2020;11:e02374-20. doi: 10.1128/mBio.02374-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.van der Made CI, et al. Presence of genetic variants among young men with severe COVID-19. JAMA. 2020;324:663. doi: 10.1001/jama.2020.13719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Butler-Laporte G, et al. Exome-wide association study to identify rare variants influencing COVID-19 outcomes: results from the Host Genetics Initiative. PLoS Genet. 2022;18:e1010367. doi: 10.1371/journal.pgen.1010367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Namkoong H, et al. DOCK2 is involved in the host genetics and biology of severe COVID-19. Nature. 2022;609:754–760. doi: 10.1038/s41586-022-05163-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Chen M-H, et al. Trans-ethnic and ancestry-specific blood-cell genetics in 746,667 individuals from 5 global populations. Cell. 2020;182:1198–1213. doi: 10.1016/j.cell.2020.06.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.The GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.RECOVERY Collaborative Group. Baricitinib in patients admitted to hospital with COVID-19 (RECOVERY): a randomised, controlled, open-label, platform trial and updated meta-analysis. Lancet. 2022;400:359–368. doi: 10.1016/S0140-6736(22)01109-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Shrine N, et al. New genetic signals for lung function highlight pathways and chronic obstructive pulmonary disease associations across multiple ancestries. Nat. Genet. 2019;51:481–493. doi: 10.1038/s41588-018-0321-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Fingerlin TE, et al. Genome-wide association study identifies multiple susceptibility loci for pulmonary fibrosis. Nat. Genet. 2013;45:613–620. doi: 10.1038/ng.2609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Allen RJ, et al. Genome-wide association study of susceptibility to idiopathic pulmonary fibrosis. Am. J. Respir. Crit. Care Med. 2020;201:564–574. doi: 10.1164/rccm.201905-1017OC. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information (5.9MB, pdf)

Supplementary Methods, Supplementary References, Supplementary Figs. 1–7 and 9, and the legend for Supplementary Fig. 8.

Reporting Summary (73.4KB, pdf)
Supplementary Fig. 8 (36MB, pdf)

Supplementary Fig. 8.

Supplementary Tables (1.1MB, xlsx)

Supplementary Tables 1–12.

Data Availability Statement

Summary statistics generated by the COVID-19 HGI are available online, including per-ancestry summary statistics for African, admixed American, East Asian, European and South Asian ancestries (https://www.covid19hg.org/results/r7/). The analyses described here used the data release 7. If available, individual-level data can be requested directly from contributing studies, listed in Supplementary Table 1. We used publicly available data from GTEx (https://gtexportal.org/home/), the Neale laboratory (http://www.nealelab.is/uk-biobank/), the Finucane laboratory (https://www.finucanelab.org), the FinnGen Freeze 4 cohort (https://www.finngen.fi/en/access_results) and the eQTL catalogue release 3 (http://www.ebi.ac.uk/eqtl/).

The code for summary statistics lift-over, the projection PCA pipeline including precomputed loadings and meta-analyses (https://github.com/covid19-hg/); for heritability estimation (https://github.com/AndrewsLabUCSF/COVID19_heritability); for Mendelian randomization and genetic correlation (https://github.com/marcoralab/MRcovid); and subtype analyses (https://github.com/mjpirinen/covid19-hgi_subtypes) are available at GitHub.


Articles from Nature are provided here courtesy of Nature Publishing Group

RESOURCES