Skip to main content
Springer logoLink to Springer
. 2021 Dec 10;141(1):147–173. doi: 10.1007/s00439-021-02397-7

Common, low-frequency, rare, and ultra-rare coding variants contribute to COVID-19 severity

Chiara Fallerini 1,2,#, Nicola Picchiotti 3,4,#, Margherita Baldassarri 1,2,#, Kristina Zguro 1, Sergio Daga 1,2, Francesca Fava 1,2,5, Elisa Benetti 1, Sara Amitrano 5, Mirella Bruttini 1,2,5, Maria Palmieri 1,2, Susanna Croci 1,2, Mirjam Lista 1,2, Giada Beligni 1,2, Floriana Valentino 1,2, Ilaria Meloni 1,2, Marco Tanfoni 3, Francesca Minnai 6, Francesca Colombo 6, Enrico Cabri 7, Maddalena Fratelli 7, Chiara Gabbi 8, Stefania Mantovani 9, Elisa Frullanti 1,2, Marco Gori 3,10, Francis P Crawley 11, Guillaume Butler-Laporte 12,13, Brent Richards 12,14,15, Hugo Zeberg 16, Miklos Lipcsey 17,18, Michael Hultström 17,19, Kerstin U Ludwig 20, Eva C Schulte 21,22,23, Erola Pairo-Castineira 24,25, John Kenneth Baillie 24,25,26, Axel Schmidt 20, Robert Frithiof 17; WES/WGS Working Group Within the HGI; GenOMICC Consortium; GEN-COVID Multicenter Study, Francesca Mari 1,2,5,#, Alessandra Renieri 1,2,5,27,✉,#, Simone Furini 1,#
PMCID: PMC8661833  PMID: 34889978

Abstract

The combined impact of common and rare exonic variants in COVID-19 host genetics is currently insufficiently understood. Here, common and rare variants from whole-exome sequencing data of about 4000 SARS-CoV-2-positive individuals were used to define an interpretable machine-learning model for predicting COVID-19 severity. First, variants were converted into separate sets of Boolean features, depending on the absence or the presence of variants in each gene. An ensemble of LASSO logistic regression models was used to identify the most informative Boolean features with respect to the genetic bases of severity. The Boolean features selected by these logistic models were combined into an Integrated PolyGenic Score that offers a synthetic and interpretable index for describing the contribution of host genetics in COVID-19 severity, as demonstrated through testing in several independent cohorts. Selected features belong to ultra-rare, rare, low-frequency, and common variants, including those in linkage disequilibrium with known GWAS loci. Noteworthily, around one quarter of the selected genes are sex-specific. Pathway analysis of the selected genes associated with COVID-19 severity reflected the multi-organ nature of the disease. The proposed model might provide useful information for developing diagnostics and therapeutics, while also being able to guide bedside disease management.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00439-021-02397-7.

Introduction

For almost 2 years, COVID-19 has demonstrated itself to be a disease having a broad spectrum of clinical presentations: from asymptomatic patients to those with severe symptoms leading to death or persistent disease (“long COVID”) (Livingston and Bucher 2020; Chen et al. 2019; Zhang et al. 2020a). While developing vaccination programmes and other preventive measures to significantly dampen infection transmission and reduce disease expression, a much deeper and more precise understanding of the interplay between SARS-CoV-2 and host genetics is required to support the development of treatments for new virus variants as they arise. Furthermore, advances in modelling the interplay between SARS-CoV-2 and host genetics hold significant promise for addressing other complex diseases. In this study, we demonstrate the value of genetic modelling with its direct translatability into drug development and clinical care in the context of a severe public health crisis.

The identification of host genetic factors modifying disease susceptibility and/or disease severity has the potential to reveal the biological basis of disease susceptibility and outcome as well as to subsequently contribute to treatment amelioration (Elhabyan et al. 2020). From a scientific point of view, COVID-19 represents a particularly interesting and accessible complex disorder for modeling host genetic data because the environmental factor (SARS-CoV-2) can be readily identified by a PCR-based swab test. The still moderate viral genome variability has thus far been shown to have relatively low impact on disease severity (Islam et al. 2020) where currently age, sex, and comorbidities are the major factors predicting disease susceptibility and outcome (Li et al. 2021). While these factors certainly have significant value for prediction, they provide limited insights into disease pathophysiology and are of limited relevance for drug development.

Common variants in the human genome affecting the susceptibility to SARS-CoV-2 infections and COVID-19 severity have been successfully identified by Genome-Wide Association Studies (GWASs) (Severe Covid-19 GWAS Group (2020); Pairo-Castineira et al. 2020; COVID-19 Host Genetics Initiative et al. 2021). However, these variants only explain a small fraction of trait variability and, as it is well documented, GWASs are difficult to interpret because they often associate non-coding variants with phenotype; therefore, the relevant genes need to be pinpointed by deeper follow-up analyses. In contrast, next-generation sequencing-based studies have identified variants in a few genes-related to innate immunity which can solely underlie rare severe forms of COVID-19 (Zhang et al. 2020b; Van der Made et al. 2020; Fallerini et al. 2021a; Solanich et al. 2021). In these rare affected families, the predictivity is high as the susceptibility for severe COVID-19 follows Mendelian inheritance patterns. However, these patients represent only a small proportion of those severely affected by COVID-19. Taken together, genetic findings can currently only explain a limited proportion of COVID-19 susceptibility and severity, in spite of the relatively high predicted heritability of COVID-19 and COVID-19 symptoms (Williams et al. 2020). A better and more holistic understanding of host genetics could support the development of more specific, or even targeted drugs and treatment interventions leading to less morbidity and mortality.

The Italian GEN-COVID Multicenter Study collected more than 2000 biospecimens and clinical data from SARS-CoV-2-positive individuals (Daga et al. 2021), and whole-exome sequencing (WES) analysis contributed to the identification of rare variants (Fallerini et al. 2021a) and common polymorphisms (Baldassarri et al. 2021a; Croci et al. 2021; Fallerini et al. 2021b) associated with COVID-19 severity. In 2020, we started to investigate how common variants may combine with rare variants to determine COVID-19 severity in WES data using a first small cohort of hospitalized patients. This pilot analysis revealed that the combination of rare and common variants could potentially impact clinical outcome (Benetti et al. 2020a). We then proposed a new post-Mendelian model for a genetic characterization of the disorder (Picchiotti et al. 2021) based on an adapted Polygenic Risk Score (PRS) (Mars et al. 2020), called Integrated PolyGenic Score (IPGS). This allowed us to reach a more precise disease severity prediction than that based on sex and age alone. In this article, we substantially improve this post-Mendelian model to include ultra-rare and low-frequency variants while also demonstrating that IPGS significantly contributes to predictivity in combination with—as well as alongside—age and sex, and is able to extract patient-specific genes. The IPGS predictivity was also sustained in three independent European cohorts of the WES/Whole-Genome Sequencing study working group within the COVID-19 Host Genetics Initiative (COVID-19 Host Genetics Initiative 2021).

Materials and methods

Contributing cohorts

Five different cohorts (from Germany, Italy, Quebec, Sweden, UK) contributed to this study as described in Supplementary Table 1. For multi ancestry cohorts (Quebec and UK), the subpopulation of European Ancestry was included in the study. Institutional Review Board approval was obtained for each study (Supplementary Table 1).

Phenotype definitions

The training of the model proposed for predicting the severity of COVID-19 requires as inputs the exome variants, age, sex, and COVID-19 severity assessed using a modified version of the WHO COVID-19 Outcome Scale (COVID-19 Therapeutic Trial Synopsis 2020) as coded into the following six classifications: (1) death; (2) hospitalized receiving invasive mechanical ventilation; (3) hospitalized, receiving continuous positive airway pressure (CPAP) or bilevel positive airway pressure (BiPAP) ventilation; (4) hospitalized, receiving low-flow supplemental oxygen; (5) hospitalized, not receiving supplemental oxygen; and (6) not hospitalized. The aim of the model is to predict a binary classification of patients into mild and severe cases, where a patient is considered severe if hospitalized and receiving any form of respiratory support (WHO severity-grading equal to 4 or higher in 8 points classification). The next section describes how the annotation of exome variants and the selection of patients were performed in the GEN-COVID cohort. Following this, the training and testing of the model are described.

Massive parallel sequencing

GEN-COVID cohort

Whole-exome sequencing with at least 97% coverage at 20x was performed using the Illumina NovaSeq6000 System (Illumina, San Diego, CA, USA). Library preparation was performed using the Illumina Exome Panel (Illumina) according to the manufacturer's protocol. Library enrichment was tested by qPCR, and the size distribution and concentration were determined using Agilent Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA). The Novaseq6000 System (Illumina) was used for DNA sequencing through 150 bp paired-end reads. Variant calling was performed according to the GATK4 (O’Connor and Auwera 2020) best practice guidelines, using BWA (Li and Durbin 2010) for mapping and ANNOVAR (Wang et al. 2010) for annotating.

Swedish cohort

Whole-exome sequencing was performed using the Twist Bioscience exome capture probe and was sequenced on the Illumina NovaSeq6000 platform. Data were then analyzed using the McGill Genome Center bioinformatics pipeline (https://doi.org/10.1093/gigascience/giz037) in accordance with GATK best practices.

DeCOI Germany

800–1000 ng of genomic DNA of each individual was fragmented to an average length of 350 bp. Library preparation was performed using the TruSeq DNA PCR-free kit (Illumina, San Diego, CA, USA) according to the manufacturer’s protocol. Whole-genome sequences were obtained as 150 bp paired-end reads on S4 flow cells using the NovaSeq6000 system (Illumina). The intended average sequencing depth was 30X. The DRAGEN pipeline (Illumina, version 3.6.3 or 3.5.7) was used for alignment and joint variant calling was performed with the Glnexus software (version 1.3.2). Individuals with a 20-fold coverage in less than 96% of the protein-coding sequence were removed as well as related individuals to retain only from related pairs. Variant QC was performed using hail (version 0.2.58). European individuals were selected by performing PCA analysis along with the 1000 genomes data. Finally, annotation was performed using Variant Effect Predictor (VEP, version 101). In the present study, only individuals from the BoSCO and CoMRI substudies were included.

BQC-19

Whole-genome sequencing at mean coverage of 30x was performed on the Illumina NovaSeq6000 platform, then analyzed using the McGill Genome Center bioinformatics pipeline (https://doi.org/10.1093/gigascience/giz037), in accordance with GATK best practice guidelines.

GenOMICC/ISARIC4C

Whole-genome sequencing at mean coverage of 20x was performed on the Illumina NovaSeq6000 platform and then analyzed using the DRAGEN pipeline (software v01.011.269.3.2.22 , hardware v01.011.269). Variants were genotyped with the GATK GenotypeGVCFs tool v4.1.8.1.

PC analysis

The genetic ancestry of the patients was estimated using a random forest classifier. SNPs of autosomes with MAF above 10% and in linkage disequilibrium were extracted from the 1000 genome project using BCFTOOLS (Danecek et al. 2021), and intersected with variants from the GEN-COVID cohort. The resulting set of variants was used to compute 6 Principal Components by PLINK (Purcell et al. 2007) using samples from the 1000 genome projects. GEN-COVID samples were projected along the same Principal Components. The Random Forest classifier, as implemented in the R library randomForest (Liaw and Wiener 2002), was trained using samples from 1000 genomes with known ancestry, and then used to predict ancestry for the GEN-COVID cohort. To avoid bias in the analysis due to the different ethnicity, only patients of genetic European ancestry were retained for further analyses.

Definition of the Boolean features

Variants were converted into 12 sets of Boolean features to better represent the variability at the gene-level. First, any variant not impacting on the protein sequence was discarded. Then, the remaining variants were classified according to their minor allele frequency (MAF) as reported in gnomAD for the reference population as: ultra-rare, MAF < 0.1%; rare, 0.1% ≤ MAF < 1%; low-frequency, 1% ≤ MAF < 5%; and common, MAF ≥ 5%. Non-Finnish European (NFE) was used as a reference population. SNPs with MAF not available in gnomAD were treated as ultra-rare. INDELs with frequency not available in gnomAD were treated as ultra-rare when present only once in the cohort and otherwise discarded as possible artefacts of sequencing. For the ultra-rare variants, 3 alternative Boolean representations were defined, which are designed to capture the autosomal dominant (AD), autosomal recessive (AR), and X-linked (XL) model of inheritance, respectively. The AD and AR representations included a feature for all the genes on autosomes. These features were equal to 1 when the corresponding gene presented at least 1 for the AD model, or 2 for the AR model, variants in the ultra-rare frequency range and 0 otherwise. The XL representation included only genes belonging to the X chromosome. These features were equal to 1 when the corresponding gene presented at least 1 variant in the ultra-rare frequency range and 0 otherwise. The same approach was used to define AD, AR, and XL Boolean features for the rare and low-frequency variants. Common variants were represented using a different approach that is designed to better capture the presence of alternative haplotypes. For each gene, all the possible combinations of common variants were computed. For instance, in the case of a gene belonging to an autosome with 2 common variants (named A and B), 3 combinations are possible (A, B, and AB), and (consequently) 3 Boolean features were defined both for the AD and AR model. In the AR model each of these 3 features was equal to 1 if all the variants in that particular combination were present in the homozygous state and 0 otherwise. The same rule was used for the AD model, but setting the feature to 1 even if the variants in that particular combination are in the heterozygous state. In both models, AD and AR, a further feature was defined for each gene to represent the absence of any of the previously defined combinations. In the AD model, this feature was equal to 1 if no common variant is present and 0 otherwise; in the AR model, it is equal to 1 if no common variant is present in the homozygous state and 0 otherwise. The same approach was used to define the set of Boolean features for common variants in genes belonging to the X chromosome. The full list of Boolean representations is reported in Supplementary Table 2.

Connectome analysis

To measure the relatedness of the genes in our list with known genes related to viral susceptibility and disease severity, we used the human gene connectome (HGC—http://hgc.rockefeller.edu/). To define the lists of core genes we used three different steps. To probe the initial processes of virus entry and intracellular replication, we used genes coding for the virus–human proteins interactome found by Gordon et al. (2020) and we added a small set of genes known to be important for virus entry (https://viralzone.expasy.org/9077). To extend the study to the whole host response to the viral infection, we used the PanelApp https://panelapp.genomicsengland.co.uk). Finally, we used the union of GWAS loci (Severe Covid-19 GWAS Group (2020; Pairo-Castineira et al. 2020; COVID-19 Host Genetics Initiative et al. (2021)) and the genes in eQTL with them (by GTEX analysis), the genes already demonstrated to be associated with severity in functional studies and the list deduced from integrative genomic analyses by Pathak et al. (2021).

Model training

The dataset was divided into a training set and a testing set (90/10), and the entire procedure described in this section was performed using only samples in the training set. A bootstrap approach with 100 iterations was adopted to train the model. At each bootstrap iteration, 90% of the samples were selected (without replication), and the following two steps were performed: (step 1) selection of the most relevant features for each Boolean representations; and (step 2) definition of the weights of the Integrated Polygenic Score (IPGS). After the 100 bootstrap iterations, the information extracted on relevant features and weighting factors are merged to define the final IPGS (step 3). The IPGS is then used, together with age and sex, for training a model that predicts the COVID-19 severity (step 4). These four steps of the training procedure are described in detail in the next subsections. Since the model is based on a combination of rare and common variants, the training procedure should be performed using a dataset with homogeneous ancestry.

Step 1: Features’ selection

The subsets of the most relevant features were identified using logistic regression models with least absolute shrinkage and selection operator (LASSO) regularization. Separate logistic models were trained for each of the 12 sets of Boolean features. The predicted outcome variable for each of these models was a re-classified phenotype adjusted by age and sex. To compute these re-classified phenotypes as adjusted by age and sex, the patients were first divided into males and females. Then, for each sex, an ordinal logistic regression model was fitted using the age to predict the WHO phenotype classification into six grades. The ordinal logistic regression model was chosen as: it imposes a simple monotone relation between input feature and target variable; and it provides easily interpretable thresholds between the predicted classes. The patients with a predicted grading equal to the actual grading were excluded. The remaining patients were divided into two classes depending on whether their actual phenotype was milder or more severe than the one expected for a patient of that age and sex. This procedure has the benefit of isolating patients whose genetic factors are most important for predicting COVID-19 severity. This binary trait, i.e. phenotype more/less severe than expected, was used as the outcome variable for the 12 LASSO logistic models based on the 12 separate Boolean representations. For each LASSO model, the regularization strength was optimized by tenfold cross-validation with 50 equally spaced values in the logarithmic scale in the range [10− 2, 101]. The optimal regularization strength was selected as the one with the best trade-off between the simplicity of the model and the cross-validation score, i.e. as the highest regularization strength providing an average score closer to the highest average score than 0.5 standard deviations. Once the regularization strength was defined, the LASSO model was re-trained using all the samples in that particular bootstrap iteration. The features with non-null coefficients are the ones selected for the next step. In summary, for each bootstrap iteration, this procedure returns 12 lists of features (one for each Boolean representation) that are expected to be the most important features for predicting the phenotype adjusted by age and sex (in that particular bootstrap iteration).

Step 2: Weights of the Integrated Polygenic Score (IPGS)

In the previous step, the Boolean representations are considered isolated from each other. The aim of the IPGS is to combine information from different representations (Eq. 1). To reach this goal, it is necessary to compute the relative weights of the different contributions. For each bootstrap iteration, the list of relevant features extracted as described in the previous section are used to compute the number of features that are associated with mildness or severity for the different frequency ranges. For instance, n corresponds to the number of features associated with r the mild phenotype coming from Boolean features computed for variants in the frequency range [0.1%, 1%]. A feature is considered associated with the mild phenotype when its coefficient in the LASSO model estimated in step 1 is negative, i.e. it contributes to the prediction of the phenotype adjusted by age and sex in the direction of a phenotype less severe than what expected at that particular age and sex. The same rule, applied to the corresponding Boolean representation, is used to define the other feature-counts appearing in Eq. (1). The weighting factors in Eq. (1) were estimated as the ones that maximize the Silhouette coefficient of the separation between the clusters of patients more/less severe than expected. The minimization was performed with weighting factors restricted to the following ranges: FLF ∈ [1, 4], FR ∈ [2, 8], and FUR ∈ [5, 100].

This procedure returns three optimal values for the weighing factors associated with each bootstrap iteration.

Step 3: IPGS definition

In this step, the data extracted at each bootstrap iteration in steps 1 and 2 are combined to define the IPGS. First, for each of the Boolean features, of all the 12 representations, the number of times this feature was selected in the 100 bootstrap iterations is computed. Then, the entire bootstrap procedure is repeated using random input phenotypes, and the 5th percentile of the number of times that a feature is associated with a random phenotype is estimated. This threshold, computed separately for each Boolean representation, was used to select which Boolean features are included in the final model. As no significant association is expected among the Boolean features and the random phenotype, the threshold of the 5th percentile is expected to exclude with a 95% level of confidence the possible false positive associations. For the GEN-COVID cohort, the features selected correspond to ~ 4.4% of the initial number of Boolean features. The weighing factors in Eq. (1) were computed as the median values of the estimates obtained in the 100 bootstrap iterations.

Step 4: Training of the predictive model based on age, sex, and IPGS

The procedure described in the previous sections completely defines how to calculate the IPGS. The predictive model of the binary COVID-19 severity (hospitalized patients with any form of respiratory support versus all other patients) was defined as a logistic model that uses as input features IPGS, age, and sex. It should be noted that in steps 1-3, only patients that deviates from their expected severity based on age and sex were used. The procedure was designed to isolate the genetic basis of COVID-19 severity. Instead, in this final step, IPGS, age and sex are combined to predict the actual COVID-19 severity, i.e. hospitalized patients with any form of respiratory support. To prevent overfitting, the model was fitted using 466 samples different from the training set adopted in steps 1–3. During the fitting procedure, the class unbalancing is tackled by penalizing the misclassification of the minority class with a multiplicative factor inversely proportional to the class frequencies. The percentile normalization of the IPGS scores is performed within each cohort. An alternative logistic model that used as input features only age and sex was also fitted on the same training set. The comparison between the two models is intended to evaluate if the genetic information summarized in the IPGS improves the prediction of severity compared to a model based on age and sex alone. A further logistic regression model is fitted by only considering the IPGS variable.

Model testing

The training procedure returned 2 logistic models to be compared: one using as input features only age and sex, and the other one using as input features age, sex, and the IPGS. These models were tested, without any further adjustment, using other cohorts of European ancestry. The performances of the two models, with and without IPGS, were evaluated and compared in terms of accuracy, precision, sensitivity, and specificity. The increases of the performances are evaluated with respect to the performances of a model where the values of the IPGS feature have been shuffled, by computing the p value on the empirical null distribution. In addition, the empirical probability density function of IPGS has been estimated for the severe and non-severe patients of the cohort including both train and test sets and a t test is carried out to evaluate whether the means of the two distributions were significantly different. As a further evaluation of the importance of the IPGS score on the severity prediction, univariate logistic regression models using as independent variables age (continuous represented in decades), sex, and IPGS were fitted to the dataset that combines both the training set and the testing sets for a total of 2240 patients. These models were used to estimate the odds ratios and the p values of the association with the severe phenotype. Furthermore, a multivariable logistic regression was fitted using IPGS, age, and sex together. Finally, a multivariable logistic regression was performed using as predictor variables: IPGS, age, sex and comorbidities (congestive/ischemic heart failure; asthma/COPD/OSAS; diabetes; hypertension; cancer). This latter model has been fitted in the training set, where the information on comorbidities was available.

Pathway analysis

Pathway analysis was made using a ranked GSEA approach (Subramanian et al. 2005; Mootha et al. 2003), modified according to the specificity of our data. The metrics for gene ranking was calculated on the basis of the results of feature selection models, weighting in each Boolean feature both beta values and the number of bootstrap iterations where it was found significantly associated with severity/mildness (Supplementary Tables 3–6). All the Boolean features that were found significant in at least one of the models were included in the list. As the sign of beta depends on which allele is taken as reference (which is relative for common variants), we decided to use absolute beta values for all the features. To also weight the importance according to variant frequency, we used the F values from the IPGS score for the four categories (ultra-rare 5, rare 4, low frequency 2, common 1). Finally, we summed all the weights of each Boolean feature by gene. Briefly,

WgeneA=Features Gene AABSmeanβ×count×F.

Pathway enrichment analysis was made using the GSEA-preranked module (v. 7.2.4) of the Genepattern platform (Wang et al. 2010), on several pathway categories (BIOCARTA, KEGG, REACTOME, GOBP, HALLMARKS, C7 and C8), limiting the size of gene sets to the 10-300 range and performing 10,000 permutations. The networks showing similarity of significant pathways were built using the EnrichmentMap algorithm (Subramanian et al. 2005) in the Cytoscape suite (v. 3.8.2) (Merico et al. 2011; Shannon et al. 2003). Parameters used for network creations are: Jaccard Overlap Combined Index (k constant = 0.5), edge cutoff 0.05.

Website and data distribution

The coordination of international partners has been possible through the Host Genetics Initiative (HGI) (https://www.covid19hg.org/projects/).

Results can be shared through the Gen-Covid website (https://sites.google.com/dbm.unisi.it/gen-covid).

Code availability

Data analyses were performed using Python with the Scipy ecosystem (Virtanen et al. 2020), and the scikit-learn library (Pedregosa et al. 2011). Statistical association was done with the statsmodel Python library. The code is freely available at the github repository: https://github.com/gen-covid/pmm.

Results

The post-Mendelian paradigm for COVID-19 modelization for combining interpretability with predictivity based on ultra-rare, rare, low-frequency, and common variants

The aim of the present study was to develop an easily interpretable model that could be used to predict the severity of COVID-19 from host genetic data. Patients were considered severe when hospitalized and receiving any form of respiratory support. The focus on this target variable is motivated by the practical importance of rapidly identifying which patients are more likely to require oxygen support, in an effort to prevent further complications. Interpretability has been a guiding principle in the definition of the machine-learning model, as only a readily interpretable model can provide useful and reliable information for clinical practice while also contributing significantly to diagnostic, and therapeutic targeting. The high dimensionality of host genetic data poses a serious challenge to evident and reliable interpretability. So far, the development of a robust predictive model able to make a direct association between single variants and disease severity grading based on an accurate analysis of the vast number of host genetic variants compared to a much smaller number of individual patients has proven to be too complex and ultimately unreliable. To address the complexity with predictive reliability, an enriched gene-level representation of host genetic data was modeled in a machine-learning framework. The complexity of COVID-19 immediately suggests that both common and rare variants are expected to contribute to the likelihood of developing a severe form of the disease. However, the contribution of common and rare variants to the severe phenotype is not expected to be the same. A single rare variant that impairs the protein function might cause a severe phenotype by itself after viral infection, while this is not so probable for a common polymorphism, which is likely to have a less marked effect on protein functionality. These observations led to the definition of a score, named IPGS, that includes data regarding the variants at different frequencies:

IPGS=nCs-nCm+FLF·nLFs-nLFm+R·nRs-nRm+FUR·nURs-nURm. 1

In Eq. (1), n variables are used to indicate the number of input features of the predictive model that promote the severe outcome (superscript s) or that protect from a severe outcome (superscript m) and with genetic variants having Minor Allele Frequency (MAF) ≥ 5% (common, subscript C), 1% < MAF ≤ 5% (low-frequency, subscript LF), 0.1% < MAF ≤1% (rare, subscript R), and MAF < 0.1% (ultra-rare, subscript UR). The features promoting or preventing severity were identified by an ensemble of logistic models, as described in the next section. The weighting factors FLF, R, and FUR were included to model the different penetrant effects of low-frequency, rare, and ultra-rare variants, compared to common variants. Thus, the 4 terms of Eq. (1) can be interpreted as the contributions of common, low-frequency, rare, and ultra-rare variants to a score that represents the genetic propensity of a patient to develop a severe form of COVID-19.

Feature selection and gene discovery

The definition of the single terms of the IPGS formula requires 4 separate steps (Fig. 1): (1) the definition of a severity phenotype adjusted by age and sex; (2) the conversion of genetic variants into Boolean features that represent the presence of variants in different frequency ranges in each gene; (3) the selection of those features that are associated with disease severity; and (4) the optimization of the weighting factors appearing in Equation 1. These 4 steps were executed using data from a training set extracted from the GEN-COVID dataset (90% of the patients, n=1780, see Methods). The phenotype adjusted by age and sex was computed using an ordered logistic regression model, with the purpose of facilitating the extraction of features associated with the genetic basis of COVID-19 severity (Fig. 1B). The conversion of genetic variants into Boolean features led to the definition of 12 separate sets of input features (Supplementary Table 2). The set of input features “ultra-rare_autosomal dominant” (UR_AD) is designed to represent in a binary way an autosomal dominant hereditary model associated with variants with MAF lower than 0.1%, i.e. these Boolean features are equal to 1 for genes presenting at least one variant in this frequency range. Similarly, the set of input features “ultra-rare_autosomal recessive” (UR_AR) and “ultra-rare_X-linked” (UR_X) were designed to describe the autosomal recessive and X-linked models of inheritance of ultra-rare variants. Analogous principles were used for rare and low-frequency variants. In the case of common variants, the same 3 sets of Boolean features representing the autosomal dominant, autosomal recessive, and X-linked models of inheritance were used. However, instead of simply defining the binary variables as “absence/presence of variants”, the absence/presence of variant combinations was tested (Fig. 1C).

Fig. 1.

Fig. 1

Feature selection and gene discovery. A Whole-exome sequencing (WES) data stored in the Genetic Data Repository of the GEN-COVID Multicenter Study (GCGDR) and coming from biospecimens of 1780 SARS-CoV-2 PCR-positive subjects of European ancestry of different severity were used as the training set. B Clinical severity classification into severe and mild cases was performed by Ordered Logistic Regression (OLR) starting from the WHO grading and patient age classifications. C WES data were binarized into 0 or 1 depending on the absence (0) or the presence (1) of variants (or the combination of two or more variants only for common polymorphisms) in each gene. D LASSO logistic regression feature selection methodology on multiple train-test splits of the cohort leads to the identification of the final set of features contributing to the clinical variability of COVID-19 (E). From the initial 163,099 cumulative features (divided into 36,540 ultra-rare, 23,470 rare, 13,056 low frequency and 90,033 common features) in 12 Boolean representations, the selected features contributing to COVID-19 clinical variability are 7249 and they are reported in the Supplementary Tables 3–6. The total number of genes contributing to COVID-19 clinical variability was 4260 in males and 4360 in females, 75% of which were in common

The Boolean representation of the genetic variability described in the previous paragraph significantly reduces the dimensionality of the problem. However, the number of input features is still orders of magnitude higher than the number of patients that can be reasonably collected for training the model. To reduce the number of input features, a feature selection strategy based on logistic models with the validated Least Absolute Shrinkage and Selection Operator (LASSO) regularization was employed (Fig. 1D). The aim of LASSO regularization is to minimize (shrink) the number of coefficients of the model, consequently minimizing the number of input features used for predicting outcomes. Separate logistic models with LASSO regularization were trained for the 12 sets of Boolean features for predicting COVID-19 severity, allowing us to identify the relevant features for each set. About 4% of the cumulative tested features were found to contribute to COVID-19 variability in severity (Fig. 1D). Twenty-six percent identity match between extracted genes and known viral susceptibility genes was found (Supplementary Table 12). We further investigate extracted genes using the Human Gene Connectome (HGC). Interestingly, of the 4943 genes of our model that are mapped in HGC, 4401 (89%) are biologically significantly connected (p < 0.05) and 2847 (57.6%) with a degree of 0 (overlap) or 1 (direct connection) with one of the genes of the three Core Lists (Supplementary Table 13).

Biological interpretability of extracted genetic features

Selected genes contributed by ultra-rare, rare, low-frequency variants, or/and common variants (Fig. 2A–D and Supplementary Tables 3a–g). Specifically, 54% contributed by only one, 29% by two, 11% by three, and 6% by four types of variants. Around 25% of the genes were sex-specific. The latter group includes either genes located on the X chromosome, such as TLR7 and TLR8 in males, or genes regulated in opposite directions by androgens and estrogens when contributing with less penetrant common variants, such as p.L412F in TLR3 and p.D603N in SELP gene (Fig. 2A–D).

Fig. 2.

Fig. 2

Biological impact of ultra-rare, rare, low-frequency, and common features. Examples of ultra-rare (A), rare (B), low-frequency (C), and common (D) features are illustrated in panel A–D. The complete list of features is presented in Supplementary Tables 3–6. Inline graphic = contributing to COVID-19 severity; Inline graphic = contributing to COVID-19 mildness. Pink faces = contributing to females only; blue faces = contributing to males only; pink/blue faces = contribution in both sexes. In parentheses: AD = autosomal dominant inheritance; AR = autosomal recessive inheritance; XL = X-linked recessive inheritance. A Ultra-rare mutations in the RNA sensor TLR7, TLR3, and TICAM1 (encoding TRIF protein), already reported associated with XL, AR and AD inheritance (Zhang et al. 2020a; Van der Made et al. 2020; Fallerini et al. 2021a; Solanich et al. 2021) impair interferon (IFNs) production in innate immune system cells. Mutations in TLR8, as well as of the signal transducer IRAK1 also impair interferon production. The specific location of TLR7/8 and IRAK1 (on the X chromosome) as well as X-inactivation escaping are responsible for opposite effects in males and females. Mutation in RNASEL impair the antiviral effect of the gene. In lung epithelial cells, ACE2 ultra-rare variants (on the X chromosome) exert protective effects (probably) due to lowering virus entrance, while ultra-rare variants in ADAM17 (might) reduce the shedding of ACE2 and induce a severe outcome. The same is true for CFTR and SCNN1A (encoding ENaCA protein and involved in a CFTR-related physiological pathway), and the lipid transporter ABCA3 (Baldassarri et al. 2021b).Mutations of ADAMTS13 in vessels reduce the cleavage of the multimeric von Willebrand Factor (VWF), leading to thrombosis; B) Rare variants of the estrogen regulated TLR5 are associated with severity in females. Rare variants of the CFTR-related SLC26A9 are associated with severity in both sexes. This ion transporter has three discrete physiological modes: nCl(–)-HCO(3)(–) exchanger, Cl(–) channel, and Na(+)-anion cotransporter. Other examples of rare mutations associated with severity are the NK and T cell receptor FCRL6, IFN signal transducer IRAK2, and the actin depolymerization MICAL2; C low-frequency variants in another CFTR-related gene, SCNN1D (encoding for ENaCD protein) are associated with mildness, while rare variants in the following genes are associated with severity: cargo protein SPMA6, vesicle formation PEX1, inflammatory protein NOD2 (CARD15); D A number of coding polymorphisms, indicated with an asterisk, are in LD with genomic SNPs already associated with COVID-19 (The complete list is presented in Supplementary Table 11) (Severe Covid-19 GWAS Group (2020; Pairo-Castineira et al. 2020). In some cases, such as the case of SFTDP, the genomic SNP is the coding polymorphism itself. Of note are the genes of surfactant proteins associated with severe disease: SFTDP gene encoding for SP-D protein and SFTPA1 gene encoding for SP-A protein; the signal transducer, PPP1R15A gene encoding for GADD34 protein. OAS1 and OAS3 related to RNA clearance of RNASEL (reported in panel A as having ultra-rare mutations; included here should also be the already reported TLR3412 (Croci et al. 2021); the already reported SELP603 related to thrombosis (Fallerini et al. 2021a). Note: OAS1 haplotype A = c.1039-1G>A (Wickenhagen et al. 2021), (p.(Gly162Ser)), (p.(Ala352Thr)), (p.(Arg361Thr)), (p.(Gly397Arg)), (p.(Thr358Profs*26)). OAS1 haplotype B = haplotype without the variant combination in haplotype A

Among the extracted ultra-rare variants there was a group of genes, such as TLR3, TLR7 and TICAM1, already shown to be directly involved in the Mendelian-like forms of COVID-19 (Fig. 2A and Supplementary Table 3a, b). Furthermore, another group of genes are natural candidates because of their function: these include the ACE2 shedding protein ADAM17, CFTR-related genes, genes involved in glycolipid metabolism, genes expressed by cells of the innate immune system, and genes involved in the coagulation pathway. Finally, a group of genes led by ACE2 (if affected by ultra-rare variants) confers protection from the severe disease. This group includes several genes whose mutations are responsible for auto-inflammatory disorders.

Among the rare variants extracted, we identified some genes as candidates for COVID-19 severity, including TLR5 and SLC26A9 as well as other genes involved in the inflammatory response (Fig. 2B and Supplementary Table 4a, b).

Among the low-frequency variants extracted, we identified some genes associated with either severity or protection from severe COVID-19 that are linked to the CFTR pathway (e.g., PSMA6) as well as specific genes involved in the immune response (e.g., NOD2) (Fig. 2C and Supplementary Table 5a, b).

The model was also able to identify a group of extracted common variants already shown to be linked to either severe or mild COVID-19 (Fig. 2D and Supplementary Table 6a, b). Among them are the L412F TLR3 and D603N SELP polymorphisms, already reported to be associated with the severe disease (Croci et al. 2021; Fallerini et al. 2021b) and several coding polymorphisms in Linkage Disequilibrium (LD) with already reported genomic SNP, such as the ABO blood group, OAS1-3 genes, PPP1R15A gene and others (Elhabyan et al. 2020). In conclusion, considering their functions, genes involved in the immune and inflammatory responses, or those involved in the coagulation pathway and NK and T cell receptor, are to be considered natural candidates for severe or mild COVID-19.

Integrated PolyGenic Score definition

The Boolean features selected by the LASSO logistic models were used to calculate ten variables in Eq. (1) (Fig. 3A, B). The corresponding weights (F variables) were defined by optimizing the separation between severe and mild cases as offered by the IPGS formula. The optimization was measured using the Silhouette coefficient, and the optimal values were computed using a grid-search approach over a predefined grid (FLF ∈ [1, 4], FR ∈ [2, 8], and FUR ∈ [5, 100]).

Fig. 3.

Fig. 3

Integrated PolyGenic Score Definition. A The model is based on the comparison of Boolean features of severity versus Boolean features of mildness. B Graphic representation of the IPGS formula used for this model. C Principle for the calibration of different weighting factors based on the separation of severe and mild cases. D The obtained value for low-frequency, rare, and ultra-rare, being F = 1 for common variants. Common variants are indicated as common haplotypes since they are intended as combinations of coding variants within a single gene (see Fig. 1C and the Material and methods section)

This optimization returned values of 2, 4, and 5 for the low-frequency, rare, and ultra-rare variants, respectively (Fig. 3C, D).

Pathway analysis

To understand the biological mechanisms underlying the variability of disease, we performed a pathways analysis of the genes carrying variants discovered in the feature selection described above. The features obtained with this approach do not have the same predicted impact and are not discovered with the same confidence. Therefore, we decided to perform a rank-based pathway analysis, with genes with the highest impact and confidence ranking highest in the list, rather than a simple over-representation approach. We ranked all the genes that were found to be significantly associated with severity/mildness in at least one bootstrap repetition, based on a score that takes into account three parameters: average coefficient in the LASSO models selecting the feature, number of significant bootstrap results, and the F correction factor for the frequency category used in the IPGS (detailed in the Methods section below). For genes with more than one significant Boolean feature, we summed up the scores of each feature. Gene Set Enrichment Analysis (GSEA) was then performed using two separate ranked gene lists (Supplementary Table 7) for females and males, followed by the generation of similarity networks using EnrichmentMap (Fig. 4A). The usage of rank-based search method allows to identify statistically significant pathways starting from extensive list genes, as each gene is associated with its specific importance. Although no pathways satisfied the 0.25% FDR threshold normally required for standard GSEA analyses, the set of pathways considered significant using more relaxed thresholds on p values were shown to group in meaningful modules, providing useful information on pathogenetic mechanisms and on the genes that could explain how they can be affected. The network of all the pathways significantly enriched in both females and males ranked gene lists (p < 0.01, n = 25) is depicted in Fig. 4B, while the network of all the pathways enriched in either females or males with a more stringent p value (p < 0.005, n = 100) is shown in Fig. 4C. Detailed information on the names of the pathways and p values of enrichment is reported in Supplementary Figures 1 and 3. For the most representative pathways of each network, the heatmaps of the genes with their weights of association to disease variability are shown in Fig. 4D and Supplementary Figures 2 and 4, while gene lists and gene weights for all the significant pathways are reported in Supplementary Table 8.

Fig. 4.

Fig. 4

Pathway enrichment analysis of the genes associated with disease severity/mildness. A Workflow of the analysis. Genes corresponding to Boolean features found to be associated at least once were ranked based on a composite score and subjected to Gene Set Enrichment Analysis. Two separate ranked gene lists for females (7317 genes, weight range 3 × 10–5-561) and males (7325 genes, weight range 7 × 10–5-452) were used. The list of significant pathways was analysed and presented as a similarity network: B Similarity network of the pathways with a significant enrichment both in females and males (p < 0.01). The size of the circles is proportional to the pathway size. Significance above threshold is indicated by the red color. C Similarity network of the pathways with a significant enrichment either in females (red left half of the circles) or males (red right half of the circles) (p < 0.005). D Heatmaps of the genes belonging to a selection of pathways of interest. The color gradient represents the weight of each gene, calculated and described in methods. Please note high ranking of TLR genes (TLR5, TLR8, TLR3 and TLR7) in the pathway of Response to Mechanical Stimulus, CFTR gene in Recognition for Clathrin-mediated endocytosis, RNASEL, TYK2, OAS1 and OAS3 genes in Interferon alpha–beta signaling. Note also the presence of the relevant pathway of Exhaust vs Memory CD8 T cell Up that also includes TLR7 gene

COVID-19 post-Mendelian model predictivity

The functional interpretation of the variants identified by the feature selection approach, complemented by the strong link between the involved human biological pathways and COVID-19 pathogenicity, support the hypothesis that the IPGS score developed here may contribute significantly to predicting the severity of COVID-19. This hypothesis was tested using a logistic regression model that predicts COVID-19 severity based on age, sex and the IPGS (after percentile normalization). The training set is composed of 466 patients not included in the training set previously exploited for the IPGS feature engineering. The model’s performance was then tested using three independent cohort sets of European ancestry (Fig. 5A). The model exhibited an overall accuracy of 0.73, precision equal to 0.78, with a sensitivity and specificity of 0.72 and 0.75, respectively. Noteworthy, all the aforementioned metrics are higher than the corresponding values obtained using a logistic model that adopted as input features only age and sex. The increase in performances of the model with IPGS suggests that this score indeed confers significant additional (genetic) information for predicting COVID-19 severity compared to only age and sex. The increase of the performances is statistically significant (p value < 0.05 for accuracy, precision, sensitivity, specificity) with respect to the distribution of performances for an ensemble of models where the IPGS feature has been randomized (Fig. 5C lower left). A third logistic regression model fitted with IPGS alone, shows performances well above the random guess. Furthermore, the empirical probability density function of IPGS scores (Fig. 5C right) has been estimated for the severe and non-severe patients of the cohort including both training and testing sets. It is worth noting the shift on the right of the IPGS distribution for the severe patients, with significant p value (< 0.001) for the t test of mean difference. This difference between severe and non-severe cases is preserved for the male and female cohorts when analyzed separately (p values < 0.001 and 0.024, respectively).

Fig. 5.

Fig. 5

Model predictivity. A The post-Mendelian model was trained using a sample of 466 patients from the GEN-COVID cohort n.2 and Swedish cohort (having cases only) and tested with three additional European cohorts from UK, Germany and Canada. B A logistic regression model was used for severity prediction. Severity was defined mainly on the basis of hospitalization versus not hospitalization. Hospitalized cases without respiratory support were included in controls. TN = true negative; TP =  true positive; FN =  false negative; FP =  false positive. C When the IPGS is added to age and gender as a regressor, the performances of the model increase: accuracy + 1%, precision + 1%, sensitivity + 2%, specificity + 1%. These increases are statistically significant (p value < 0.05 for accuracy, precision, sensitivity and specificity) with respect to the null distribution obtained by randomizing the IPGS. The performances of the model built with IPGS alone are all above the random guess. In addition, on the right, we reported the distributions of the IPGS for severe and non-severe patients. D In the three tested cohorts, when the IPGS is added to age and sex as a regressor, all the performances increase: the accuracy up to + 2%, the precision up to + 1%, the sensitivity up to + 3%, and the specificity up to + 2%. We conclude that IPGS is able to improve prediction of clinical outcome in addition to the well-established powerful factors of age and sex. E The univariate logistic regression models fitted on the cohort including both train and test, confirmed that the IPGS is associated with severity with an odds-ratio (OR) of 2.32, while age (continuous in decades) and sex have an OR of 1.89 and 2.99, respectively

In line with the results obtained using the overall test set, the model including IPGS, age, and sex performed better than the model considering only sex and age as inputs, in each of the testing cohorts, separately (Fig. 5D). The increase in performance was systematically observed throughout all the cohorts: on average + 1.33% for accuracy, + 1% for precision, + 1.33% for sensitivity, + 1.67% for specificity. Considering the difference in phenotype classification inherent to a comparison among various international cohorts, and the genetic variability among different European sub-populations, the consistent increase in performances observed for the model with IPGS demonstrates that this score provides a robust index for predicting COVID-19 severity. As a further test for the importance of the IPGS score for predicting COVID-19 severity, the univariate logistic models were used on the overall set including both train and test cohorts to estimate the OR of severe COVID-19 for IPGS, age, and sex, separately. The test confirmed that severity was associated with IPGS, showing an OR of 2.32 (p < 0.001, 95% confidence interval [1.79, 3.01]) with age, measured in decades, and sex, having OR of 1.89 (p < 0.001, 95% confidence interval [1.79, 2.00]) and 2.99 (p < 0.001, 95% confidence interval [2.58, 3.46]) respectively (Fig. 5E). The multivariate logistic regression using sex, age, and IPGS together, provided similar results reported in Supplementary Table 9 confirming the goodness of the regressors’ OR. When adjusting for comorbidities, in the train cohort where the comorbidities were available, with a multivariable logistic model, OR of IPGS was 2.46 (p = 0.05, 95% confidence interval [1.15, 5.25]) as shown in Supplementary Table 10. This result further confirms that IPGS is a reliable predictor of COVID-19 clinical severity.

Advantages of IPGS and clinical interpretability of connected features

We then wanted to compare the clinical outcome with the probability of severity obtained from three different models: IPGS alone, sex-age alone or combined model (represented as heatmap in Fig. 6). It appears evident that in a subset of patients, the 2 models based on sex-age alone and IPGS alone have a discordant prediction (left and right end of dendrogram in Fig. 6A). In these cases, IPGS appears to be a relevant predictor of severity (Fig. 6A). This is in accordance with the above-presented logistic regression analysis (Fig. 5E) that shows IPGS having an OR of 2.32 for severity. Moreover, the list of features on which the IPGS score is built, represent a biological handle for pathophysiological mechanisms and possible personalized adjuvant treatments.

Fig. 6.

Fig. 6

Clinically interpretability of IPGS. Panel A shows the GEN-COVID cohort dendrogram and heatmaps of the probabilities of severity based on the 3 different models: sex-age alone, IPGS alone and combined model. In the extreme ends of dendrogram (left and right) the probability of severity based on sex-age alone and IPGS alone is highly discordant (different colors). Selected examples corresponding to the arrows are illustrated in panels B-G. In each panel IPGS score, probabilities of severity and key features useful for bedside clinical management are shown. B) Male patient, in the 46–50 age range, treated with CPAP ventilation, tocilizumab, enoxaparin, hydroxychloroquine and lopinavir/ritonavir; no comorbidities except for asthma have been reported. The patient presented a rare TLR7 mutation that leads to an impaired production of interferon gamma (Made et al. 2020). C) Male patient, in the 51–55 age range, treated with invasive mechanical ventilation, steroids and enoxaparin. He had among comorbidities obesity, anxiety, hypertension and cerebral ischemia. He was found to be homozygous for the SELP rs6127 (p.Asp603Asn). Homozygosity of Asparagine in position 603 of Selectin P makes this endothelial protein more prone to clot formation and male patients more prone to COVID-19 thrombosis (Croci et al. 2021). Hence, the rationale for considering as putative adjuvant therapy in the management of similar cases the anti-Selectin P antibodies, a drug already approved for vascular events of sickle cell anemia. D) Male patient, in the 51–55 age range, treated with CPAP ventilation, tocilizumab, steroids, enoxaparin, hydroxychloroquine and lopinavir/ritonavir; no comorbidities except for diabetes. He was found to have the androgen receptor polyQ repeats > 23. The regular function of the androgen receptor is correlated with a beneficial immunomodulatory effect in those male patients in whom the increase in testosterone levels may overcome the receptor resistance. The rationale is to consider giving testosterone to those male subjects who cannot, on their own, raise the levels enough to overcome the receptor resistance due to poly-glutamine stretch longer than 23 repeats (Daga et al. 2021). E) Female patient, in the 31–35 age range, treated with CPAP ventilation and steroids, enoxaparin and azithromycin; no comorbidities except for hypothyroidism. She was a carrier of an ultra-rare mutation in ADAMTS13. Impaired function of ADAMTS13 leads to reduced cleavage of von Willebrand factor (vWF) and enhanced clot formation. The effect is enhanced in females and responsible for SARS-CoV-2 related thrombosis. Anti-vWF immunoglobulins would be a putative therapeutic option to consider in similar cases. F-G) examples of low IPGS and related key features. F) Male patient, in the 81–85 age range, treated with low-flow oxygen. No information regarding pharmacological therapy during hospitalization is present. Among comorbidities: diabetes mellitus, congestive heart failure and bowel cancer and steroids. He presented an ultra-rare mutation in ACE2. G) Male patient, in the 86–90 age range, treated with low-flow oxygen, steroid, enoxaparin and ceftriaxone plus azithromycin. Among his comorbidities: colon diverticulosis with constipation?, benign prostatic hyperplasia?, anxious-depressive syndrome, sideropenic anemia. He was a carrier of an ultra-rare mutation in AGTR2

For example, three male patients, within two distinct age ranges (46–50, 51–55) (panels B, C and D) with severe outcome (intubation and CPAP) are imperfectly represented by the sex-age model (probability of severity from 0.52 to 0.66) and better represented by the IPGS model (probability of severity from 0.91 to 0.95). The detected genetic variants that would allow to clinically consider putative personalized treatments in similar cases are: (1) TLR7 ultra-rare mutation indicating to consider possible adjuvant treatment with IFN gamma administration (Fallerini et al. 2021a); (2) homozygosity 603Asn in SELP gene suggesting putative adjuvant treatment with anti-selectin P autoantibodies (e.g. Crizanlizumab) (Fallerini et al. 2021b) and (3) polyQ longer than 23 in AR gene suggesting to consider possible adjuvant treatment with testosterone (Baldassarri et al. 2021a).

In a female patient, within age range 31–35, the sex-age model showed a probability of severity of 0.17 (panel D) while the IPGS score was 336 corresponding to a probability of severity of 0.95. The patient had no comorbidities except for hypothyroidism. She underwent steroid treatment and CPAP ventilation. She was found to be carrier of ADAMTS13 ultra-rare mutation, being more susceptible to thrombosis (due to reduced capacity of cutting von Willebrand factor); she had indeed a high D-dimer value. Caplacizumab (an antibody anti-vWF) would be an option to consider as possible adjuvant treatment in the clinical management of similar cases.

Two male patients, within two distinct age ranges (81–85, 86–90) (panel F and G) with a relatively mild respiratory outcome (hospitalised with low-flow oxygen therapy) presented an IPGS score of − 258 and − 141, respectively. Their severity probabilities calculated on sex-age (0.9 and 0.94) do not mirror the relatively mild clinical outcome, which is instead better represented by the severity probability calculated in IPGS only (0.23 and 0.41). Those two patients presented ultra-rare variants in ACE2 gene, likely responsible for reduced viral load (Benetti et al. 2020b), and in AGTR2 gene, which reduced activity is known to prevent cystic fibrosis pulmonary manifestation (Darrah et al. 2019).

Discussion

The most robust and traditional method for feature selection in complex disorders is Genome-wide association studies (GWASs). GWAS focuses on common variants only whose effects are small. The method is based on the comparison of about 700,000 genomic SNPs, mostly non-coding, in cases/controls. The coverage of coding SNPs is usually only through imputed data by Linkage Disequilibrium. The method needs ten/hundred of thousands subjects. The missing heritability of this method is rare variants. Until now, 35 loci have been identified for COVID-19 severity (Severe Covid-19 GWAS Group 2020; Pairo-Castineira et al. 2020; COVID-19 Host Genetics Initiative et al. 2021). Another robust and traditional method is the Burden test. Burden focuses on coding rare variants. The method is based on aggregation on a gene level of the variants and comparison between case and control subjects and likewise GWAS needs ten/hundred of thousands subjects. The missing heritability of this method is common variants. Few genes have been identified (Kosmicki et al. 2021) (WES/WGSHGI Working Group https://www.covid19hg.org/blog/2021-09-27-september-20-2021-meeting/). None of the above methods until now has been able to reach predictivity of COVID-19 severity useful in clinical practice.

The new proposed model takes into account both common and rare variants and has the ability to extract significant COVID-19 features (genes/variants) using a set of relatively low numbers of subjects. Furthermore, employment of oligo-asymptomatic SARS-CoV-2-infected subjects as controls instead of general population is providing a significant advantage in accuracy. Extracted genes were prioritized by matching with known viral susceptibility genes and transcriptomic data (Supplementary Table 12). Identity match was found for more than 25% of genes, even if the transcriptomic list is far to be representative of whole transcript alteration in the several tissues/organs involved in COVID-19 since most data comes from 2 tissues only (lung and swab). More than 55% of genes have a direct connection (degree 0 and 1 in HGC) with viral susceptibility genes, including those from GWAS loci and the genes in eQTL with them by GTEX analysis (Supplementary Table 13). Finally, up to 89% of genes are biologically significantly connected (p < 0.05) with the list above, supporting the validation of extracted genes.

The importance of combination of rare and low-frequency variants has already been demonstrated to contribute to the prediction of susceptibility in other complex disorders (Marouli et al. 2017; International Multiple Sclerosis Genetics Consortium 2018). Here, we further expand this approach while demonstrating that ultra-rare, rare, low frequency, as well as common variants contribute to the likelihood of developing a severe form of COVID-19. Furthermore, we included in our analyses a calibration of the relative weight of the variants vis-a-vis their impact on disease severity: a single ultra-rare variant might well by itself cause a severe phenotype of COVID-19, while this is less probable for a common polymorphism, one that is likely to have a markedly less direct effect on protein functionality. We performed a first modelization of COVID-19 genetics using both rare and common variants (Picchiotti et al. 2021). Since feature selection methodologies are generally sensitive to allele frequency, the extraction was performed separately for rare (MAF < 1/100) and common (MAF > 1/100) variants. However, the methodology revealed the insight that low-frequency variants (MAF from 1 to 5%) were disadvantaged if selected together with common ones. Furthermore, for extracting Mendelian-like genes a threshold of MAF < 0.1% (ultra-rare variants) appeared more effective than MAF < 1% and all mutations in the TLR7 gene that proved to have loss of function had indeed MAF < 1/1000 (Fallerini et al. 2021a). The model we arrived at, now considers separately ultra-rare, rare, low-frequency, and common variants.

Similar to the classical PRS (Polygenic Risk Score), the proposed IPGS (Integrated PolyGenic Score) may prove reliable for assessing the probability of severe COVID-19 following infection by SARS-CoV-2 (Mars et al. 2020). While PRS is based on common polymorphisms found at the genomic level with the majority of loci potentially conferring risk being not easily interpretable due to the uncertainty of linked genes, IPGS allows immediate biological interpretation because it only includes coding variants. Furthermore, as opposed to PRS, IPGS relies on both polymorphisms and rare variants is capable of differentially weighting features in an indirectly proportional way in respect to frequency, and therefore, to protein impact. Each patient indeed is assigned both a number and the list of her/his common and low-frequency polymorphisms relevant to COVID-19 supported by medically actionable information and of rare and ultra-rare variants conferring either risk of severity or protection from severe disease. Drawing on the entire picture presented through IPGS analysis, personalized adjuvant therapy could be envisaged. At the time of writing, a platform trial based on genetic markers is being discussed with the Italian Medicines Agency (EudraCT Number: 2021-002817-32).

Within 25 reported genomic SNPs demonstrably related to COVID-19 susceptibility/severity, 5 were reported to be in LD with coding variants (COVID-19 Host Genetics Initiative et al. 2021; Covid19hg.org 2021). The model presented here might provide useful information for uncovering the identity of the gene/coding variants responsible for COVID-19 susceptibility/severity linked to these genomic SNPs (Fig. 2D). For example, on chromosome 12, the genes mapping to the locus tagged by rs10774671 (COVID-19 Host Genetics Initiative et al. 2021) are both OAS1 and OAS3. In OAS3, the coding variant is an Arginine to Lysine substitution (rs1859330) in high LD (0.8) with the tag SNP. This polymorphism was already associated with viral infection (Tan et al. 2017) based on the presence of Lysine having been shown to lead to a decreased INF-γ production. In OAS1, the haplotype (including 4 missense variants: G162S, A352T, R361T, and G397R), the splicing variant 1039-G>A (the reported genomic polymorphism itself), and the truncating mutation T359fs*26 are associated with severity and predicted to impair OAS1 function. Both OAS1 and OAS3 induce RNASEL, which in turn exerts antiviral activity. Further support for the role of the OAS/RNASEL axis is indicated by the presence of ultra-rare recessive variants.

This innovative approach allowed us to better select genes located on the X chromosome related to COVID-19 that affect males and females in opposite ways (Fig. 2A and Supplementary Tables 3 and 6). Interestingly, many of these genes were previously confirmed or hypothesized to escape the X chromosome inactivation. With respect to these genes, females produce twice the levels of protein in comparison with males. Mutations in hemizygous state in males and heterozygous state in females appear silent until SARS-CoV-2 infection occurs. For example, TLR7 and TLR8 are selected for ultra-rare and associated with severity in males and with protection from severe disease in heterozygous females. We know that the activation of TLR7/8 induces the production of type 1 and type 2 IFN as well as pro-inflammatory cytokines, where the production defect in hemizygous males leads to severe COVID-19. However, an excess of the sensor can also lead to damage from hyperinflammation. Therefore, the condition of carrier females is the more favorable state and has in fact been associated with mild COVID-19 (Subramanian et al. 2006).

Pathway analysis pointed to the relevance of obvious actors in COVID-19 pathology, such as immune cells and interferon signaling, but also to the important role of specific organs (brain, digestive tract, kidney, reproductive system) and functions (metabolism of lipids and steroids). The pathways identified through GSEA analyses reflected the multi-organ nature of the disease. In addition, our analyses reveal new candidate determinants of disease variability. The four pathways linked to cilium motility suggest a role for ciliated cells of the respiratory tract (and possibly others) in antiviral defense. The functionality of the clathrin-mediated endocytosis pathway may likely affect viral entry (Bayati et al. 2021). Likewise, endoplasmic reticulum associated protein degradation (ERAD), which is linked to autophagy and SARS-CoV-2 life-cycle (Reggiori et al. 2010), may also be relevant. Other pathways with a less obvious but potentially interesting role in the disease include cell adhesion and mechanical stimulus signaling.

The strong link between the involved human biological pathways and COVID-19 pathogenicity support the hypothesis that the proposed IPGS equation may contribute significantly to predicting the disease severity of COVID-19. Indeed, an overall significant increase of performance was obtained in comparison with the model based on solely on age and sex. Furthermore, the IPGS is significantly associated with severity, showing an OR of 2.46 after adjustment for age, sex, and comorbidities. This indicates that IPGS is a novel prognostic factor that should be considered in the management of COVID-19 patients.

Modelling precisely the role of the entire range of host genomics affecting disease susceptibility and severity in COVID-19 is critical to obtaining a complete biological understanding of the aetiology and pathogenicity of COVID-19 as well as other severe complex diseases. The application of IPGS based on Machine Learning principles within a post-Mendelian model allows us to more precisely identify the gene variants at play in COVID-19 as well as their specific roles, individually and in combination. This deep dive into the genetic architecture that allows for, contributes to, or even helps prevent diseases while increasing or decreasing their impact is critical for, and directly translatable into, (personalized) medicines development as well as prevention and treatment protocols. An integrated modelling of genetic variants based on a limited patient cohort, even limited in its geographical spread, may be sufficient for the development of diagnosis, and therapeutics across a wider range of populations. The advantage of this IPGS post-Mendelian model is that it learns and continues to learn as well as being a model from which we can obtain insights on the fundamental architecture of human genomics when confronted with severe and complex diseases.

Supplementary Information

Below is the link to the electronic supplementary material.

439_2021_2397_MOESM1_ESM.jpg (129.3KB, jpg)

Supplementary Figure 1. Barplots for 0.01 both. Barplot of significance values (NOM p-values, -log10 transformation) from GSEA analysis for all the pathways significant in both females (orange) and males (blue), p<0.01. Vertical dotted line indicates the adopted significance threshold (JPG 130 kb)

439_2021_2397_MOESM2_ESM.jpg (209.9KB, jpg)

Supplementary Figure 2. Representative heatmaps for 0.01_both. Heatmaps of the genes belonging to representative pathways significant in both females and males, p<0.01. The color gradient represents the weight of each gene, calculated as described in methods (JPG 210 kb)

439_2021_2397_MOESM3_ESM.jpg (236.8KB, jpg)

Supplementary Figure 3. Barplots for 0.005_any. Barplot of significance values (NOM p values, -log10 transformation) from GSEA analysis for all the pathways significant in either females (orange) or males (blue), p<0.005. Vertical dotted line indicates the adopted significance threshold (JPG 237 kb)

439_2021_2397_MOESM4_ESM.jpg (222.4KB, jpg)

Supplementary Figure 4. Representative heatmaps for 0.005_any. Heatmaps of the genes belonging to representative pathways significant in either females or males, p<0.005. The color gradient represents the weight of each gene, calculated as described in methods (JPG 223 kb)

Acknowledgements

This study is part of the GEN-COVID Multicenter Study, https://sites.google.com/dbm.unisi.it/gen-covid, the Italian multicenter study aimed at identifying the COVID-19 host genetic bases. Specimens were provided by the COVID-19 Biobank of Siena, which is part of the Genetic Biobank of Siena, member of BBMRI-IT, of Telethon Network of Genetic Biobanks (project no. GTB18001), of EuroBioBank and of D-Connect. We thank the CINECA consortium for providing computational resources and the Network for Italian Genomes (NIG) http://www.nig.cineca.it for its support. We thank private donors for the support provided to A.R. (Department of Medical Biotechnologies, University of Siena) for the COVID-19 host genetics research project (D.L n.18 of March 17, 2020). We also thank the COVID-19 Host Genetics Initiative (https://www.covid19hg.org/). We thank Uppsala Intensive Care COVID-19 research group; Tomas Luther, Anders Larsson, Katja Hanslin, Anna Gradin, Sarah Galien, Sara Bulow Anderberg, Jacob Rosén and Sten Rubertsson. We thank the patients and their loved ones who volunteered to contribute to this study at one of the most difficult times in their lives, and the research staff in every intensive care unit who recruited patients at personal risk during the most extreme conditions we have ever witnessed in UK hospitals. Whole-genome sequencing was done by Illumina in partnership with the University of Edinburgh and Genomics England and was funded by UK Department of Health and Social Care, UKRI and LifeArc. The views expressed are those of the authors and not necessarily those of the DHSC, DID, NIHR, MRC, Wellcome Trust or PHE. The Health Research Board of Ireland (Clinical Trial Network Award 2014-12) funds collection of samples in Ireland. This study owes a great deal to the National Institute of Healthcare Research Clinical Research Network (NIHR CRN) and the Chief Scientist Office (Scotland), who facilitate recruitment into research studies in NHS hospitals, and to the global ISARIC and InFACT consortia. We also acknowledge funding provided by the German Research Foundation (DFG) through the NGS competence centres (INST 37/1049-1, INST 216/981-1, INST 257/605-1, INST 269/768-1 and INST 217/988-1) and scientific support through the German COVID multiomics initiative (DeCOI). Recruitment of the COMRI Study/of part of the DeCOI samples was funded by the Klinikum rechts der Isar, Technical University Munich, Munich, Germany. We received additional support by the DFG (LU 1944/3-1, to K.U.L. and SCHU 2419/2-1, to E.C.S.), the BONFOR program of the Medical Faculty of the University of Bonn (2020-1A-15, to A.S.) and the Munich Clinician Scientist Program (MCSP, to E.C.S.).

GEN-COVID Multicenter Study (https://sites.google.com/dbm.unisi.it/gen-covid) Francesca Montagnani1,27, Mario Tumbarello1,27, Ilaria Rancan1,27, Massimiliano Fabbiani27, Barbara Rossetti27, Laura Bergantini28, Miriana D’Alessandro28, Paolo Cameli28, David Bennett28, Federico Anedda29, Simona Marcantonio29, Sabino Scolletta29, Federico Franchi29, Maria Antonietta Mazzei30, Susanna Guerrini30, Edoardo Conticini31, Luca Cantarini31, Bruno Frediani31, Danilo Tacconi32, Chiara Spertilli Raffaelli32, Marco Feri33, Alice Donati33, Raffaele Scala34, Luca Guidelli34, Genni Spargi35, Marta Corridi35, Cesira Nencioni36, Leonardo Croci36, Gian Piero Caldarelli37, Maurizio Spagnesi38, Davide Romani38, Paolo Piacentini38, Maria Bandini38, Elena Desanctis38, Silvia Cappelli38, Anna Canaccini39, Agnese Verzuri39, Valentina Anemoli39, Manola Pisani39, Agostino Ognibene40, Alessandro Pancrazzi40, Maria Lorubbio40, Massimo Vaghi41, Antonella D’Arminio Monforte42, Federica Gaia Miraglia42, Mario U. Mondelli43,44,, Massimo Girardis45, Sophie Venturelli45, Stefano Busani45, Andrea Cossarizza46, Andrea Antinori47, Alessandra Vergori47, Arianna Emiliozzi47, Stefano Rusconi48,49, Matteo Siano49, Arianna Gabrieli49, Agostino Riva48,49, Daniela Francisci50,, Elisabetta Schiaroli50, Francesco Paciosi50, Andrea Tommasi50, Pier Giorgio Scotton51, Francesca Andretta51, Sandro Panese52, Stefano Baratti52, Renzo Scaggiante53, Francesca Gatti53, Saverio Giuseppe Parisi54, Francesco Castelli55, Eugenia Quiros-Roldan55, Melania Degli Antoni55, Isabella Zanella56,57, Matteo Della Monica58, Carmelo Piscopo58, Mario Capasso59,60,61, Roberta Russo59,60, Immacolata Andolfo59,60, Achille Iolascon59,60, Giuseppe Fiorentino62, Massimo Carella63, Marco Castori63, Filippo Aucella64, Pamela Raggi65, Rita Perna65, Matteo Bassetti66,67, Antonio Di Biagio66,67, Maurizio Sanguinetti68,69, Luca Masucci68,69, Alessandra Guarnaccia68, Serafina Valente70, Oreste De Vivo70, Gabriella Doddato1,2, Rossella Tita5, Annarita Giliberti1,2, Maria Antonietta Mencarelli5, Caterina Lo Rizzo5, Anna Maria Pinto5, Valentina Perticaroli1,2,5, Francesca Ariani1,2,5, Miriam Lucia Carriero1,2, Laura Di Sarno1,2, Diana Alaverdian1,2, Elena Bargagli28, Marco Mandalà71, Alessia Giorli71, Lorenzo Salerni71, Patrizia Zucchi72, Pierpaolo Parravicini72, Elisabetta Menatti73, Tullio Trotta74, Ferdinando Giannattasio74, Gabriella Coiro74, Fabio Lena75, Leonardo Gianluca Lacerenza75, Domenico A. Coviello76, Cristina Mussini77, Enrico Martinelli78, Sandro Mancarella79, Luisa Tavecchia79, Mary Ann Belli79, Lia Crotti80,81,82,83,84, Gianfranco Parati80,81, Maurizio Sanarico85, Francesco Raimondi86, Filippo Biscarini87, Alessandra Stella87, Marco Rizzi88, Franco Maggiolo88, Diego Ripamonti88, Claudia Suardi89, Tiziana Bachetti90, Maria Teresa La Rovere91, Simona Sarzi-Braga92, Maurizio Bussotti93, Katia Capitani94, Simona Dei95, Sabrina Ravaglia96, Rosangela Artuso97, Elena Andreucci97, Giulia Gori97, Angelica Pagliazzi97, Erika Fiorentini97, Antonio Perrella98, Francesco Bianchi98, Paola Bergomi99, Emanuele Catena99, Riccardo Colombo99, Sauro Luchi100, Giovanna Morelli100, Paola Petrocelli100, Sarah Iacopini100, Sara Modica100, Silvia Baroni101, Francesco Vladimiro Segala102, Francesco Menichetti103, Marco Falcone103, Giusy Tiseo103, Chiara Barbieri103, Tommaso Matucci103, Davide Grassi104, Claudio Ferri105, Franco Marinangeli106, Francesco Brancati107, Antonella Vincenti108, Valentina Borgo108, Lombardi Stefania108, Mirco Lenzi108, Massimo Antonio Di Pietro109, Francesca Vichi109, Benedetta Romanin109, Letizia Attala109, Cecilia Costa109, Andrea Gabbuti109, Menè Roberto80,81, Umberto Zuccon110, Lucia Vietri110, Stefano Ceri111, Pietro Pinoli111, Patrizia Casprini112, Giuseppe Merla113,114, Gabriella Maria Squeo113, Marcello Maffezzoni115, Raffaele Bruno116,117, Marco Vecchia116, Marta Colaneri116, Serena Ludovisi118. WES/WGS working group within the HGI (https://www.covid19hg.org/projects/) The genetic predisposition to severe COVID-19 (SweCovid): Yanara Marincevic-Zuniga119, Jessica Nordlund119, Tomas Luther16, Anders Larsson120, Katja Hanslin16, Anna Gradin16, Sarah Galien16, Sara Bulow Anderberg16, Jacob Rosén16, Sten Rubertsson16, Hugo Zeberg15,, Robert Frithiof16, Miklós Lipcsey16,17, Michael Hultström16,18. GenOMICC Consortium GenOMICC co-investigators: Sara Clohisey25, Peter Horby121, Johnny Millar25, Julian Knight122, Hugh Montgomery123, David Maslove124, Lowell Ling125, Alistair Nichol126, Charlotte Summers127, Tim Walsh26, Charles Hinds128, Malcolm G. Semple129,130, Peter J.M. Openshaw131,132, Manu Shankar-Hari133, Antonia Ho134, Danny McAuley135,136, Chris Ponting24, Kathy Rowan137, J. Kenneth Baillie24,25,26. Central management and laboratory team: Fiona Griffiths25, Wilna Oosthuyzen25 Jen Meikle25, Paul Finernan25, James Furniss25, Ellie Mcmaster25, Andy Law25, Sara Clohisey25, J. Kenneth Baillie24,25,26, Trevor Paterson25, Tony Wackett25, Ruth Armstrong25, Lee Murphy138, Angie Fawkes138, Richard Clark138, Audrey Coutts138, Lorna Donnelly138, Tammy Gilchrist138, Katarzyna Hafezi138, Louise Macgillivray138, Alan Maclean138, Sarah McCafferty138, Kirstie Morrice138, Jane Weaver25, Ceilia Boz25, Ailsa Golightly25, Mari Ward25, Hanning Mal25, Helen Szoor-McElhinney25, Adam Brown25, Ross Hendry25, Andrew Stenhouse25, Louise Cullum25, Dawn Law25, Sarah Law25, Rachel Law25, Max Head Fourman25, Maaike Swets25, Nicky Day25, Filip Taneski25, Esther Duncan25, Marie Zechner25, Nicholas Parkinson25. Data analysis team: Erola Pairo-Castineira24,25, Sara Clohisey25, Lucija Klaric24, Andrew D. Bretherick24, Konrad Rawlik25, Dorota Pasko139, Susan Walker139, Nick Parkinson25, Max Head Fourman25, Clark D. Russell25,140, James Furniss25, Anne Richmond24, Elvina Gountouna141, David Harrison137, Bo Wang24, Yang Wu142, Alison Meynert24, Athanasios Kousathanas139, Loukas Moutsianas139, Zhijian Yang143, Ranran Zhai143, Chenqing Zheng143, Graeme Grimes24, Jonathan Millar25, Barbara Shih25, Marie Zechner25, Jian Yang144,145, Xia Shen143,146,147, Chris P. Ponting2, Albert Tenesa2,25,146, Kathy Rowan137, Andrew Law25, Veronique Vitart24, James F. Wilson24,146, J. Kenneth Baillie24,25,26. Barts Health NHS Trust, London, UK: D. Collier149, S. Wood148, A. Zak148, C. Borra148, M. Matharu148, P. May148, Z. Alldis148, O. Mitchelmore148, R. Bowles148, A. Easthorpe148, F. Bibi148, I. Lancoma-Malcolm148, J. Gurasashvili148, J. Pheby148, J. Shiel148, M. Bolton148, M. Patel148, M. Taylor148, O. Zongo148, P. Ebano148, P. Harding148, R. Astin-Chamberlain148, Y. Choudhury148, A. Cox148, D. Kallon148, M. Burton148, R. Hall148, S. Blowes148, Z. Prime148, J. Biddle148, O. Prysyazhna148, T. Newman148, C. Tierney148, J. Kassam148. Guys and St Thomas’ Hospital, London, UK: M. Shankar-Hari149, M. Ostermann149, S. Campos149, A. Bociek149, R. Lim149, N. Grau149, T. O. Jones149, C. Whitton149, M. Marotti40, G. Arbane149. James Cook University Hospital, Middlesburgh, UK: S. Bonner150, K. Hugill150, J. Reid150. The Royal Liverpool University Hospital, Liverpool, UK: I. Welters151, V. Waugh151, K. Williams151, D. Shaw151, J. Fernandez Roman151, M. Lopez Martinez151, E. Johnson151, A. Waite151, B. Johnson151, O. Hamilton151, S. Mulla151. King’s College Hospital, London, UK: M. McPhail152, J. Smith152. Royal Infirmary of Edinburgh, Edinburgh, UK: J. K. Baillie23,24,25, L. Barclay153, D. Hope153, C. McCulloch153, L. McQuillan153, S. Clark153, J. Singleton153, K. Priestley153, N. Rea153, M. Callaghan153, R. Campbell153, G. Andrew153, L. Marshall153. John Radcliffe Hospital, Oxford, UK: S. McKechnie154, P. Hutton154, A. Bashyal154, N. Davidson154. Addenbrooke’s Hospital, Cambridge, UK: C. Summers155, P. Polgarova155, K. Stroud155, N. Pathan155, K. Elston155, S. Agrawal155. Morriston Hospital, Swansea, UK: C. Battle156, L. Newey156, T. Rees156, R. Harford156, E. Brinkworth156, M. Williams156, C. Murphy156. Ashford and St Peter’s Hospital, Surrey, UK: I. White157, M. Croft157. Royal Stoke University Hospital, Staffordshire, UK: N. Bandla158, M. Gellamucho158, J. Tomlinson158, H. Turner158, M. Davies158, A. Quinn158, I. Hussain158, C. Thompson158, H. Parker158, R. Bradley158, R. Griffiths158. Queen Elizabeth Hospital, Birmingham, UK: J. Scriven159, J. Gill159. Glasgow Royal Infirmary, Glasgow, UK: A. Puxty160, S. Cathcart160, D. Salutous160, L. Turner160, K. Duffy160, K. Puxty160. Kingston Hospital, Surrey, UK: A. Joseph161, R. Herdman-Grant161, R. Simms161, A. Swain161, A. Naranjo161, R. Crowe161, K. Sollesta161, A. Loveridge161, D. Baptista161, E. Morino161. The Tunbridge Wells Hospital and Maidstone Hospital, Kent, UK: M. Davey162, D. Golden162, J. Jones162. North Middlesex University Hospital NHS trust, London, UK: J. Moreno Cuesta163, A. Haldeos163, D. Bakthavatsalam163, R. Vincent163, M. Elhassan163, K. Xavier163, A. Ganesan163, D. Purohit M. Abdelrazik163. Bradford Royal Infirmary, Bradford, UK: J. Morgan164, L. Akeroyd164, S. Bano164, D. Warren164, M. Bromley164, K. Sellick164, L. Gurr164, B. Wilkinson164, V. Nagarajan164, P. Szedlak164. Blackpool Victoria Hospital, Blackpool, UK: J. Cupitt165, E. Stoddard165, L. Benham165, S. Preston165, N. Slawson165, Z. Bradshaw165, J. brown165, M. Caswell165, SMelling165. Countess of Chester Hospital, Chester, UK: P. Bamford165, M. Faulkner166, K. Cawley166, H. Jeffrey166, E. London166, H. Sainsbury166, I. Nagra166, F. Nasir166, Ce Dunmore166, R. Jones166, A. Abraheem166, M. Al-Moasseb166, R. Girach166. Wythenshawe Hospital, Manchester, UK: C. Brantwood167, P. Alexander167, J. Bradley-Potts167, S. Allen167, T. Felton167. St George’s Hospital, London, UK: S. Manna168, S. Farnell-Ward168, S. Leaver168, J. Queiroz168, E. Maccacari168, D. Dawson168, C. Castro Delgado168, R. Pepermans Saluzzio168, O. Ezeobu168, L. Ding168, C. Sicat168, R. Kanu168, G. Durrant168, J. Texeira168, A. Harrison168, T. Samakomva168. Good Hope Hospital, Birmingham, UK: J. Scriven169, H. Willis169, B. Hopkins169, L. Thrasyvoulou169. Stepping Hill Hospital, Stockport, UK: M. Jackson170, A. Zaki170, C. Tibke170, S. Bennett170, W. Woodyatt170, A. Kent170, E. Goodwin170. Manchester Royal Infirmary, Manchester, UK: C. Brandwood171, R. Clark171, L. Smith171. Royal Alexandra Hospital, Paisley, UK: K. Rooney172, N. Thomson172, N. Rodden172, E. Hughes172, D. McGlynn172, C. Clark172, P. Clark172, L. Abel172, R. Sundaram172, L. Gemmell172, M. Brett172, J. Hornsby172, P. MacGoey172, R. Price172, B. Digby172, P. O’Neil172, P. McConnell172, P. Henderson172. Queen Elizabeth University Hospital, Glasgow, UK: S. Henderson173, M. Sim173, S. Kennedy-Hay173, C. McParland173, L. Rooney173, N. Baxter173. Queen Alexandra Hospital, Portsmouth, UK: D. Pogson174, S. Rose174, Z. Daly174, L. Brimfield174. BHRUT (Barking Havering)—Queens Hospital and King George Hospital, Essex, UK: M. K. Phull175, M. Hussain175, T. Pogreban175, L. Rosaroso175, E. Salciute L. Grauslyte175. University College Hospital, London, UK: D. Brealey176, E. Wraith176, N. MacCallum176, G. Bercades176, I. Hass176, D. Smyth176, A. Reyes176, G. Martir176. Royal Victoria Infirmary, Newcastle Upon Tyne, UK. I. D. Clement177, K. Webster177, C. Hays177, A. Gulati177. Western Sussex Hospitals, West Sussex, UK. L. Hodgson178, M. Margarson178, R. Gomez178, Y. Baird178, Y. Thirlwall178, L. Folkes178, A. Butler178, E. Meadows178, S. Moore178, D. Raynard178, H. Fox178, L. Riddles178, K. King178, S. Kimber178, G. Hobden178, A. McCarthy178, V. Cannons178, I. Balagosa178, I. Chadbourn178, A. Gardner178. Salford Royal Hospital, Manchester, UK. D. Horner179, D. McLaughlanv179, B. Charles179, N. Proudfoot179, T. Marsden179, L. Mc Morrow179, B. Blackledge179, J. Pendlebury179, A. Harvey179, E. Apetri179, C. Basikolo179, L. Catlow179, R. Doonan179, K. Knowles179, S. Lee179, D. Lomas179, C. Lyons179, J. Perez179, M. Poulaka179, M. Slaughter179, K. Slevin179, M. Taylor179, V. Thomas179, D. Walker179, J. Harris179. The Royal Oldham Hospital, Manchester, UK. A. Drummond180, R. Tully180, J. Dearden180, J. Philbin180, S. Munt180, C. Rishton180, G. O’Connor180, M. Mulcahy180, E. Dobson180, J. Cuttler180, M. Edward180. Pinderfields General Hospital, Wakefield, UK: A. Rose181, B. Sloan181, S. Buckley181, H. Brooke181, E. Smithson181, R. Charlesworth181, R. Sandu181, M. Thirumaran181, V. Wagstaff181, J. Cebrian Suarez181. Basildon Hospital, Basildon, UK: A. Kaliappan182, M. Vertue182, A. Nicholson182, J. Riches182, A. Solesbury182, L. Kittridge182, M. Forsey182, G. Maloney182. University Hospital of Wales, Cardiff, UK: J. Cole183, M. Davies183, R. Davies183, H. Hill183, E. Thomas183, A. Williams183, D. Duffin183, B. Player183. Broomfield Hospital, Chelmsford, UK: J. Radhakrishnan184, S. Gibson184, A. Lyle184, F. McNeela184. Royal Brompton Hospital, London, UK: B. Patel185, M. Gummadi185, G. Sloane185, N. Dormand185, S. Salmi185, Z. Farzad185, D. Cristiano185, K. Liyanage185, V. Thwaites185, M. Varghese185. Nottingham University Hospital, Nottingham, UK: M. Meredith186. Royal Hallamshire Hospital and Northern General Hospital, Sheffield, UK. G. Mills187, J. Willson187, K. Harrington187, B. Lenagh187, K. Cawthron187, S. Masuko187, A. Raithatha187, K. Bauchmuller187, N. Ahmad187, J. Barker187, Y. Jackson187, F. Kibutu187, S. Bird187. Royal Hampshire County Hospital, Hampshire, UK. G. Watson188, J. Martin188, E. Bevan188, C. Wrey Brown188, D. Trodd188. Queens Hospital Burton, Burton-On-Trent, UK: K. English189, G. Bell189, L. Wilcox189, A. Katary189, New Cross Hospital, Wolverhampton, UK: S. Gopal190, V. Lake190, N. Harris190, S. Metherell190, E. Radford190. Heartlands Hospital, Birmingham, UK: J. Scriven191, F. Moore191, H. Bancroft191, J. Daglish191, M. Sangombe191, M. Carmody191, J. Rhodes191, M. Bellamy191. Walsall Manor Hospital, Walsall, UK: A. Garg192, A. Kuravi192, E. Virgilio192, P. Ranga192, J. Butler192, L. Botfield192, C. Dexter192, J. Fletcher192. Stoke Mandeville Hospital, Buckinghamshire, UK: P. Shanmugasundaram193, G. Hambrook193, I. Burn193, K. Manso193, D. Thornton193, J. Tebbutt193, R. Penn193. Sandwell General Hospital, Birmingham, UK: J. Hulme194, S. Hussain194, Z. Maqsood194, S. Joseph194, J. Colley194, A. Hayes194, C. Ahmed194, R. Haque194, S. Clamp194, R. Kumar194, M. Purewal194, B. Baines194. Royal Berkshire NHS Foundation Trust, Berkshire, UK: M. Frise195, N. Jacques195, H. Coles195, J. Caterson195, S. Gurung Rai195, M. Brunton195, E. Tilney195, L. Keating195, A. Walden195. Charing Cross Hospital, St Mary’s Hospital and Hammersmith Hospital, London, UK: D. Antcliffe196, A. Gordon196, M. Templeton196, R. Rojo196, D. Banach196, S. Sousa Arias196, Z. Fernandez196, P. Coghlan196. Dumfries and Galloway Royal Infirmary, Dumfries, UK: D. Williams197, C. Jardine197. Bristol Royal Infirmary, Bristol, UK: J. Bewley198, K. Sweet198, L. Grimmer198, R. Johnson198, Z. Garland198, B. Gumbrill198. Royal Sussex County Hospital, Brighton, UK: C. Phillips199, L. Ortiz-Ruiz de Gordoa199, E. Peasgood199. Whiston Hospital, Prescot, UK: A. Tridente200, K. Shuker S. Greer200. Royal Glamorgan Hospital, Cardiff, UK: C. Lynch201, C. Pothecary201, L. Roche201, B. Deacon201, K. Turner201, J. Singh201, G. Sera Howe201. King’s Mill Hospital, Nottingham, UK: P. Paul202, M. Gill202, I. Wynter202, V. Ratnam202, S. Shelton202. Fairfield General Hospital, Bury, UK: J. Naisbitt203, J. Melville203. Western General Hospital, Edinburgh, UK: R. Baruah204, S. Morrison204. Northwick Park Hospital, London, UK: A. McGregor205, V. Parris205, M. Mpelembue205, S. Srikaran205, C. Dennis205, A. Sukha205. Royal Preston Hospital, Preston, UK: A. Williams206, M. Verlande206. Royal Derby Hospital, Derby, UK: K. Holding207, K. Riches207, C. Downes207, C. Swan207. Sunderland Royal Hospital, Sunderland, UK: A. Rostron208, A. Roy208, L. Woods208, S. Cornell208, F. Wakinshaw208. Royal Surrey County Hospital, Guildford, UK. B. Creagh-Brown209, H. Blackman209, A. Salberg209, E. Smith209, S. Donlon209, S. Mtuwa209, N. Michalak-Glinska209, S. Stone209, C. Beazley209, V. Pristopan209. Derriford Hospital, Plymouth, UK: N. Nikitas210, L. Lankester210, C. Wells210. Croydon University Hospital, Croydon, UK: A. S. Raj211, K. Fletcher211, R. Khade211, G. Tsinaslanidis211. Victoria Hospital, Kirkcaldy, UK: M. McMahon212, S. Fowler212, A. McGregor212, T. Coventry212. Milton Keynes University Hospital, Milton Keynes, UK: R. Stewart213, L. Wren213, E. Mwaura213, L. Mew213, A. Rose213, D. Scaletta213, F. Williams213. Barnsley Hospital, Barnsley, UK: K. Inweregbu214, A. Nicholson214, N. Lancaster214, M. Cunningham214, A. Daniels214, L. Harrison214, S. Hope214, S. Jones214, A. Crew214, G. Wray214, J. Matthews214, R. Crawley214. York Hospital, York, UK: J. Carter215, I. Birkinshaw215, J. Ingham215, Z. Scott215, K. Howard215, R. Joy215, S. Roche215. University Hospital of North Tees, Stockton on Tees, UK: M. Clark216, S. Purvis216. University Hospital Wishaw, Wishaw, UK: A. Morrison217, D. Strachan217, M. Taylor217, S. Clements217, K. Black217. Whittington Hospital, London, UK: C. Parmar218, A. Altabaibeh218, K. Simpson218, L. Mostoles218, K. Gilbert218, L. Ma218, A. Alvaro218. Southmead Hospital, Bristol, UK: M. Thomas219, B. Faulkner219, R. Worner219, K. Hayes219, E. Gendall219, H. Blakemore219, B. Borislavova219, E. Goff219. The Royal Papworth Hospital, Cambridge, UK: A. Vuylsteke220, L. Mwaura220, J. Zamikula220, L. Garner220, A. Mitchell220, S. Mepham220, L. Cagova220, A. Fofano220, H. Holcombe220, K. Praman220. Royal Gwent Hospital, Newport, UK: T. Szakmany221, A. E. Heron221, S. Cherian221, S. Cutler221, A. Roynon-Reed221. Norfolk and Norwich University hospital (NNUH), Norwich, UK: G. Randell222, K. Convery222, K. Stammers D. Fottrell-Gould222, L. Hudig222, J. Keshet-price222. Great Ormond St Hospital and UCL Great Ormond St Institute of Child Health NIHR Biomedical Research Centre, London, UK: M. Peters223, L. O’Neill223, S. Ray223, H. Belfield223, T. McHugh223, G. Jones223, O. Akinkugbe223, A. Tomas223, E. Abaleke223, E. Beech223, H. Meghari223, S. Yussuf223, A. Bamford223. Airedale General Hospital, Keighley, UK: B. Hairsine224, E. Dooks224, F. Farquhar224, S. Packham224, H. Bates224, C. McParland224, L. Armstrong224. Aberdeen Royal Infirmary, Aberdeen, UK: C. Kaye225, A. Allan225, J. Medhora225, J. Liew225, A. Botello225, F. Anderson225. Southampton General Hospital, Southampton, UK: R. Cusack226, H. Golding226, K. Prager226, T. Williams226, S. Leggett226, K. Golder226, M. Male226, O. Jones226, K. Criste226, M. Marani226. Russell’s Hall Hospital, Dudley, UK: V. Anumakonda227, V. Amin227, K. Karthik227, R. Kausar227, E. Anastasescu227, K. Reid227, M. Jacqui227. Rotherham General Hospital, Rotherham, UK: A. Hormis228, R. Walker228, D. Collier228. North Manchester General Hospital, Manchester, UK: T. Duncan229, A. Uriel229, A. Ustianowski229, H. T-Michael229, M. Bruce229, K. Connolly229, K. Smith229. Basingstoke and North Hampshire Hospital, Basingstoke, UK: R. Partridge230, D. Griffin230, M. McDonald230, N. Muchenje230. Royal Free Hospital, London, UK: D. Martin231, H. Filipe231, C. Eastgate231, C. Jackson231. Hull Royal Infirmary, Hull, UK: A. Gratrix232, L. Foster232, V. Martinson232, E. Stones232, Caroline Abernathy232, P. Parkinson232. Harefield Hospital, London, UK: A. Reed233, C. Prendergast233, P. Rogers233, M. Woodruff233, R. Shokkar233, S. Kaul233, A. Barron233, C. Collins233. Chesterfield Royal Hospital Foundation Trust, Chesterfield, UK: S. Beavis234, A. Whileman234, K. Dale234, J. Hawes234, K. Pritchard234, R. Gascoyne234, L. Stevenson234. Barnet Hospital, London, UK: R. Jha235, L. Lim235, V. Krishnamurthy235. Aintree University Hospital, Liverpool, UK: R. Parker236, I. Turner-Bone236, L. Wilding236, A. Reddy236. St James’s University Hospital and Leeds General Infirmary, Leeds, UK: S. Whiteley237, E. Wilby237, C. Howcroft237, A. Aspinwall237, S. Charlton237, B. Ogg237. Glan Clwyd Hospital, Bodelwyddan, UK: D. Menzies238, R. Pugh238, E. Allan238, R. Lean238, F. Davies238, J. Easton238, X. Qiu238, S. Kumar238, K. Darlington238. University Hospital Crosshouse, Kilmarnock, UK: G. Houston239, P. O’Brien239, T. Geary239, J. Allan239, A. Meikle239. Royal Bolton Hospital, Bolton, UK: G. Hughes240, M. Balasubramaniam240, S. Latham240, E. McKenna240, R. Flanagan240. Princess of Wales Hospital, Llantrisant, UK: S. Sathe24, E. Davies24, L. Roche241. Pilgrim Hospital, Lincoln, UK: M. Chablani242, A. Kirkby242, K. Netherton242, S. Archer242. Northumbria Healthcare NHS Foundation Trust, North Shields, UK: B. Yates243, C. Ashbrook-Raby243. Ninewells Hospital, Dundee, UK: S. Cole244, M. Casey244, L. Cabrelli244, S. Chapman244, M. Casey244, P. Austin244, A. Hutcheon244, C. Whyte244, C. Almaden-Boyle244. Lister Hospital, Stevenage, UK: N. Pattison245, C. Cruz245. Bedford Hospital, Bedford, UK: A. Vochin246, H. Kent246, A. Thomas246, S. Murdoch246, B. David246, M. Penacerrada246, G. Lubimbi246, V. Bastion246, R. Wulandari246, J. Valentine246, D. Clarke246. Royal United Hospital, Bath, UK: A. Serrano-Ruiz247, S. Hierons247, L. Ramos247, C. Demetriou247, S. Mitchard247, K. White247. Royal Bournemouth Hospital, Bournemouth, UK: N. White248, S. Pitts248, D. Branney248, J. Frankham248. The Great Western Hospital, Swindon, UK: M. Watters249, H. Langton249, R. Prout249. Watford General Hospital, Watford, UK: V. Page250, T. Varghes250. University Hospital North Durham, Darlington, UK: A. Cowton251, A. Kay251, K. Potts251, M. Birt251, M. Kent251, A. Wilkinson251. Tameside General Hospital, Ashton Under Lyne, UK: E. Jude252, V. Turner252, H. Savill252, J. McCormick252, M. Clark252, M. Coulding252, S. Siddiqui252, O. Mercer252, H. Rehman252, D. Potla252. Princess Royal Hospital, Telford and Royal Shrewsbury Hospital, Shrewsbury, UK: N. Capps253, D. Donaldson253, J. Jones253, H. Button253, T. Martin253, K. Hard253, A. Agasou253, L. Tonks253, T. Arden253, P. Boyle253, M. Carnahan253, J. Strickley253, C. Adams253, D. Childs253, R. Rikunenko253, M. Leigh253, M. Breekes253, R. Wilcox253, A. Bowes253, H. Tiveran253, F. Hurford253, J. Summers253, A. Carter253, Y. Hussain253, L. Ting253, A. Javaid253, N. Motherwell253, H. Moore253, H. Millward253, S. Jose253, N. Schunki253, A. Noakes253, C. Clulow253. Arrowe Park Hospital, Wirral, UK: G. Sadera254, R. Jacob254, C. Jones254. The Queen Elizabeth Hospital, King’s Lynn, UK: M. Blunt255, Z. Coton255, H. Curgenven255, S. Mohamed Ally255, K. Beaumont255, M. Elsaadany255, K. Fernandes255, I. Ali Mohamed Ali255, H. Rangarajan255, V. Sarathy255, S. Selvanayagam255, D. Vedage255, M. White255. Royal Blackburn Teaching Hospital, Blackburn, UK: M. Smith256, N. Truman256, S. Chukkambotla256, S. Keith256, J. Cockerill-Taylor256, J. Ryan-Smith256, R. Bolton256, P. Springle256, J. Dykes256, J. Thomas256, M. Khan256, M. T. Hijazi256, E. Massey256, G. Croston256. Poole Hospital, Poole, UK: H. Reschreite r258, J. Camsooksai257, S. Patch257, S. Jenkins257, C. Humphrey257, B. Wadams257, J. Camsooksai257. Medway Maritime Hospital, Gillingham, UK: N. Bhatia258, M. Msiska258, O. Adanini258. Warwick Hospital, Warwick, UK: B. Attwood259, P. Parsons259. The Royal Marsden Hospital, London, UK: K. Tatham260, S. Jhanji260, E. Black260, A. Dela Rosa260, R. Howle260, B. Thomas260, T. Bemand260, R. Raobaikady260. The Princess Alexandra Hospital, Harlow, UK: R. Saha261, N. Staines261, A. Daniel261, J. Finn261. Musgrove Park Hospital, Taunton, UK: J. Hutter262, P. Doble262, C. Shovelton262, C. Pawley262. George Eliot Hospital NHS Trust, Nuneaton, UK: T. Kannan263, M. Hill263. East Surrey Hospital, Redhill, UK: E. Combes264, S. Monnery264, T. Joefield264. West Middlesex Hospital, Isleworth, UK: M. Popescu265, M. Thankachen265, M. Oblak265. Warrington General Hospital, Warrington, UK: J. Little266, S. McIvor266, A. Brady266, H. Whittle266, H. Prady266, R. Chan266. Southport and Formby District General Hospital, Ormskirk, UK: A. Ahmed267, A. Morris267. Royal Devon and Exeter Hospital, Exeter, UK: C. Gibson268, E. Gordon268, S. Keenan268, H. Quinn268, S. Benyon268, S. Marriott268, L. Zitter268, L. Park268, K. Baines268. Macclesfield District General Hospital, Macclesfield, UK: M. Lyons269, M. Holland269, N. Keenan269, M. Young269. Borders General Hospital, Melrose, UK: S. Garrioch270,J. Dawson270, M. Tolson270. Birmingham Children’s Hospital, Birmingham, UK: B. Scholefield271, R. Bi271. William Harvey Hospital, Ashford, UK: N. Richardson272, N. Schumacher272, T. Cosier272, G. Millen272. Royal Lancaster Infirmary, Lancaster, UK: A. Higham273, K. Simpson273. Queen Elizabeth the Queen Mother Hospital, Margate, UK: S. Turki274, L. Allen274, N. Crisp274, T. Hazleton274, A. Knight274, J. Deery274, C. Price274, S. Turney274, S. Tilbey274, E. Beranova274. Liverpool Heart and Chest Hospital, Liverpool, UK: D. Wright275, L. Georg275, S. Twiss275. Darlington Memorial Hospital, Darlington, UK: A. Cowton276, S. Wadd276, K. Postlethwaite276. Southend University Hospital, Westcliff-on-Sea, UK: P. Gondo277, B. Masunda277, A. Kayani277, B. Hadebe277. Raigmore Hospital, Inverness, UK: J. Whiteside278, R. Campbell278, N. Clarke278. Salisbury District Hospital, Salisbury, UK: P. Donnison278, F. Trim279, I. Leadbitter279. Peterborough City Hospital, Peterborough, UK: D. Butcher280, S. O’Sullivan280. Ipswich Hospital, Ipswich, UK: B. Purewal281, S. Bell281, V. Rivers’281. Hereford County Hospital, Hereford, UK: R. O’Leary282, J. Birch282, E. Collins282, S. Anderson282, K. Hammerton282, E. Andrews282. Furness General Hospital, Barrow-in-Furness, UK: A. Higham283, K. Burns283. Forth Valley Royal Hospital, Falkirk, UK: I. Edmond284, D. Salutous284, A. Todd284, J. Donnachie284, P. Turner284, L. Prentice284, L. Symon284, N. Runciman284, F. Auld284. Torbay Hospital, Torquay, UK: M. Halkes285, P. Mercer285, L. Thornton285. St Mary’s Hospital, Newport, UK: G. Debreceni286, J. Wilkins286, A. Brown286, V. Crickmore286. Royal Manchester Children’s Hospital, Manchester, UK: G. Subramanian287, R. Marshall287, C. Jennings287, M. Latif278, L. Bunni287. Royal Cornwall Hospital, Truro, UK: M. Spivey288, S. Bean288, K. Burt288. Queen Elizabeth Hospital Gateshead, Gateshead, UK: V. Linnett289, J. Ritzema289, A. Sanderson289, W. McCormick289, M. Bokhari289. Kent & Canterbury Hospital, Canterbury, UK: R. Kapoor290, D. Loader290. James Paget University Hospital NHS Trust, Great Yarmouth, UK: A. Ayers291, W. Harrison291, J. North291. Darent Valley Hospital, Dartford, UK: Z. Belagodu292, R. Parasomthy292, O. Olufuwa292, A. Gherman292, B. Fuller292, C. Stuart292. The Alexandra Hospital, Redditch and Worcester Royal Hospital, Worcester, UK: O. Kelsall293, C. Davis293, L. Wild293, H. Wood293, J. Thrush293, A. Durie293, K. Austin’293, K. Archer293, P. Anderson2934, C. Vigurs293. Ysbyty Gwynedd, Bangor, UK: C. Thorpe294, A. Thomas294, E. Knights294, N. Boyle294, A. Price294. Yeovil Hospital, Yeovil, UK: A. Kubisz-Pudelko295, D. Wood295, A. Lewis295, S. Board295, L. Pippard295, J. Perry295, K. Beesley295. University Hospital Hairmyres, East Kilbride, UK: A. Rattray296, M. Taylor296, E. Lee296, L. Lennon296, K. Douglas296, D. Bell296, R. Boyle296, L. Glass296. Scunthorpe General Hospital, Scunthorpe, UK: M. Nauman Akhtar297, K. Dent297, D. Potoczna297, S. Pearson297, E. Horsley297, S. Spencer297. Princess Royal Hospital Brighton, West Sussex, UK: C. Phillips298, D. Mullan298, D. Skinner298, J. Gaylard298, L. Ortiz-Ruizdegordoa298. Lincoln County Hospital, Lincoln, UK: R. Barber299, C. Hewitt299, A. Hilldrith299, S. Shepardson299, M. Wills299, K. Jackson-Lawrence299. Homerton University Hospital, London, UK: A. Gupta300, A. Easthope300, E. Timlick300, C. Gorman300. Glangwili General Hospital, Camarthen, UK: I. Otaha301, A. Gales301, S. Coetzee301, M. Raj301, M. Peiu301. Ealing Hospital, Southall, UK: V. Parris302, S. Quaid302, E. Watson302. Scarborough General Hospital, Scarborough, UK: K. Elliott303, J. Mallinson303, B. Chandler303, A. Turnbull303. Royal Albert Edward Infirmary, Wigan, UK: A. Quinn304, C. Finch304, C. Holl304, J. Cooper304, A. Evans304. Queen Elizabeth Hospital, Woolwich, London, UK: W. Khaliq305, A. Collins305, E. Treus Gude305. North Devon District Hospital, Barnstaple, UK: N. Love306, L. van Koutrik306, J. Hunt306, D. Kaye306, E. Fisher306, A. Brayne306, V. Tuckey306, P. Jackson306, J. Parkin306. National Hospital for Neurology and Neurosurgery, London, UK: D. Brealey307, E. Raith307, A. Tariq307, H. Houlden307, A. Tucci307, J. Hardy307, E. Moncur307. Eastbourne District General Hospital, East Sussex, UK and Conquest Hospital, East Sussex, UK: J. Highgate308, A. Cowley308. Diana Princess of Wales Hospital, Grimsby, UK: A. Mitra309, R. Stead309, T. Behan309, C. Burnett309, M. Newton309, E. Heeney309, R. Pollard309, J. Hatton309. The Christie NHS Foundation Trust, Manchester, UK: A. Patel310, V. Kasipandian310, S. Allibone310, R. M. Genetu310. Prince Philip Hospital, Lianelli, UK: I. Otahal311, L. O’Brien311, Z. Omar311, E. Perkins311, K. Davies311. Prince Charles Hospital, Merthyr Tydfil, UK: D. Tetla312, C. Pothecary312, B. Deacon312. Golden Jubilee National Hospital, Clydebank, UK: B. Shelley313, V. Irvine313. Dorset County Hospital, Dorchester, UK: S. Williams314, P. Williams314, J. Birch314, J. Goodsell314, R. Tutton314, L. Bough314, B. Winter-Goodwin314. Calderdale Royal Hospital, Halifax, UK: R. Kitson315, J. Pinnell315, A. Wilson315, T. Nortcliffe315, T. Wood315, M. Home315, K. Holdroyd315, M. Robinson315, R. Shaw315, J. Greig315, M. Brady315, A. Haigh315, L. Matupe315, M. Usher315, S. Mellor315, S. Dale315, L. Gledhill315, L. Shaw315, G. Turner315, D. Kelly315, B. Anwar315, H. Riley315, H. Sturgeon315, A. Ali315, L. Thomis315, D. Melia315, A. Dance315, K. Hanson315. West Suffolk Hospital, Suffolk, UK: S. Humphreys316, I. Frost316, V. Gopal316, J. Godden316, A. Holden316, S. Swann316. West Cumberland Hospital, Whitehaven, UK: T. Smith317, M. Clapham317, U. Poultney317, R. Harper317, P. Rice317. University Hospital Lewisham, London, UK: W. Khaliq318, R. Reece-Anthony318, B. Gurung318. St John’s Hospital Livingston, Livingston, UK: S. Moultrie319, M. Odam319. Sheffield Children’s Hospital, Sheffield, UK: A. Mayer320, A. Bellini320, A. Pickard320, J. Bryant320, N. Roe320, J. Sowter320. Hinchingbrooke Hospital, Huntingdon, UK: D. Butcher321, K. Lang321, J. Taylor321. Glenfield Hospital, Leicester, UK: P. Barry322. Bronglais General Hospital, Aberystwyth, UK: M. Hobrok323, H. Tench323, R. Wolf-Roberts323, H. McGuinness323, R. Loosley323. Alder Hey Children’s Hospital, Liverpool, UK: D. Hawcutt324, L. Rad324, L. O’Malley324, P. Saunderson324, G. Seddon324, T. Anderson324, N. Rogers324. University Hospital Monklands, Airdrie, UK: J. Ruddy325, M. Harkins325, M. Taylor325, C. Beith325, A. McAlpine325, L. Ferguson325, P. Grant325, S. MacFadyen325, M. McLaughlin325, T. Baird325, S. Rundell32, L. Glass325, B. Welsh325, R. Hamill325, F. Fisher325. Cumberland Infirmary, Carlisle, UK: T. Smith326, J. Gregory326, A. Brown326. German COVID-19 OMICS Initiative (DeCOI)—Host genetics subgroup Axel Schmidt20, Kerstin U. Ludwig20, Selina Rolker20, Markus M. Nöthen20, Julia Fazaal20, Verena Keitel328, Björn Jensen327, Torsten Feldt327, Lisa Knopp327, Julia Schröder21, Carlo Maj32, Fabian Brand328, Marc M. Berger328, Thorsten Brenner329, Anke Hinney330, Oliver Witzke331, Robert Bals332, Christian Herr332, Nicole Ludwig333, Jörn Walter334, Jochen Schneider335, Johanna Erber335, Christoph D. Spinner335,336, Clemens M. Wendtner337, Christof Winter338, Ulrike Protzer339, Nicolas Casadei340, Stephan Ossowski341, Susanne Motameny342, Olaf H. Riess342, Eva C. Schulte21,22.23. Quebec COVID-19 Biobank (BQC-19): J. Brent Richards12,14,15, Guillaume Butler-Laporte12,13. POLCOVID GEnomika: Mirosław Kwasniewski343, Urszula Korotko343, Karolina Chwialkowska343, Magdalena Niemira344, Jerzy Jaroszewicz345, Barbara Sobala-Szczygiel345, Beata Puzanowska346, Anna Parfieniuk-Kowerda347, Diana Martonik347, Anna Moniuszko-Malinowska348, Sławomir Pancewicz348, Dorota Zarębska-Michaluk349, Krzysztof Simon350,Monika Pazgan-Simon350, Iwona Mozer-Lisewska351, Maciej Bura351, Agnieszka Adamek351, Krzysztof Tomasiewicz352 Małgorzata Pawłowska353, Anna Piekarska354, Aleksandra Berkan-Kawinska354, Andrzej Horban355, Justyna Kowalska355, Regina Podlasin356, Piotr Wasilewski356, Arsalin Azzadin357, Miroslaw Czuczwar358, Slawomir Czaban359, Paweł Olszewski360, Jacek Bogocz360, Magdalena Ochab360, Anna Kruk360, Sandra Uszok360, Agnieszka Bielska360, Anna Szałkowska361, Justyna Raczkowska361, Gabriela Sokołowska361 , Joanna Chorostowska-Wynimko362, Aleksandra Jezela-Stanek362, Adriana Roży362, Urszula Lechowicz362, Urszula Polowianiuk363, Kamil Grubczak364, Aleksandra Starosz364, Andrzej Eljaszewicz364, Wiktoria Izdebska364, Adam Krętowski365, Robert Flisiak366, Marcin Moniuszko367. Saudi GENOME: Malak Abedalthagafi368, Manal Alaamery369,370,371, Salam Massadeh369,370,371, Mohamed Fawzy368, Hadeel AlBardis368, Nora Aljawini369,370, Moneera Alsuwailm369, Faisal Almalki369, Serghei Mangul372, Junghyun Jung372. QATAR GENOME: Hamdi Mbarek373, Chadi Saad373, Yaser Al-Sarraj373, Wadha Al-Muftah373, Radja Badji373, Asma Al Thani373, Said I. Ismail373.

(1) Med Biotech Hub and Competence Center, Department of Medical Biotechnologies, University of Siena, Italy. (2) Medical Genetics, University of Siena, Italy. (3) University of Siena, DIISM-SAILAB, Siena, Italy. (4) Department of Mathematics, University of Pavia, Pavia, Italy. (5) Genetica Medica, Azienda Ospedaliero-Universitaria Senese, Italy. (6) Istituto di Tecnologie Biomediche – Consiglio Nazionale delle Ricerche, Segrate (MI), Italy. (7) Pharmacogenomics Unit, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy. (8) Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden. (9) Department of Medicine, Clinical Immunology and Infectious Diseases, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy. (10) Models and Algorithms for Artificial Intelligence (MAASAI) Research Group, Université Côte d’Azur, Inria, CNRS, I3S, France. (11) Good Clinical Practice Alliance-Europe (GCPA) and Strategic Initiative for Developing Capacity in Ethical Review (SIDCER), Leuven, Belgium. (12) Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, Québec, Canada. (13) Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, Québec, Canada. (14) Department of Human Genetics, McGill University, Montréal, Québec, Canada. (15) Department of Twin Research, King's College London, London, United Kingdom. (16) Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden. (17) Anaesthesiology and Intensive Care Medicine, Department of Surgical Sciences, Uppsala University, Uppsala, Sweden. (18) Hedenstierna Laboratory, CIRRUS, Anaesthesiology and Intensive Care Medicine, Department of Surgical Sciences, Uppsala University, Uppsala, Sweden. (19) Integrative Physiology, Department of Medical Cell Biology, Uppsala University, Uppsala, Sweden. (20) Institute of Human Genetics, University of Bonn, School of Medicine & University Hospital Bonn, Bonn, Germany. (21) Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, 80336, Germany. (22) Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, 80336, Germany. (23) Institute of Virology, Technical University Munich/Helmholtz Zentrum München, Munich, Germany. (24) MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU, UK. (25) Roslin Institute, University of Edinburgh, Easter Bush, Edinburgh, EH25 9RG, UK. (26) Intensive Care Unit, Royal Infirmary of Edinburgh, 54 Little France Drive, Edinburgh, EH16 5SA, UK. (27) Department of Medical Sciences, Infectious and Tropical Diseases Unit, Azienda Ospedaliera Universitaria Senese, Siena, Italy. (28) Unit of Respiratory Diseases and Lung Transplantation, Department of Internal and Specialist Medicine, University of Siena, Italy. (29) Dept of Emergency and Urgency, Medicine, Surgery and Neurosciences, Unit of Intensive Care Medicine, Siena University Hospital, Italy. (30) Department of Medical, Surgical and Neuro Sciences and Radiological Sciences, Unit of Diagnostic Imaging, University of Siena, Italy. (31) Rheumatology Unit, Department of Medicine, Surgery and Neurosciences, University of Siena, Policlinico Le Scotte, Italy. (32) Department of Specialized and Internal Medicine, Infectious Diseases Unit, San Donato Hospital Arezzo, Italy. (33) Dept of Emergency, Anesthesia Unit, San Donato Hospital, Arezzo, Italy. (34) Cardio-thoraco-neuro-vascular Department, Pneumology Unit and UTIP, San Donato Hospital, Arezzo, Italy. (35) Department of Emergency, Anesthesia Unit, Misericordia Hospital, Grosseto, Italy. (36) Department of Specialized and Internal Medicine, Infectious Diseases Unit, Misericordia Hospital, Grosseto, Italy. (37) Clinical Chemical Analysis Laboratory, Misericordia Hospital, Grosseto, Italy. (38) Dipartimento di Prevenzione, Azienda USL Toscana Sud Est, Italy. (39) Dipartimento Tecnico-Scientifico Territoriale, Azienda USL Toscana Sud Est, Italy. (40) Clinical Chemical Analysis Laboratory, San Donato Hospital, Arezzo, Italy. (41) Chirurgia Vascolare, Ospedale Maggiore di Crema, Italy. (42) Department of Health Sciences, Clinic of Infectious Diseases, ASST Santi Paolo e Carlo, University of Milan, Italy. (43) Division of Infectious Diseases and Immunology, Department of Medical Sciences and Infectious Diseases, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy. (44) Department of Internal Medicine and Therapeutics, University of Pavia, Italy. (45) Department of Anesthesia and Intensive Care, University of Modena and Reggio Emilia, Modena, Italy. (46) Department of Medical and Surgical Sciences for Children and Adults, University of Modena and Reggio Emilia, Modena, Italy. (47) HIV/AIDS Department, National Institute for Infectious Diseases, IRCCS, Lazzaro Spallanzani, Rome, Italy. (48) III Infectious Diseases Unit, ASST-FBF-Sacco, Milan, Italy. (49) Department of Biomedical and Clinical Sciences Luigi Sacco, University of Milan, Milan, Italy. (50) Clinica di Malattie Infettive, Dipartimento di Medicina e Chirurgia, Università di Perugia, Perugia, Italy. (51) Department of Infectious Diseases, Treviso Hospital, Local Health Unit 2 Marca Trevigiana, Treviso, Italy. (52) Clinical Infectious Diseases, Mestre Hospital, Venezia, Italy. (53) Infectious Diseases Clinic, ULSS1, Belluno, Italy. (54) Department of Molecular Medicine, University of Padova, Italy. (55) Department of Infectious and Tropical Diseases, University of Brescia and ASST Spedali Civili Hospital, Brescia, Italy. (56) Department of Molecular and Translational Medicine, University of Brescia, Italy. (57) Clinical Chemistry Laboratory, Cytogenetics and Molecular Genetics Section, Diagnostic Department, ASST Spedali Civili di Brescia, Italy. (58) Medical Genetics and Laboratory of Medical Genetics Unit, A.O.R.N. "Antonio Cardarelli", Naples, Italy. (59) Department of Molecular Medicine and Medical Biotechnology, University of Naples Federico II, Naples, Italy. (60) CEINGE Biotecnologie Avanzate, Naples, Italy. (61) IRCCS SDN, Naples, Italy. (62) Unit of Respiratory Physiopathology, AORN dei Colli, Monaldi Hospital, Naples, Italy. (63) Division of Medical Genetics, Fondazione IRCCS Casa Sollievo della Sofferenza Hospital, San Giovanni Rotondo, Italy. (64) Department of Medical Sciences, Fondazione IRCCS Casa Sollievo della Sofferenza Hospital, San Giovanni Rotondo, Italy. (65) Clinical Trial Office, Fondazione IRCCS Casa Sollievo della Sofferenza Hospital, San Giovanni Rotondo, Italy. (66) Department of Health Sciences, University of Genova, Genova, Italy. (67) Infectious Diseases Clinic, Policlinico San Martino Hospital, IRCCS for Cancer Research Genova, Italy. (68) Microbiology, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University of Medicine, Rome, Italy. (69) Department of Laboratory Sciences and Infectious Diseases, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy. (70) Department of Cardiovascular Diseases, University of Siena, Siena, Italy. (71) Otolaryngology Unit, University of Siena, Italy. (72) Department of Internal Medicine, ASST Valtellina e Alto Lario, Sondrio, Italy. (73) Study Coordinator Oncologia Medica e Ufficio Flussi Sondrio, Italy. (74) First Aid Department, Luigi Curto Hospital, Polla, Salerno, Italy. (75) Department of Pharmaceutical Medicine, Misericordia Hospital, Grosseto, Italy. (76) U.O.C. Laboratorio di Genetica Umana, IRCCS Istituto G. Gaslini, Genova, Italy. (77) Infectious Diseases Clinics, University of Modena and Reggio Emilia, Modena, Italy. (78) Department of Respiratory Diseases, Azienda Ospedaliera di Cremona, Cremona, Italy. (79) U.O.C. Medicina, ASST Nord Milano, Ospedale Bassini, Cinisello Balsamo (MI), Italy. (80) Istituto Auxologico Italiano, IRCCS, Department of Cardiovascular, Neural and Metabolic Sciences, San Luca Hospital, Milan, Italy. (81) Department of Medicine and Surgery, University of Milano-Bicocca, Milan, Italy. (82) Istituto Auxologico Italiano, IRCCS, Center for Cardiac Arrhythmias of Genetic Origin, Milan, Italy. (83) Istituto Auxologico Italiano, IRCCS, Laboratory of Cardiovascular Genetics, Milan, Italy. (84) Member of the European Reference Network for Rare, Low Prevalence and Complex Diseases of the Heart-ERN GUARD-Heart. (85) Independent Data Scientist, Milan, Italy. (86) Scuola Normale Superiore, Pisa, Italy. (87) CNR-Consiglio Nazionale delle Ricerche, Istituto di Biologia e Biotecnologia Agraria (IBBA), Milano, Italy. (88) Unit of Infectious Diseases, ASST Papa Giovanni XXIII Hospital, Bergamo, Italy. (89) Fondazione per la ricerca Ospedale di Bergamo, Bergamo, Italy. (90) Direzione Scientifica, Istituti Clinici Scientifici Maugeri IRCCS, Pavia, Italy. (91) Istituti Clinici Scientifici Maugeri IRCCS, Department of Cardiology, Institute of Montescano, Pavia, Italy. (92) Istituti Clinici Scientifici Maugeri, IRCCS, Department of Cardiac Rehabilitation, Institute of Tradate (VA), Italy. (93) Istituti Clinici Scientifici Maugeri IRCCS, Department of Cardiology, Institute of Milan, Milan, Italy. (94) Core Research Laboratory, ISPRO, Florence, Italy. (95) Health Management, Azienda USL Toscana Sudest, Tuscany, Italy. (96) IRCCS C. Mondino Foundation, Pavia, Italy. (97) Medical Genetics Unit, Meyer Children's University Hospital, Florence, Italy. (98) Department of Medicine, Pneumology Unit, Misericordia Hospital, Grosseto, Italy. (99) Department of Anesthesia and Intensive Care Unit, ASST Fatebenefratelli Sacco, Luigi Sacco Hospital, Polo Universitario, University of Milan, Milan. (100) Infectious Disease Unit, Hospital of Lucca, Italy. (101) Department of Diagnostic and Laboratory Medicine, Unity of Chemistry, Biochemistry and Clinical Molecular Biology, Fondazione Policlinico Universitario A. Gemelli IRCCS, Catholic University of the Sacred Heart, Rome, Italy. (102) Clinic of Infectious Diseases, Catholic University of the Sacred Heart, Rome, Italy. (103) Department of Clinical and Experimental Medicine, Infectious Diseases Unit, University of Pisa, Pisa, Italy. (104) Dept. of Clinical Medicine, Public Health, Life and Environment Sciences, University of L'Aquila, L'Aquila, Italy. (105) Department of Clinical Medicine, Public Health, Life and Environment Sciences, University of L'Aquila, Italy. (106) Anesthesiology and Intensive Care, University of L'Aquila, L'Aquila, Italy. (107) Medical Genetics Unit, Department of Life, Health and Environmental Sciences, University of L'Aquila, L'Aquila, Italy. (108) Infectious Disease Unit, Hospital of Massa, Italy. (109) Infectious Diseases Unit, Santa Maria Annunziata Hospital, USL Centro, Florence, Italy. (110) Respiratory Diseases Unit, "Santa Maria degli Angeli" Hospital, Pordenone, Italy. (111) Department of Electronics, Information and Bioengineering (DEIB), Politecnico di Milano, Milano, Italy. (112) Laboratory of Clinical Pathology and Immunoallergy, Florence-Prato, Italy. (113) Laboratory of Regulatory and Functional Genomics, Fondazione IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo (Foggia), Italy. (114) Department of Molecular Medicine and Medical Biotechnology, University of Naples Federico II, Naples, Italy. (115) University of Pavia, Pavia, Italy. (116) Division of Infectious Diseases I, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy. (117) Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Pavia, Italy. (118) Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Milan, Italy. (119) Department of Medical Sciences, Science for Life Laboratory, Uppsala University, Uppsala, Sweden. (120) Department of Medical Sciences, Clinical Chemistry, Uppsala University, Uppsala, Sweden. (121) Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7FZ, UK. (122) Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK. (123) UCL Centre for Human Health and Performance, London, W1T 7HA, UK. (124) Department of Critical Care Medicine, Queen’s University and Kingston Health Sciences Centre, Kingston, ON, Canada. (125) Department of Anaesthesia and Intensive Care, The Chinese University of Hong Kong, Prince of Wales Hospital, Hong Kong, China. (126) Clinical Research Centre at St Vincent’s University Hospital, University College Dublin, Dublin, Ireland. (127) Department of Medicine, University of Cambridge, Cambridge, UK. (128) William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK. (129) NIHR Health Protection Research Unit for Emerging and Zoonotic Infections, Institute of Infection, Veterinary and Ecological Sciences University of Liverpool, Liverpool, L69 7BE, UK. (130) Respiratory Medicine, Alder Hey Children’s Hospital, Institute in The Park, University of Liverpool, Alder Hey Children’s Hospital, Liverpool, UK. (131) National Heart and Lung Institute, Imperial College London, London, UK. (132) Imperial College Healthcare NHS Trust: London, London, UK. (133) Department of Intensive Care Medicine, Guy’s and St. Thomas NHS Foundation Trust, London, UK. (134) MRC-University of Glasgow Centre for Virus Research, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK. (135) Wellcome-Wolfson Institute for Experimental Medicine, Queen’s University Belfast, Belfast, Northern Ireland, UK. (136) Department of Intensive Care Medicine, Royal Victoria Hospital, Belfast, Northern Ireland, UK. (137) Intensive Care National Audit & Research Centre, London, UK. (138) Edinburgh Clinical Research Facility, Western General Hospital, University of Edinburgh, EH4 2XU, UK. (139) Genomics England, London, UK. (140) Centre for Inflammation Research, The Queen’s Medical Research Institute, University of Edinburgh, 47 Little France Crescent, Edinburgh, UK. (141) Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU, UK. (142) Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia. (143) Biostatistics Group, School of Life Sciences, Sun Yat-sen University, Guangzhou, China. (144) School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China. (145) Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China. (146) Centre for Global Health Research, Usher Institute of Population Health Sciences and Informatics, Teviot Place, Edinburgh EH8 9AG, UK. (147) Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden. (148) Barts Health NHS Trust, London, UK. (149) Guys and St Thomas’ Hospital, London, UK. (150) James Cook University Hospital, Middlesburgh, UK. (151) The Royal Liverpool University Hospital, Liverpool, UK. (152) King’s College Hospital, London, UK. (153) Royal Infirmary of Edinburgh, Edinburgh, UK. (154) John Radcliffe Hospital, Oxford, UK. (155) Addenbrooke’s Hospital, Cambridge, UK. (156) Morriston Hospital, Swansea, UK. (157) Ashford and St Peter’s Hospital, Surrey, UK. (158) Royal Stoke University Hospital, Staffordshire, UK. (159) Queen Elizabeth Hospital, Birmingham, UK. (160) Glasgow Royal Infirmary, Glasgow, UK. (161) Kingston Hospital, Surrey, UK. (162) The Tunbridge Wells Hospital and Maidstone Hospital, Kent, UK. (163) North Middlesex University Hospital NHS trust, London, UK. (164) Bradford Royal Infirmary, Bradford, UK. (165) Blackpool Victoria Hospital, Blackpool, UK. (166) Countess of Chester Hospital, Chester, UK. (167) Wythenshawe Hospital, Manchester, UK. (168) St George’s Hospital, London, UK. (169) Good Hope Hospital, Birmingham, UK. (170) Stepping Hill Hospital, Stockport, UK. (171) Manchester Royal Infirmary, Manchester, UK. (172) Royal Alexandra Hospital, Paisley, UK. (173) Queen Elizabeth University Hospital, Glasgow, UK. (174) Queen Alexandra Hospital, Portsmouth, UK. (175) BHRUT (Barking Havering)—Queens Hospital and King George Hospital, Essex, UK. (176) University College Hospital, London, UK. (177) Royal Victoria Infirmary, Newcastle Upon Tyne, UK. (178) Western Sussex Hospitals, West Sussex, UK. (179) Salford Royal Hospital, Manchester, UK. (180) The Royal Oldham Hospital, Manchester, UK. (181) Pinderfields General Hospital, Wakefield, UK. (182) Basildon Hospital, Basildon, UK. (183) University Hospital of Wales, Cardiff, UK. (184) Broomfield Hospital, Chelmsford, UK. (185) Royal Brompton Hospital, London, UK. (186) Nottingham University Hospital, Nottingham, UK. (187) Royal Hallamshire Hospital and Northern General Hospital, Sheffield, UK. (188) Royal Hampshire County Hospital, Hampshire, UK. (189) Queens Hospital Burton, Burton-On-Trent, UK. (190) New Cross Hospital, Wolverhampton, UK. (191) Heartlands Hospital, Birmingham, UK. (192) Walsall Manor Hospital, Walsall, UK. (193) Stoke Mandeville Hospital, Buckinghamshire, UK. (194) Sandwell General Hospital, Birmingham, UK. (195) Royal Berkshire NHS Foundation Trust, Berkshire, UK. (196) Charing Cross Hospital, St Mary’s Hospital and Hammersmith Hospital, London, UK. (197) Dumfries and Galloway Royal Infirmary, Dumfries, UK. (198) Bristol Royal Infirmary, Bristol, UK. (199) Royal Sussex County Hospital, Brighton, UK. (200) Whiston Hospital, Prescot, UK. (201) Royal Glamorgan Hospital, Cardiff, UK. (223) King’s Mill Hospital, Nottingham, UK. (203) Fairfield General Hospital, Bury, UK. (204) Western General Hospital, Edinburgh, UK. (205) Northwick Park Hospital, London, UK. (206) Royal Preston Hospital, Preston, UK. (207) Royal Derby Hospital, Derby, UK. (208) Sunderland Royal Hospital, Sunderland, UK. (209) Royal Surrey County Hospital, Guildford, UK. (210) Derriford Hospital, Plymouth, UK. (211) Croydon University Hospital, Croydon, UK. (212) Victoria Hospital, Kirkcaldy, UK. (213) Milton Keynes University Hospital, Milton Keynes, UK. (214) Barnsley Hospital, Barnsley, UK. (215) York Hospital, York, UK. (216) University Hospital of North Tees, Stockton on Tees, UK. (217) University Hospital Wishaw, Wishaw, UK. (218) Whittington Hospital, London, UK. (219) Southmead Hospital, Bristol, UK. (220) The Royal Papworth Hospital, Cambridge, UK. (221) Royal Gwent Hospital, Newport, UK. (222) Norfolk and Norwich University hospital (NNUH), Norwich, UK. (223) Great Ormond St Hospital and UCL Great Ormond St Institute of Child Health NIHR Biomedical Research Centre, London, UK. (224) Airedale General Hospital, Keighley, UK. (225) Aberdeen Royal Infirmary, Aberdeen, UK. (226) Southampton General Hospital, Southampton, UK. (227) Russell’s Hall Hospital, Dudley, UK. (228) Rotherham General Hospital, Rotherham, UK. (229) North Manchester General Hospital, Manchester, UK. (230) Basingstoke and North Hampshire Hospital, Basingstoke, UK. (231) Royal Free Hospital, London, UK. (232) Hull Royal Infirmary, Hull, UK. (233) Harefield Hospital, London, UK. (234) Chesterfield Royal Hospital Foundation Trust, Chesterfield, UK. (235) Barnet Hospital, London, UK. (236) Aintree University Hospital, Liverpool, UK. (237) St James’s University Hospital and Leeds General Infirmary, Leeds, UK. (238) Glan Clwyd Hospital, Bodelwyddan, UK. (239) University Hospital Crosshouse, Kilmarnock, UK. (240) Royal Bolton Hospital, Bolton, UK. (241) Princess of Wales Hospital, Llantrisant, UK. (242) Pilgrim Hospital, Lincoln, UK. (243) Northumbria Healthcare NHS Foundation Trust, North Shields, UK. (244) Ninewells Hospital, Dundee, UK. (245) Lister Hospital, Stevenage, UK. (246) Bedford Hospital, Bedford, UK. (247) Royal United Hospital, Bath, UK. (248) Royal Bournemouth Hospital, Bournemouth, UK. (249) The Great Western Hospital, Swindon, UK. (250) Watford General Hospital, Watford, UK. (251) University Hospital North Durham, Darlington, UK. (252) Tameside General Hospital, Ashton Under Lyne, UK. (253) Princess Royal Hospital Shrewsbury and Royal Shrewsbury Hospital, Shrewsbury, UK. (254) Arrowe Park Hospital, Wirral, UK. (255) The Queen Elizabeth Hospital, King’s Lynn, UK. (256) Royal Blackburn Teaching Hospital, Blackburn, UK. (257) Poole Hospital, Poole, UK. (258) Medway Maritime Hospital, Gillingham, UK. (259) Warwick Hospital, Warwick, UK. (260) The Royal Marsden Hospital, London, UK. (261) The Princess Alexandra Hospital, Harlow, UK. (262) Musgrove Park Hospital, Taunton, UK. (263) George Eliot Hospital NHS Trust, Nuneaton, UK. (264) East Surrey Hospital, Redhill, UK. (265) West Middlesex Hospital, Isleworth, UK. (266) Warrington General Hospital, Warrington, UK. (267) Southport and Formby District General Hospital, Ormskirk, UK. (268) Royal Devon and Exeter Hospital, Exeter, UK. (269) Macclesfield District General Hospital, Macclesfield, UK. (270) Borders General Hospital, Melrose, UK. (271) Birmingham Children’s Hospital, Birmingham, UK. (272) William Harvey Hospital, Ashford, UK. (273) Royal Lancaster Infirmary, Lancaster, UK. (274) Queen Elizabeth the Queen Mother Hospital, Margate, UK. (275) Liverpool Heart and Chest Hospital, Liverpool, UK. (276) Darlington Memorial Hospital, Darlington, UK. (277) Southend University Hospital, Westcliff-on-Sea, UK. (278) Raigmore Hospital, Inverness, UK. (279) Salisbury District Hospital, Salisbury, UK. (280) Peterborough City Hospital, Peterborough, UK. (281) Ipswich Hospital, Ipswich, UK. (282) Hereford County Hospital, Worcester, UK. (283) Furness General Hospital, Barrow-in-Furness, UK. (284) Forth Valley Royal Hospital, Falkirk, UK. (285) Torbay Hospital, Torquay, UK. (286) St Mary’s Hospital, Newport, UK. (287) Royal Manchester Children’s Hospital, Manchester, UK. (288) Royal Cornwall Hospital, Truro, UK. (289) Queen Elizabeth Hospital Gateshead, Gateshead, UK. (290) Kent & Canterbury Hospital, Canterbury, UK. (291) James Paget University Hospital NHS Trust, Great Yarmouth, UK. (292) Darent Valley Hospital, Dartford, UK. (293) The Alexandra Hospital, Redditch and Worcester Royal Hospital, Worcester, UK. (294) Ysbyty Gwynedd, Bangor, UK. (295) Yeovil Hospital, Yeovil, UK. (296) University Hospital Hairmyres, East Kilbride, UK. (297) Scunthorpe General Hospital, Scunthorpe, UK. (298) Princess Royal Hospital Brighton, West Sussex, UK. (299) Lincoln County Hospital, Lincoln, UK. (300) Homerton University Hospital, London, UK. (301) Glangwili General Hospital, Camarthen, UK. (303) Ealing Hospital, Southall, UK. (303) Scarborough General Hospital, Scarborough, UK. (304) Royal Albert Edward Infirmary, Wigan, UK. (305) Queen Elizabeth Hospital, Woolwich, London, UK. (306) North Devon District Hospital, Barnstaple, UK. (307) National Hospital for Neurology and Neurosurgery, London, UK. (308) Eastbourne District General Hospital, East Sussex, UK and Conquest Hospital, East Sussex, UK. (309) Diana Princess of Wales Hospital, Grimsby, UK. (310) The Christie NHS Foundation Trust, Manchester, UK. (311) Prince Philip Hospital, Lianelli, UK. (312) Prince Charles Hospital, Merthyr Tydfil, UK. (313) Golden Jubilee National Hospital, Clydebank, UK. (314) Dorset County Hospital, Dorchester, UK. (315) Calderdale Royal Hospital, Halifax, UK. (316) West Suffolk Hospital, Suffolk, UK. (317) West Cumberland Hospital, Whitehaven, UK. (318) University Hospital Lewisham, London, UK. (319) St John’s Hospital Livingston, Livingston, UK. (320) Sheffield Children’s Hospital, Sheffield, UK. (321) Hinchingbrooke Hospital, Huntingdon, UK. (322) Glenfield Hospital, Leicester, UK. (323) Bronglais General Hospital, Aberystwyth, UK. (324) Alder Hey Children’s Hospital, Liverpool, UK. (325) University Hospital Monklands, Airdrie, UK. (326) Cumberland Infirmary, Carlisle, UK. (327) Department of Gastroenterology, Hepatology and Infectious Diseases, University Hospital Duesseldorf, Medical Faculty Heinrich Heine University, Duesseldorf, Germany. (328) Institute of Genomic Statistics and Bioinformatics, University Hospital Bonn, Medical Faculty University of Bonn, Venusberg-Campus 1, Bonn Germany. (329) Department of Anesthesiology and Intensive Care Medicine, University Hospital Essen, University Duisburg-Essen, Essen, Germany. (330) Department of Child and Adolescent Psychiatry, University Hospital Essen, University of Duisburg-Essen, Essen, Germany. (331) Department of Infectious Diseases, University Hospital Essen, University Duisburg-Essen, Essen, Germany. (332) Department of Internal Medicine V—Pneumology, Allergology and Intensive Care Medicine, University Hospital Saarland, Homburg/Saar, Germany. (333) Center of Human and Molecular Biology, Department of Human Genetics, University Hospital Saarland, Homburg/Saar, Germany. (334) Department of Genetics & Epigenetics, Saarland University, Saarbrücken, Germany. (335) Department of Internal Medicine II, School of Medicine, University Hospital Rechts der Isar, Technical University of Munich, Munich, Germany. (338) German Center for Infection Research (DZIF), Partner Site Munich, Munich, Germany. (339) Department of Hematology, Oncology, Immunology, Palliative Medicine, Infectious Diseases and Tropical Medicine, Munich Clinic Schwabing, Academic Teaching Hospital (LMU), Kölner Platz 1, 80804, Munich, Germany. (338) Institute of Clinical Chemistry and Pathobiochemistry, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany. (339) Institute of Virology, School of Medicine, Technical University of Munich/Helmholtz Zentrum München, 81675 Munich, Germany; DZIF, partner sites Munich and Cologne/Bonn, Germany. (340) Institute of Medical Genetics and Applied Genomics, The University of Tübingen, Germany; NGS Competence Center Tübingen, The University of Tübingen, Germany. (341) Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany. (342) Cologne Center for Genomics (CCG), University of Cologne and University Hospital Cologne, Cologne, Germany. (343) Centre for Bioinformatics and Data Analysis, Medical University of Bialystok, Bialystok, Poland. (344) Clinical Research Centre, Medical University of Bialystok, Bialystok, Poland. (345) Department of Infectious Diseases in Bytom, Medical University of Silesia, Silesia, Poland. (346) Department of Infectious Diseases, Megrez Hospital in Tychy, Tychy, Poland. (347) Department of Infectious Diseases and Hepatology, Medical University of Bialystok, Bialystok, Poland. (348) Department of Infectious Diseases and Neuroinfection, Medical University of Bialystok, Biastolyk, Poland. (349) Department of Infectious Diseases, Jan Kochanowski University, Kielce, Poland. (350) Department of Infectious Diseases and Hepatology, Wroclaw Medical University, Wrocław, Poland. (351) Department of Infectious Diseases and Hepatology, Poznan University of Medical Sciences, Poznan, Poland. (352) Department of Infectious Diseases, Medical University of Lublin, Lublin, Poland. (353) Department of Infectious Diseases and Hepatology, Nicolaus Copernicus University, Toruń, Poland. (354) Department of Infectious Diseases and Hepatology, Medical University of Lodz, Lodz, Poland. (355) Department of Adult Infectious Diseases, Warsaw Medical University, Warsaw, Poland. (356) IV-th Department, Hospital for Infectious Diseases, Poland. (357) District Hospital in Bielsk Podlaski, Poland. (358) Department of Anaesthesiology and Intensive Therapy, Medical University of Lublin, Lublin, Poland. (359) Department of Anaesthesiology and Intensive Therapy, Medical University of Bialystok, Bialystok, Poland. (360) IMAGENE.ME SA, Bialystok, Poland. (361) Clinical Research Centre, Medical University of Bialystok, Biastolyk, Poland. (362) Department of Genetics and Clinical Immunology, National Institute of Tuberculosis and Lung Diseases in Warsaw, Warsaw, Poland. (363) District Sanitary Inspectorate in Bialystok, Biastolyk, Poland. (364) Department of Regenerative Medicine and Immune Regulation, Medical University of Bialystok, Biastolyk, Poland. (365) Clinical Research Centre, Medical University of Bialystok, Bialystok, Poland. (366) Department of Infectious Diseases and Hepatology, Medical University of Bialystok, Bialystok, Poland. (367) Department of Allergology and Internal Medicine; Department of Regenerative Medicine and Immune Regulation, Medical University of Bialystok, Bialystok, Poland. (368) Genomics Research Department, Saudi Human Genome Project, King Fahad Medical City and King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia. (369) Developmental Medicine Department, King Abdullah International Medical Research Center, King Saud Bin Abdulaziz University for Health Sciences, King Abdulaziz Medical City, Ministry of National Guard- Health Affairs. (370) KACST-BWH Centre of Excellence for Biomedicine, Joint Centers of Excellence Program, King Abdulaziz City for Science and Technology (KACST), Riyadh, Saudi Arabia. (371) King Abdulaziz City for Science and Technology (KACST)-Saudi Human Genome Satellite Lab at Abdulaziz Medical City, Ministry of National Guard Health Affairs (MNGHA), Riyadh, Saudi Arabia. (372) Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, California, USA. (373) Qatar Genome Program, Qatar Foundation Research, Development and Innovation, Qatar Foundation, Doha, Qatar.

Author contributions

Conceptualization: FM, MF, SF, and AR; data curation: IM, SC, MP, ML, and GB; formal analysis, NP, EB, CF, SD, FV, MB, FF, KZ, MT, FC, FM, EC, MF, CG, EF, GEN-COVID Multicenter Study, MG, AR and SF; funding acquisition: AR and FM; methodology, NP, EB, MT, FC, EC, MF, CG, SM, SA, MB and SF; project administration: FM and AR; supervision: FM, GEN-COVID Multicenter Study and AR; validation: SF, GB-L, BR, HZ, ML, MH, KUL, ECS, EP-C, MC, LM, AK, SW, JKB, AS, RF; writing—original draft: NP, EB, CF, SD, MB, FF, Kristina Zguro, Sara Amitrano, Mirella Bruttini, Maria Palmieri, Mirjam Lista, Giada Beligni, Floriana Valentino, Susanna Croci, Ilaria Meloni, Francis P. Crawley, Marco Tanfoni, Francesca Colombo, Enrico Cabri, Maddalena Fratelli, Chiara Gabbi, Elisa Frullanti, Francesca Mari, GEN-COVID Multicenter Study, Marco Gori, Alessandra Renieri and Simone Furini.

Funding

MIUR project “Dipartimenti di Eccellenza 2018-2020” to Department of Medical Biotechnologies University of Siena, Italy (Italian D.L. n.18 March 17, 2020). Private donors for COVID-19 research. “Bando Ricerca COVID-19 Toscana” project to Azienda Ospedaliero-Universitaria Senese. Charity fund 2020 from Intesa San Paolo dedicated to the project/ N. B/2020/0119 “Identificazione delle basi genetiche determinanti la variabilità clinica della risposta a COVID-19 nella popolazione italiana”. The Italian Ministry of University and Research for funding within the “Bando FISR 2020” in COVID-19 and the Istituto Buddista Italiano Soka Gakkai with the 8x1000 funds for funding the project “PAT-COVID: Host genetics and pathogenetic mechanisms of COVID-19” (ID n. 2020-2016_RIC_3). GenOMICC was funded by Sepsis Research (the Fiona Elizabeth Agnew Trust), the Intensive Care Society, a Wellcome-Beit Prize award to J. K. Baillie (Wellcome Trust 103258/Z/13/A) and a BBSRC Institute Program Support Grant to the Roslin Institute (BBS/E/D/20002172, BBS/E/D/10002070 and BBS/E/D/30002275). The Richards research group is supported by the Canadian Institutes of Health Research (CIHR: 365825; 409511, 100558), the McGill Interdisciplinary Initiative in Infection and Immunity (MI4), the Lady Davis Institute of the Jewish General Hospital, the Jewish General Hospital Foundation, the Canadian Foundation for Innovation, the NIH Foundation, Cancer Research UK, Genome Québec, the Public Health Agency of Canada, McGill University, Cancer Research UK [Grant number C18281/A29019] and the Fonds de Recherche Québec Santé (FRQS). JBR is supported by a FRQS Mérite Clinical Research Scholarship. Support from Calcul Québec and Compute Canada is acknowledged. TwinsUK is funded by the Welcome Trust, Medical Research Council, European Union, the National Institute for Health Research (NIHR)-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust in partnership with King’s College London. These funding agencies had no role in the design, implementation or interpretation of this study. JBR has served as an advisor to GlaxoSmithKline and Deerfield Capital. His institution has received investigator-initiated grant funding from Eli Lilly, GlaxoSmithKline and Biogen for projects unrelated to this research. He is the founder of 5 Prime Sciences. SweCovid received funding from the European Union’s Horizon 2020 research and innovation program under Grant agreement No 824110, the SciLifeLab/KAW national Covid-19 research program project grants to MH (KAW 2020.0182 and KAW 2020.0241) and the Swedish Research Council Grants to RF (2014-02569 and 2014-07606). Sequencing was performed by the SNP&SEQ Technology Platform in Uppsala. The facility is part of the National Genomics Infrastructure (NGI) Sweden and Science for Life Laboratory. The SNP&SEQ Platform is supported by the Swedish Research Council and the Knut and Alice Wallenberg Foundation.

Declarations

Conflict of interest

The authors declare no conflict interests.

Informed consent

The patients were informed of this research and agreed to it through the informed consent process.

Institutional review board statement

The study was conducted according to the guidelines of the Declaration of Helsinki. The GEN-COVID is a multicentre academic observational study that was approved by the Internal Review Boards (IRB) of each participating centre (protocol code 16917, dated March 16, 2020 for GEN-COVID at the University Hospital of Siena). BQC19 received ethical approval from the Jewish General Hospital (JGH) research ethics board (2020-2137). The Swedish part of the study was performed at the general intensive care unit (ICU) of Uppsala University Hospital, Sweden (a tertiary hospital) and approved by the National Ethical Review Agency (No. 2020-01623). The Declaration of Helsinki and its subsequent revisions were followed. DeCOI received ethical approval by the Ethical Review Board (ERB) of the participating hospitals/centres for CoMRI (Technical University Munich, Munich, Germany) and BoSCO (Medical Faculty Bonn, Bonn, Germany; Medical Board of the Saarland, Germany; University Duisburg-Essen, Germany; Medical Faculty Duesseldorf, Düsseldorf, Germany). GenOMICC and ISARIC4C were approved by the appropriate research ethics committees (Scotland, 15/SS/0110; England, Wales and Northern Ireland, 19/WM/0247).

Data availability and data sharing statement

The data and samples referenced here are housed in the GEN-COVID Patient Registry and the GEN-COVID Biobank and are available for consultation. You may contact the corresponding author, Prof. Alessandra Renieri (e-mail: alessandra.renieri@unisi.it).

Footnotes

The members of WES/WGS Working Group Within the HGI, GenOMICC Consortium, GEN-COVID Multicenter Study are listed in Acknowledgements.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Co-first authors: Chiara Fallerini, Nicola Picchiotti, Margherita Baldassarri.

Co-last authors: Francesca Mari, Alessandra Renieri, Simone Furini.

Contributor Information

Alessandra Renieri, Email: alessandra.renieri@unisi.it.

GEN-COVID Multicenter Study:

Simone Furini, Francesca Montagnani, Mario Tumbarello, Ilaria Rancan, Massimiliano Fabbiani, Barbara Rossetti, Laura Bergantini, Miriana D’Alessandro, Paolo Cameli, David Bennett, Federico Anedda, Simona Marcantonio, Sabino Scolletta, Federico Franchi, Maria Antonietta Mazzei, Susanna Guerrini, Edoardo Conticini, Luca Cantarini, Bruno Frediani, Danilo Tacconi, Chiara Spertilli Raffaelli, Marco Feri, Alice Donati, Raffaele Scala, Luca Guidelli, Genni Spargi, Marta Corridi, Cesira Nencioni, Leonardo Croci, Gian Piero Caldarelli, Maurizio Spagnesi, Davide Romani, Paolo Piacentini, Maria Bandini, Elena Desanctis, Silvia Cappelli, Anna Canaccini, Agnese Verzuri, Valentina Anemoli, Manola Pisani, Agostino Ognibene, Alessandro Pancrazzi, Maria Lorubbio, Massimo Vaghi, Antonella D.’Arminio Monforte, Federica Gaia Miraglia, Mario U. Mondelli, Massimo Girardis, Sophie Venturelli, Stefano Busani, Andrea Cossarizza, Andrea Antinori, Alessandra Vergori, Arianna Emiliozzi, Stefano Rusconi, Matteo Siano, Arianna Gabrieli, Agostino Riva, Daniela Francisci, Elisabetta Schiaroli, Francesco Paciosi, Andrea Tommasi, Pier Giorgio Scotton, Francesca Andretta, Sandro Panese, Stefano Baratti, Renzo Scaggiante, Francesca Gatti, Saverio Giuseppe Parisi, Francesco Castelli, Eugenia Quiros-Roldan, Melania Degli Antoni, Isabella Zanella, Matteo Della Monica, Carmelo Piscopo, Mario Capasso, Roberta Russo, Immacolata Andolfo, Achille Iolascon, Giuseppe Fiorentino, Massimo Carella, Marco Castori, Filippo Aucella, Pamela Raggi, Rita Perna, Matteo Bassetti, Antonio Di Biagio, Maurizio Sanguinetti, Luca Masucci, Alessandra Guarnaccia, Serafina Valente, Oreste De Vivo, Gabriella Doddato, Rossella Tita, Annarita Giliberti, Maria Antonietta Mencarelli, Caterina Lo Rizzo, Anna Maria Pinto, Valentina Perticaroli, Francesca Ariani, Miriam Lucia Carriero, Laura Di Sarno, Diana Alaverdian, Elena Bargagli, Marco Mandalà, Alessia Giorli, Lorenzo Salerni, Patrizia Zucchi, Pierpaolo Parravicini, Elisabetta Menatti, Tullio Trotta, Ferdinando Giannattasio, Gabriella Coiro, Fabio Lena, Leonardo Gianluca Lacerenza, Domenico A. Coviello, Cristina Mussini, Enrico Martinelli, Sandro Mancarella, Luisa Tavecchia, Mary Ann Belli, Lia Crotti, Gianfranco Parati, Maurizio Sanarico, Francesco Raimondi, Filippo Biscarini, Alessandra Stella, Marco Rizzi, Franco Maggiolo, Diego Ripamonti, Claudia Suardi, Tiziana Bachetti, Maria Teresa La Rovere, Simona Sarzi-Braga, Maurizio Bussotti, Katia Capitani, Simona Dei, Sabrina Ravaglia, Rosangela Artuso, Elena Andreucci, Giulia Gori, Angelica Pagliazzi, Erika Fiorentini, Antonio Perrella, Francesco Bianchi, Paola Bergomi, Emanuele Catena, Riccardo Colombo, Sauro Luchi, Giovanna Morelli, Paola Petrocelli, Sarah Iacopini, Sara Modica, Silvia Baroni, Francesco Vladimiro Segala, Francesco Menichetti, Marco Falcone, Giusy Tiseo, Chiara Barbieri, Tommaso Matucci, Davide Grassi, Claudio Ferri, Franco Marinangeli, Francesco Brancati, Antonella Vincenti, Valentina Borgo, Lombardi Stefania, Mirco Lenzi, Massimo Antonio Di Pietro, Francesca Vichi, Benedetta Romanin, Letizia Attala, Cecilia Costa, Andrea Gabbuti, Menè Roberto, Umberto Zuccon, Lucia Vietri, Stefano Ceri, Pietro Pinoli, Patrizia Casprini, Giuseppe Merla, Gabriella Maria Squeo, Marcello Maffezzoni, Raffaele Bruno, Marco Vecchia, Marta Colaneri, Serena Ludovisi, Yanara Marincevic-Zuniga, Jessica Nordlund, Tomas Luther, Anders Larsson, Katja Hanslin Anna Gradin, Sarah Galien, Sara Bulow Anderberg, Jacob Rosén, Sten Rubertsson, Hugo Zeberg, Robert Frithiof, Miklós Lipcsey, Michael Hultström, Sara Clohisey Peter Horby, Johnny Millar, Julian Knight, Hugh Montgomery, David Maslove, Lowell Ling, Alistair Nichol, Charlotte Summers, Tim Walsh, Charles Hinds, Malcolm G. Semple, Peter J. M. Openshaw, Manu Shankar-Hari, Antonia Ho, Danny McAuley, Chris Ponting, Kathy Rowan, J. Kenneth Baillie, Fiona Griffiths, Wilna Oosthuyzen, Jen Meikle, Paul Finernan, James Furniss, Ellie Mcmaster, Andy Law, Sara Clohisey, J. Kenneth Baillie, Trevor Paterson, Tony Wackett, Ruth Armstrong, Lee Murphy, Angie Fawkes, Richard Clark, Audrey Coutts, Lorna Donnelly, Tammy Gilchrist, Katarzyna Hafezi, Louise Macgillivray, Alan Maclean, Sarah McCafferty, Kirstie Morrice, Jane Weaver, Ceilia Boz, Ailsa Golightly, Mari Ward, Hanning Mal, Helen Szoor-McElhinney, Adam Brown, Ross Hendry, Andrew Stenhouse, Louise Cullum, Dawn Law, Sarah Law, Rachel Law, Max Head Fourman, Maaike Swets, Nicky Day, Filip Taneski, Esther Duncan, Marie Zechner, Nicholas Parkinson, Erola Pairo-Castineira, Sara Clohisey, Lucija Klaric, Andrew D. Bretherick, Konrad Rawlik, Dorota Pasko, Susan Walker, Nick Parkinson, Max Head Fourman, Clark D. Russell, James Furniss, Anne Richmond, Elvina Gountouna, David Harrison, Bo Wang, Yang Wu, Alison Meynert, Athanasios Kousathanas, Loukas Moutsianas, Zhijian Yang, Ranran Zhai, Chenqing Zheng, Graeme Grimes, Jonathan Millar, Barbara Shih, Marie Zechner, Jian Yang, Xia Shen, Chris P. Ponting, Albert Tenesa, Kathy Rowan, Andrew Law, Veronique Vitart, James F. Wilson, J. Kenneth Baillie, D. Collier, S. Wood, A. Zak, C. Borra, M. Matharu, P. May, Z. Alldis, O. Mitchelmore, R. Bowles, A. Easthorpe, F. Bibi, I. Lancoma-Malcolm, J. Gurasashvili, J. Pheby, J. Shiel, M. Bolton, M. Patel, M. Taylor, O. Zongo, P. Ebano, P. Harding, R. Astin-Chamberlain, Y. Choudhury, A. Cox, D. Kallon, M. Burton, R. Hall, S. Blowes, Z. Prime, J. Biddle, O. Prysyazhna, T. Newman, C. Tierney, J. Kassam, M. Shankar-Hari, M. Ostermann, S. Campos, A. Bociek, R. Lim, N. Grau, T. O. Jones, C. Whitton, M. Marotti, G. Arbane, S. Bonner, K. Hugill, J. Reid, I. Welters, V. Waugh, K. Williams, D. Shaw, J. Fernandez Roman, M. Lopez Martinez, E. Johnson, A. Waite, B. Johnson, O. Hamilton, S. Mulla, M. McPhail, J. Smith, J. K. Baillie, L. Barclay, D. Hope, C. McCulloch, L. McQuillan, S. Clark, J. Singleton, K. Priestley, N. Rea, M. Callaghan, R. Campbell, G. Andrew, L. Marshall, S. McKechnie, P. Hutton, A. Bashyal, N. Davidson, C. Summers, P. Polgarova, K. Stroud, N. Pathan, K. Elston, S. Agrawal, C. Battle, L. Newey, T. Rees, R. Harford, E. Brinkworth, M. Williams, C. Murphy, I. White, M. Croft, N. Bandla, M. Gellamucho, J. Tomlinson, H. Turner, M. Davies, A. Quinn, I. Hussain, C. Thompson, H. Parker, R. Bradley, R. Griffiths, J. Scriven, J. Gill, A. Puxty, S. Cathcart, D. Salutous, L. Turner, K. Duffy, K. Puxty, A. Joseph, R. Herdman-Grant, R. Simms, A. Swain, A. Naranjo, R. Crowe, K. Sollesta, A. Loveridge, D. Baptista, E. Morino, M. Davey, D. Golden, J. Jones, J. Moreno Cuesta, A. Haldeos, D. Bakthavatsalam, R. Vincent, M. Elhassan, K. Xavier, A. Ganesan, D. Purohit, M. Abdelrazik, J. Morgan, L. Akeroyd, S. Bano, D. Warren, M. Bromley, K. Sellick, L. Gurr, B. Wilkinson, V. Nagarajan, P. Szedlak, J. Cupitt, E. Stoddard, L. Benham, S. Preston, N. Slawson, Z. Bradshaw, J. brown, M. Caswell, SMelling, P. Bamford, M. Faulkner, K. Cawley, H. Jeffrey, E. London, H. Sainsbury, I. Nagra, F. Nasir, Ce Dunmore, R. Jones, A. Abraheem, M. Al-Moasseb, R. Girach, C. Brantwood, P. Alexander, J. Bradley-Potts, S. Allen, T. Felton, S. Manna, S. Farnell-Ward, S. Leaver, J. Queiroz, E. Maccacari, D. Dawson, C. Castro Delgado, R. Pepermans Saluzzio, O. Ezeobu, L. Ding, C. Sicat, R. Kanu, G. Durrant, J. Texeira, A. Harrison, T. Samakomva, J. Scriven, H. Willis, B. Hopkins, L. Thrasyvoulou, M. Jackson, A. Zaki, C. Tibke, S. Bennett, W. Woodyatt, A. Kent, E. Goodwin, C. Brandwood, R. Clark, L. Smith, K. Rooney, N. Thomson, N. Rodden, E. Hughes, D. McGlynn, C. Clark, P. Clark, L. Abel, R. Sundaram, L. Gemmell, M. Brett, J. Hornsby, P. MacGoey, R. Price, B. Digby, P. O’Neil, P. McConnell, P. Henderson, S. Henderson, M. Sim, S. Kennedy-Hay, C. McParland, L. Rooney, N. Baxter, D. Pogson, S. Rose, Z. Daly, L. Brimfield, M. K. Phull, M. Hussain, T. Pogreban, L. Rosaroso, E. Salciute L. Grauslyte, D. Brealey, E. Wraith, N. MacCallum, G. Bercades, I. Hass, D. Smyth, A. Reyes, G. Martir, I. D. Clement, K. Webster, C. Hays, A. Gulati, L. Hodgson, M. Margarson, R. Gomez, Y. Baird, Y. Thirlwall, L. Folkes, A. Butler, E. Meadows, S. Moore, D. Raynard, H. Fox, L. Riddles, K. King, S. Kimber, G. Hobden, A. McCarthy, V. Cannons, I. Balagosa, I. Chadbourn, A. Gardner, D. Horner, D. McLaughlanv, B. Charles, N. Proudfoot, T. Marsden, L. Mc Morrow, B. Blackledge, J. Pendlebury, A. Harvey, E. Apetri, C. Basikolo, L. Catlow, R. Doonan, K. Knowles, S. Lee, D. Lomas, C. Lyons, J. Perez, M. Poulaka, M. Slaughter, K. Slevin, M. Taylor, V. Thomas, D. Walker, J. Harris, A. Drummond, R. Tully, J. Dearden, J. Philbin, S. Munt, C. Rishton, G. O’Connor, M. Mulcahy, E. Dobson, J. Cuttler, M. Edward, A. Rose, B. Sloan, S. Buckley, H. Brooke, E. Smithson, R. Charlesworth, R. Sandu, M. Thirumaran, V. Wagstaff, J. Cebrian Suarez, A. Kaliappan, M. Vertue, A. Nicholson, J. Riches, A. Solesbury, L. Kittridge, M. Forsey, G. Maloney, J. Cole, M. Davies, R. Davies, H. Hill, E. Thomas, A. Williams, D. Duffin, B. Player, J. Radhakrishnan, S. Gibson, A. Lyle, F. McNeela, B. Patel, M. Gummadi, G. Sloane, N. Dormand, S. Salmi, Z. Farzad, D. Cristiano, K. Liyanage, V. Thwaites, M. Varghese, M. Meredith, G. Mills, J. Willson, K. Harrington, B. Lenagh, K. Cawthron, S. Masuko, A. Raithatha, K. Bauchmuller, N. Ahmad, J. Barker, Y. Jackson, F. Kibutu, S. Bird, G. Watson, J. Martin, E. Bevan, C. Wrey Brown, D. Trodd, K. English, G. Bell, L. Wilcox, A. Katary, S. Gopal, V. Lake, N. Harris, S. Metherell, E. Radford, J. Scriven, F. Moore, H. Bancroft, J. Daglish, M. Sangombe, M. Carmody, J. Rhodes, M. Bellamy, A. Garg, A. Kuravi, E. Virgilio, P. Ranga, J. Butler, L. Botfield, C. Dexter, J. Fletcher, P. Shanmugasundaram, G. Hambrook, I. Burn, K. Manso, D. Thornton, J. Tebbutt, R. Penn, J. Hulme, S. Hussain, Z. Maqsood, S. Joseph, J. Colley, A. Hayes, C. Ahmed, R. Haque, S. Clamp, R. Kumar, M. Purewal, B. Baines, M. Frise, N. Jacques, H. Coles, J. Caterson, S. Gurung Rai, M. Brunton, E. Tilney, L. Keating, A. Walden, D. Antcliffe, A. Gordon, M. Templeton, R. Rojo, D. Banach, S. Sousa Arias, Z. Fernandez, P. Coghlan, D. Williams, C. Jardine, J. Bewley, K. Sweet, L. Grimmer, R. Johnson, Z. Garland, B. Gumbrill, C. Phillips, L. Ortiz-Ruiz de Gordoa, E. Peasgood, A. Tridente, K. Shuker S. Greer, C. Lynch, C. Pothecary, L. Roche, B. Deacon, K. Turner, J. Singh, G. Sera Howe, P. Paul, M. Gill, I. Wynter, V. Ratnam, S. Shelton, J. Naisbitt, J. Melville, R. Baruah, S. Morrison, A. McGregor, V. Parris, M. Mpelembue, S. Srikaran, C. Dennis, A. Sukha, A. Williams, M. Verlande, K. Holding, K. Riches, C. Downes, C. Swan, A. Rostron, A. Roy, L. Woods, S. Cornell, F. Wakinshaw, B. Creagh-Brown, H. Blackman, A. Salberg, E. Smith, S. Donlon, S. Mtuwa, N. Michalak-Glinska, S. Stone, C. Beazley, V. Pristopan, N. Nikitas, L. Lankester, C. Wells, A. S. Raj, K. Fletcher, R. Khade, G. Tsinaslanidis, M. McMahon, S. Fowler, A. McGregor, T. Coventry, R. Stewart, L. Wren, E. Mwaura, L. Mew, A. Rose, D. Scaletta, F. Williams, K. Inweregbu, A. Nicholson, N. Lancaster, M. Cunningham, A. Daniels, L. Harrison, S. Hope, S. Jones, A. Crew, G. Wray, J. Matthews, R. Crawley, J. Carter, I. Birkinshaw, J. Ingham, Z. Scott, K. Howard, R. Joy, S. Roche, M. Clark, S. Purvis, A. Morrison, D. Strachan, M. Taylor, S. Clements, K. Black, C. Parmar, A. Altabaibeh, K. Simpson, L. Mostoles, K. Gilbert, L. Ma, A. Alvaro, M. Thomas, B. Faulkner, R. Worner, K. Hayes, E. Gendall, H. Blakemore, B. Borislavova, E. Goff, A. Vuylsteke, L. Mwaura, J. Zamikula, L. Garner, A. Mitchell, S. Mepham, L. Cagova, A. Fofano, H. Holcombe, K. Praman, T. Szakmany, A. E. Heron, S. Cherian, S. Cutler, A. Roynon-Reed, G. Randell, K. Convery, K. Stammers D. Fottrell-Gould, L. Hudig, J. Keshet-price, M. Peters, L. O’Neill, S. Ray, H. Belfield, T. McHugh, G. Jones, O. Akinkugbe, A. Tomas, E. Abaleke, E. Beech, H. Meghari, S. Yussuf, A. Bamford, B. Hairsine, E. Dooks, F. Farquhar, S. Packham, H. Bates, C. McParland, L. Armstrong, C. Kaye, A. Allan, J. Medhora, J. Liew, A. Botello, F. Anderson, R. Cusack, H. Golding, K. Prager, T. Williams, S. Leggett, K. Golder, M. Male, O. Jones, K. Criste, M. Marani, V. Anumakonda, V. Amin, K. Karthik, R. Kausar, E. Anastasescu, K. Reid, M. Jacqui, A. Hormis, R. Walker, D. Collier, T. Duncan, A. Uriel, A. Ustianowski, H. T-Michael, M. Bruce, K. Connolly, K. Smith, R. Partridge, D. Griffin, M. McDonald, N. Muchenje, D. Martin, H. Filipe, C. Eastgate, C. Jackson, A. Gratrix, L. Foster, V. Martinson, E. Stones, Caroline Abernathy, P. Parkinson, A. Reed, C. Prendergast, P. Rogers, M. Woodruff, R. Shokkar, S. Kaul, A. Barron, C. Collins, S. Beavis, A. Whileman, K. Dale, J. Hawes, K. Pritchard, R. Gascoyne, L. Stevenson, R. Jha, L. Lim, V. Krishnamurthy, R. Parker, I. Turner-Bone, L. Wilding, A. Reddy, S. Whiteley, E. Wilby, C. Howcroft, A. Aspinwall, S. Charlton, B. Ogg, D. Menzies, R. Pugh, E. Allan, R. Lean, F. Davies, J. Easton, X. Qiu, S. Kumar, K. Darlington, G. Houston, P. O’Brien, T. Geary, J. Allan, A. Meikle, G. Hughes, M. Balasubramaniam, S. Latham, E. McKenna, R. Flanagan, S. Sathe, E. Davies, L. Roche, M. Chablani, A. Kirkby, K. Netherton, S. Archer, B. Yates, C. Ashbrook-Raby, S. Cole, M. Casey, L. Cabrelli, S. Chapman, M. Casey, P. Austin, A. Hutcheon, C. Whyte, C. Almaden-Boyle, N. Pattison, C. Cruz, A. Vochin, H. Kent, A. Thomas, S. Murdoch, B. David, M. Penacerrada, G. Lubimbi, V. Bastion, R. Wulandari, J. Valentine, D. Clarke, A. Serrano-Ruiz, S. Hierons, L. Ramos, C. Demetriou, S. Mitchard, K. White, N. White, S. Pitts, D. Branney, J. Frankham, M. Watters, H. Langton, R. Prout, V. Page, T. Varghes, A. Cowton, A. Kay, K. Potts, M. Birt, M. Kent, A. Wilkinson, E. Jude, V. Turner, H. Savill, J. McCormick, M. Clark, M. Coulding, S. Siddiqui, O. Mercer, H. Rehman, D. Potla, N. Capps, D. Donaldson, J. Jones, H. Button, T. Martin, K. Hard, A. Agasou, L. Tonks, T. Arden, P. Boyle, M. Carnahan, J. Strickley, C. Adams, D. Childs, R. Rikunenko, M. Leigh, M. Breekes, R. Wilcox, A. Bowes, H. Tiveran, F. Hurford, J. Summers, A. Carter, Y. Hussain, L. Ting, A. Javaid, N. Motherwell, H. Moore, H. Millward, S. Jose, N. Schunki, A. Noakes, C. Clulow, G. Sadera, R. Jacob, C. Jones, M. Blunt, Z. Coton, H. Curgenven, S. Mohamed Ally, K. Beaumont, M. Elsaadany, K. Fernandes, I. Ali Mohamed Ali, H. Rangarajan, V. Sarathy, S. Selvanayagam, D. Vedage, M. White, M. Smith, N. Truman, S. Chukkambotla, S. Keith, J. Cockerill-Taylor, J. Ryan-Smith, R. Bolton, P. Springle, J. Dykes, J. Thomas, M. Khan, M. T. Hijazi, E. Massey, G. Croston, H. Reschreiter, J. Camsooksai, S. Patch, S. Jenkins, C. Humphrey, B. Wadams, J. Camsooksai, N. Bhatia, M. Msiska, O. Adanini, B. Attwood, P. Parsons, K. Tatham, S. Jhanji, E. Black, A. Dela Rosa, R. Howle, B. Thomas, T. Bemand, R. Raobaikady, R. Saha, N. Staines, A. Daniel, J. Finn, J. Hutter, P. Doble, C. Shovelton, C. Pawley, T. Kannan, M. Hill, E. Combes, S. Monnery, T. Joefield, M. Popescu, M. Thankachen, M. Oblak, J. Little, S. McIvor, A. Brady, H. Whittle, H. Prady, R. Chan, A. Ahmed, A. Morris, C. Gibson, E. Gordon, S. Keenan, H. Quinn, S. Benyon, S. Marriott, L. Zitter, L. Park, K. Baines, M. Lyons, M. Holland, N. Keenan, M. Young, S. Garrioch, J. Dawson, M. Tolson, B. Scholefield, R. Bi, N. Richardson, N. Schumacher, T. Cosier, G. Millen, A. Higham, K. Simpson, S. Turki, L. Allen, N. Crisp, T. Hazleton, A. Knight, J. Deery, C. Price, S. Turney, S. Tilbey, E. Beranova, D. Wright, L. Georg, S. Twiss, A. Cowton, S. Wadd, K. Postlethwaite, P. Gondo, B. Masunda, A. Kayani, B. Hadebe, J. Whiteside, R. Campbell, N. Clarke, P. Donnison, F. Trim, I. Leadbitter, D. Butcher, S. O’Sullivan, B. Purewal, S. Bell, V. Rivers’, R. O’Leary, J. Birch, E. Collins, S. Anderson, K. Hammerton, E. Andrews, A. Higham, K. Burns, I. Edmond, D. Salutous, A. Todd, J. Donnachie, P. Turner, L. Prentice, L. Symon, N. Runciman, F. Auld, M. Halkes, P. Mercer, L. Thornton, G. Debreceni, J. Wilkins, A. Brown, V. Crickmore, G. Subramanian, R. Marshall, C. Jennings, M. Latif, L. Bunni, M. Spivey, S. Bean, K. Burt, V. Linnett, J. Ritzema, A. Sanderson, W. McCormick, M. Bokhari, R. Kapoor, D. Loader, A. Ayers, W. Harrison, J. North, Z. Belagodu, R. Parasomthy, O. Olufuwa, A. Gherman, B. Fuller, C. Stuart, O. Kelsall, C. Davis, L. Wild, H. Wood, J. Thrush, A. Durie, K. Austin’, K. Archer, P. Anderson, C. Vigurs, C. Thorpe, A. Thomas, E. Knights, N. Boyle, A. Price, A. Kubisz-Pudelko, D. Wood, A. Lewis, S. Board, L. Pippard, J. Perry, K. Beesley, A. Rattray, M. Taylor, E. Lee, L. Lennon, K. Douglas, D. Bell, R. Boyle, L. Glass, M. Nauman Akhtar, K. Dent, D. Potoczna, S. Pearson, E. Horsley, S. Spencer, C. Phillips, D. Mullan, D. Skinner, J. Gaylard, L. Ortiz-Ruizdegordoa, R. Barber, C. Hewitt, A. Hilldrith, S. Shepardson, M. Wills, K. Jackson-Lawrence, A. Gupta, A. Easthope, E. Timlick, C. Gorman, I. Otaha, A. Gales, S. Coetzee, M. Raj, M. Peiu, V. Parris, S. Quaid, E. Watson, K. Elliott, J. Mallinson, B. Chandler, A. Turnbull, A. Quinn, C. Finch, C. Holl, J. Cooper, A. Evans., W. Khaliq, A. Collins, E. Treus Gude, N. Love, L. van Koutrik, J. Hunt, D. Kaye, E. Fisher, A. Brayne, V. Tuckey, P. Jackson, J. Parkin, D. Brealey, E. Raith, A. Tariq, H. Houlden, A. Tucci, J. Hardy, E. Moncur, J. Highgate, A. Cowley, A. Mitra, R. Stead, T. Behan, C. Burnett, M. Newton, E. Heeney, R. Pollard, J. Hatton, A. Patel, V. Kasipandian, S. Allibone, R. M. Genetu, I. Otahal, L. O’Brien, Z. Omar, E. Perkins, K. Davies, D. Tetla, C. Pothecary, B. Deacon, B. Shelley, V. Irvine, S. Williams, P. Williams, J. Birch, J. Goodsell, R. Tutton, L. Bough, B. Winter-Goodwin, R. Kitson, J. Pinnell, A. Wilson, T. Nortcliffe, T. Wood, M. Home, K. Holdroyd, M. Robinson, R. Shaw, J. Greig, M. Brady, A. Haigh, L. Matupe, M. Usher, S. Mellor, S. Dale, L. Gledhill, L. Shaw, G. Turner, D. Kelly, B. Anwar, H. Riley, H. Sturgeon, A. Ali, L. Thomis, D. Melia, A. Dance, K. Hanson, S. Humphreys, I. Frost, V. Gopal, J. Godden, A. Holden, S. Swann, T. Smith, M. Clapham, U. Poultney, R. Harper, P. Rice, W. Khaliq, R. Reece-Anthony, B. Gurung, S. Moultrie, M. Odam, A. Mayer, A. Bellini, A. Pickard, J. Bryant, N. Roe, J. Sowter, D. Butcher, K. Lang, J. Taylor, P. Barry, M. Hobrok, H. Tench, R. Wolf-Roberts, H. McGuinness, R. Loosley, D. Hawcutt, L. Rad, L. O’Malley, P. Saunderson, G. Seddon, T. Anderson, N. Rogers, J. Ruddy, M. Harkins, M. Taylor, C. Beith, A. McAlpine, L. Ferguson, P. Grant, S. MacFadyen, M. McLaughlin, T. Baird, S. Rundell, L. Glass, B. Welsh, R. Hamill, F. Fisher, T. Smith, J. Gregory, A. Brown, Axel Schmidt, Kerstin U. Ludwig, Selina Rolker, Markus M. Nöthen, Julia Fazaal, Verena Keitel, Björn Jensen, Torsten Feldt, Lisa Knopp, Julia Schröder, Carlo Maj, Fabian Brand, Marc M. Berger, Thorsten Brenner, Anke Hinney, Oliver Witzke, Robert Bals, Christian Herr, Nicole Ludwig, Jörn Walter, Jochen Schneider, Johanna Erber, Christoph D. Spinner, Clemens M. Wendtner, Christof Winter, Ulrike Protzer, Nicolas Casadei, Stephan Ossowski, Olaf H. Riess, Eva C. Schulte, J. Brent Richards, Guillaume Butler-Laporte, Mirosław Kwasniewski, Urszula Korotko, Karolina Chwialkowska, Magdalena Niemira, Jerzy Jaroszewicz, Barbara Sobala-Szczygiel, Beata Puzanowska, Anna Parfieniuk-Kowerda, Diana Martonik, Anna Moniuszko-Malinowska, Sławomir Pancewicz, Dorota Zarębska-Michaluk, Krzysztof Simon, Monika Pazgan-Simon, Iwona Mozer-Lisewska, Maciej Bura, Agnieszka Adamek, Krzysztof Tomasiewicz Małgorzata Pawłowska, Anna Piekarska, Aleksandra Berkan-Kawinska, Andrzej Horban, Justyna Kowalska, Regina Podlasin, Piotr Wasilewski, Arsalin Azzadin, Miroslaw Czuczwar, Slawomir Czaban, Paweł Olszewski, Jacek Bogocz, Magdalena Ochab, Anna Kruk, Sandra Uszok, Agnieszka Bielska, Anna Szałkowska, Justyna Raczkowska, Gabriela Sokołowska, Joanna Chorostowska-Wynimko, Aleksandra Jezela-Stanek, Adriana Roży, Urszula Lechowicz, Urszula Polowianiuk, Kamil Grubczak, Aleksandra Starosz, Andrzej Eljaszewicz, Wiktoria Izdebska, Adam Krętowski, Robert Flisiak, Marcin Moniuszko, Malak Abedalthagafi Manal Alaamery, Salam Massadeh, Mohamed Fawzy, Hadeel AlBardis, Nora Aljawini, Moneera Alsuwailm, Faisal Almalki, Serghei Mangul, Junghyun Jung, Hamdi Mbarek, Chadi Saad, Yaser Al-Sarraj, Wadha Al-Muftah, Radja Badji, Asma Al Thani, and Said I. Ismail

References

  1. Baldassarri M, Picchiotti N, Fava F, et al. Shorter androgen receptor polyQ alleles protect against life-threatening COVID-19 disease in European males. EBioMedicine. 2021;65:103246. doi: 10.1016/j.ebiom.2021.103246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Baldassarri M, Fava F, Fallerini C, et al. Severe COVID-19 in hospitalized carriers of single CFTR pathogenic variants. J Person Med. 2021;11(6):558. doi: 10.3390/jpm11060558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bayati A, Kumar R, Francis V, McPherson PS. SARS-CoV-2 infects cells after viral entry via clathrin-mediated endocytosis. J Biol Chem. 2021;296:100306. doi: 10.1016/j.jbc.2021.100306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Benetti E, Giliberti A, Emiliozzi A, et al. Clinical and molecular characterization of COVID-19 hospitalized patients. PLoS ONE. 2020;15(11):e0242534. doi: 10.1371/journal.pone.0242534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Benetti E, Tita R, Spiga O, et al. ACE2 gene variants may underlie interindividual variability and susceptibility to COVID-19 in the Italian population. Eur J Hum Genet. 2020;28(11):1602–1614. doi: 10.1038/s41431-020-0691-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Chen N, Zhou M, Dong X, et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395:507–551. doi: 10.1016/S0140-6736(20)30211-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. COVID-19 Host Genetics Initiative. https://www.covid19hg.org/. Accessed 13 Apr 2020
  8. COVID-19 Host Genetics Initiative et al (2021) Mapping the human genetic architecture of COVID-19. Published online ahead of print. Nature. 10.1038/s41586-021-03767-x
  9. COVID-19 Therapeutic Trial Synopsis (2020) WHO R&D blueprint novel coronavirus. Covid 19 therapeutic trial synopsis. R&D Blueprint
  10. Covid19hg.org (2021) June 14, 2021 meeting. https://www.covid19hg.org/blog/2021-06-17-june-14-2021-meeting/. Accessed 13 Apr 2020
  11. Croci S, Venneri MA, Mantovani S et al (2021) The polymorphism L412F in TLR3 inhibits autophagy and is a marker of severe COVID-19 in males. Autophagy (in press) [DOI] [PMC free article] [PubMed]
  12. Daga S, Fallerini C, Baldassarri M, et al. Employing a systematic approach to biobanking and analyzing clinical and genetic data for advancing COVID-19 research. Eur J Hum Genet. 2021;29(5):745–759. doi: 10.1038/s41431-020-00793-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Danecek P, Bonfield JK, Liddle J, et al. Twelve years of SAM tools and BCF tools. GigaScience. 2021;10(2):giab00. doi: 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Darrah RJ, Jacono FJ, Joshi N, et al. AGTR2 absence or antagonism prevents cystic fibrosis pulmonary manifestations. J Cyst Fibros. 2019;18(1):127–134. doi: 10.1016/j.jcf.2018.05.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Elhabyan A, Elyaacoub S, Sanad E, et al. The role of host genetics in susceptibility to severe viral infections in humans and insights into host genetics of severe COVID-19: a systematic review. Virus Res. 2020;289:198163. doi: 10.1016/j.virusres.2020.198163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fallerini C, et al. Association of Toll-like receptor 7 variants with life-threatening COVID-19 disease in males: findings from a nested case-control study. Elife. 2021;10:e67569. doi: 10.7554/eLife.67569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fallerini C, Daga S, Benetti E, et al. SELP Asp603Asn and severe thrombosis in COVID-19 males: implication for anti P-selectin monoclonal antibodies treatment. J Hematol Oncol. 2021;14(1):123. doi: 10.1186/s13045-021-01136-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gordon DE, Jang GM, Bouhaddou M, et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature. 2020;583(7816):459–468. doi: 10.1038/s41586-020-2286-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. International Multiple Sclerosis Genetics Consortium Low-frequency and rare-coding variation contributes to multiple sclerosis risk. Cell. 2018;175(6):1679–1687.e7. doi: 10.1016/j.cell.2018.09.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Islam MR, Hoque MN, Rahman MS, et al. Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity. Sci Rep. 2020;10(1):14004. doi: 10.1038/s41598-020-70812-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kosmicki JA, Horowitz JE, Banerjee N, et al. Pan-ancestry exome-wide association analyses of COVID-19 outcomes in 586,157 individuals. Am J Hum Genet. 2021;108(7):1350–1355. doi: 10.1016/j.ajhg.2021.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26(5):589–595. doi: 10.1093/bioinformatics/btp698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Li X, Zhong X, Wang Y, et al. Clinical determinants of the severity of COVID-19: a systematic review and meta-analysis. PLoS ONE. 2021;16(5):e0250602. doi: 10.1371/journal.pone.0250602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2(3):18–22. [Google Scholar]
  25. Livingston E, Bucher K (2020) Coronavirus disease 2019 (COVID-19) in Italy. JAMA 323(14):1335 [DOI] [PubMed]
  26. Marouli E, Graff M, Medina-Gomez C, et al. Rare and low-frequency coding variants alter human adult height. Nature. 2017;542(7640):186–190. doi: 10.1038/nature21039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Mars N, Widén E, Kerminen S, et al. (2020) The role of polygenic risk and susceptibility genes in breast cancer over the course of life. Nat Commun. 2020;11(1):6383. doi: 10.1038/s41467-020-19966-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Merico D, Isserlin R, Bader GD. Visualizing gene-set enrichment results using the cytoscape plug-in enrichment map. In: Cagney G, Emili A (eds) (2011) Network biology. Methods in molecular biology (methods and protocols), vol 781. Humana Press [DOI] [PubMed]
  29. Mootha VK, Lindgren CM, Eriksson K, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34(3):267–273. doi: 10.1038/ng1180. [DOI] [PubMed] [Google Scholar]
  30. O’Connor BD, Van Auwera G (2020) Genomics in the cloud: using Docker, Gatk, and WDL in Terra. O’Reilly Media
  31. Pairo-Castineira E, Clohisey S, Klaric L, et al. Genetic mechanisms of critical illness in COVID-19. Nature. 2020;591(7848):92–98. doi: 10.1038/s41586-020-03065-y. [DOI] [PubMed] [Google Scholar]
  32. Pathak GA, Singh K, Miller-Fleming TW, et al. Integrative genomic analyses identify susceptibility genes underlying COVID-19 hospitalization. Nat Commun. 2021;12(1):4569. doi: 10.1038/s41467-021-24824-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–2830. [Google Scholar]
  34. Picchiotti N, Benetti E, Fallerini C et al (2021) Post-Mendelian genetic model in COVID-19. Cardiol Cardiovas Med (in press)
  35. Purcell S, Neale B, Todd-Brown K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Reggiori F, Monastyrska I, Verheije MH, et al. Coronaviruses Hijack the LC3-I-positive EDEMosomes, ER-derived vesicles exporting short-lived ERAD regulators, for replication. Cell Host Microbe. 2010;7:500–508. doi: 10.1016/j.chom.2010.05.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, Mesirov JP. GenePattern 2.0. Nat Genet. 2006;38:500–501. doi: 10.1038/ng0506-500. [DOI] [PubMed] [Google Scholar]
  38. SevereCovid-19GWASGroup. Ellinghaus D, Degenhardt F, et al. Genomewide association study of severe covid-19 with respiratory failure. N Engl J Med. 2020;383(16):1522–1534. doi: 10.1056/NEJMoa2020283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Shannon P, Markiel A, Ozier O, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Solanich X, Vargas-Parra G, van der Made CI, et al. Genetic screening for TLR7 variants in young and previously healthy men with severe COVID-19: a case series. Front Immunol. 2021;12:719115. doi: 10.3389/fimmu.2021.719115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Subramanian A, Tamayo P, Mootha VK, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102(43):15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Subramanian S, Tus K, Li QZ, et al. A Tlr7 translocation accelerates systemic autoimmunity in murine lupus. PNAS. 2006;103(26):9970–9975. doi: 10.1073/pnas.0603912103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Tan Y, Yang T, Liu P, et al. Association of the OAS3 rs1859330 G/A genetic polymorphism with severity of enterovirus-71 infection in Chinese Han children. Arch Virol. 2017;162:2305–2313. doi: 10.1007/s00705-017-3381-6. [DOI] [PubMed] [Google Scholar]
  44. Van der Made CI, Simons A, Schuurs-Hoeijmakers J, et al. Presence of genetic variants among young men with severe COVID-19. JAMA. 2020;324(7):663–673. doi: 10.1001/jama.2020.13719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Virtanen P, Gommers R, Oliphant TE, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 2020;17(3):261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Wickenhagen A, Sugrue E, Lytras S, et al. A prenylated dsRNA sensor protects against severe COVID-19. Science. 2021 doi: 10.1126/science.abj3624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Williams F, Freidin MB, Mangino M, et al. Self-reported symptoms of COVID-19, including symptoms most predictive of SARS-CoV-2 infection, are heritable. ISTS. 2020;23(6):316–321. doi: 10.1017/thg.2020.85. [DOI] [PubMed] [Google Scholar]
  49. Zhang X, Tan Y, Ling Y, et al. Viral and host factors related to the clinical outcome of COVID-19. Nature. 2020;583:7816. doi: 10.1038/s41586-020-2355-0. [DOI] [PubMed] [Google Scholar]
  50. Zhang Q, Bastard P, Liu Z et al (2020b) Inborn errors of type I IFN immunity in patients with life-threatening COVID-19. Science (New York, N.Y.) 370(6515) [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

439_2021_2397_MOESM1_ESM.jpg (129.3KB, jpg)

Supplementary Figure 1. Barplots for 0.01 both. Barplot of significance values (NOM p-values, -log10 transformation) from GSEA analysis for all the pathways significant in both females (orange) and males (blue), p<0.01. Vertical dotted line indicates the adopted significance threshold (JPG 130 kb)

439_2021_2397_MOESM2_ESM.jpg (209.9KB, jpg)

Supplementary Figure 2. Representative heatmaps for 0.01_both. Heatmaps of the genes belonging to representative pathways significant in both females and males, p<0.01. The color gradient represents the weight of each gene, calculated as described in methods (JPG 210 kb)

439_2021_2397_MOESM3_ESM.jpg (236.8KB, jpg)

Supplementary Figure 3. Barplots for 0.005_any. Barplot of significance values (NOM p values, -log10 transformation) from GSEA analysis for all the pathways significant in either females (orange) or males (blue), p<0.005. Vertical dotted line indicates the adopted significance threshold (JPG 237 kb)

439_2021_2397_MOESM4_ESM.jpg (222.4KB, jpg)

Supplementary Figure 4. Representative heatmaps for 0.005_any. Heatmaps of the genes belonging to representative pathways significant in either females or males, p<0.005. The color gradient represents the weight of each gene, calculated as described in methods (JPG 223 kb)

Data Availability Statement

The data and samples referenced here are housed in the GEN-COVID Patient Registry and the GEN-COVID Biobank and are available for consultation. You may contact the corresponding author, Prof. Alessandra Renieri (e-mail: alessandra.renieri@unisi.it).


Articles from Human Genetics are provided here courtesy of Springer

RESOURCES