Abstract
Candida bloodstream infection, i.e. candidemia, is the most frequently encountered life-threatening fungal infection worldwide, with mortality rates up to almost 50%. In the majority of candidemia cases, Candida albicans is responsible. Worryingly, a global increase in the number of patients who are susceptible to infection (e.g. immunocompromised patients), has led to a rise in the incidence of candidemia in the last few decades. Therefore, a better understanding of the anti-Candida host response is essential to overcome this poor prognosis and to lower disease incidence. Here, we integrated genome-wide association studies with bulk and single-cell transcriptomic analyses of immune cells stimulated with Candida albicans to further our understanding of the anti-Candida host response. We show that differential expression analysis upon Candida stimulation in single-cell expression data can reveal the important cell types involved in the host response against Candida. This confirmed the known major role of monocytes, but more interestingly, also uncovered an important role for NK cells. Moreover, combining the power of bulk RNA-seq with the high resolution of single-cell RNA-seq data led to the identification of 27 Candida-response QTLs and revealed the cell types potentially involved herein. Integration of these response QTLs with a GWAS on candidemia susceptibility uncovered a potential new role for LY86 in candidemia susceptibility. Finally, experimental follow-up confirmed that LY86 knockdown results in reduced monocyte migration towards the chemokine MCP-1, thereby implying that this reduced migration may underlie the increased susceptibility to candidemia. Altogether, our integrative systems genetics approach identifies previously unknown mechanisms underlying the immune response to Candida infection.
Author summary
Candida albicans is a fungus that can cause a life-threatening infection in individuals with an impaired immune system. To improve the prognosis and treatment of patients with such an infection, a better understanding of an individual’s immune response against Candida is required. However, small patient group sizes have limited our ability to gain such understanding. Here we show that integrating many different data layers can improve the sensitivity to detect the effects of genetics on the response to Candida infection and the roles different immune cell types have herein. Using this approach, we were able to prioritize genes that are associated with an increased risk of developing systemic Candida infections. We expand on the gene with the strongest risk association, LY86, and describe a potential mechanism through which this gene affects the immune response against Candida infection. Through experimental follow-up, we provided additional insights into how this gene is associated with an increased risk to develop a Candida infection. We expect that our approach can be generalized to other infectious diseases for which small patient group sizes have restricted our ability to unravel the disease mechanism in more detail. This will provide new opportunities to identify treatment targets in the future.
Introduction
Candida albicans (C. albicans) is an opportunistic fungus colonizing the skin and/or mucosae of approximately 70% of the population [1]. Disruption of the mucosal barrier or a compromised immune system of the host can increase susceptibility to Candida infections. This makes it the most common cause of hospital-acquired invasive fungal infections globally [2], with high mortality rates between 33% and 46% [3,4]. The most common form of invasive candidiasis occurs in the blood, known as candidemia [2]. Despite the severity of candidemia and its accompanying research interest, the ability to improve the outcomes for affected individuals has stagnated in recent years. Adjuvant immunotherapy has been suggested as an important strategy to improve patient outcomes, but to implement this a better understanding of the immune response to Candida is required [5,6]. As genetics have a great impact on an individual’s immune response [7], knowledge on its impact to the anti-Candida response will be important as well for the implementation of such therapies.
Genome-wide association studies (GWAS), linking genetic variants to disease risk, have been a commonly used approach to increase disease understanding. However, in the context of candidemia and other infectious diseases, conducting a GWAS is challenging due to the limited size of patient cohorts [8]. Moreover, GWAS studies provide limited insight into the underlying biology of how these genetic variants are linked to Candida infection susceptibility. Thus additional approaches are required.
Integrative strategies that combine different molecular datasets in the context of Candida infection have been suggested as alternative approaches to prioritize cell types, genes and pathways. These can then be used for follow-up functional studies to better understand candidemia susceptibility. For instance, Smeekens et al. integrated gene expression array data of Candida-stimulated PBMCs with genetic information and cytokine measurements from both healthy volunteers and patients with increased susceptibility to Candida infections [9]. Using this integrative approach, they identified the interferon pathway as being a crucial host response pathway against Candida infection. In a follow-up study, the additive value of integrating multiple molecular datasets became even more apparent as suggestive genetic associations together with transcriptomic data could prioritize novel pathways implicated in candidemia susceptibility, including the complement and hemostasis pathways [10].
However, further integration is required to understand the mechanism through which genetic variants lead to increased candidemia-susceptibility. These disease-associated variants can be linked to effects on gene expression levels through so-called expression quantitative trait loci (eQTL) analysis. Since disease-associated genetic variants are often regulated in a context-specific manner [11], such eQTL analyses should be performed in such a way that the context-specific nature, i.e. pathogen- and cell type-specificity, can be revealed. With the advent of single-cell RNA-sequencing (scRNA-seq) it now becomes possible to profile the expression of tens of thousands of individual cells at the same time in an unbiased manner [12]. This now allows capturing the context-specific nature of disease-associated genetic variants with increased resolution, while retaining the intercellular dynamics.
Here, we used an integrative approach combining GWAS with bulk and scRNA-seq transcriptomic analyses on Candida-stimulated and RPMI control peripheral blood mononuclear cells (PBMCs). By leveraging the sensitivity of bulk RNA-seq data with the context-specific information acquired from scRNA-seq, this integrative approach further improves our understanding of the host response against Candida.
Results and discussion
Cell type-specific transcriptional response to Candida albicans
To reveal the cell type-specific immune response against Candida, scRNA-seq analysis was performed on PBMCs from 6 individuals that were stimulated with Candida or RPMI control for 24h. After QC, a total of 15,085 cells remained, of which 7,925 cells were RPMI control and 7,160 cells were Candida-stimulated. These cells were classified as one of the following six immune cell types: B cells, CD4+ T cells, CD8+ T cells, monocytes, natural killer (NK) cells or plasmacytoid dendritic cells (pDC).
As pathogen-stimulation can potentially affect the cellular state or induce active recruitment of specific cell types, we first determined whether Candida-stimulation affected the relative abundance of immune cell types. At baseline, the largest differences in relative abundance of individual cell types varied between 1.6-fold for the CD4+ T cells up to 8.3-fold for the CD8+ T cells (S1 Fig). However, upon stimulation these abundances remained constant within an individual. Overall, CD4+ T cells were the most abundant cell type (61.2%), whereas pDCs were observed the least (1.3%) (Table 1). Even though changes in relative abundances were not detected, we cannot exclude that this is not happening in vivo, as our in vitro stimulation of PBMCs does not allow detection of active recruitment. Active recruitment of monocytes towards the lymph nodes is part of the host immune response towards Candida, as recently shown in mice [13].
Table 1. Differentially expressed genes per cell type within PBMC single-cell RNA-seq data.
Cell type | # cells | # DE genes | # up-regulated | # down-regulated |
---|---|---|---|---|
CD4+ T | 9,236 | 1,459 | 1,095 | 364 |
CD8+ T | 2,300 | 453 | 334 | 119 |
NK | 1,807 | 1,313 | 927 | 386 |
B | 789 | 392 | 329 | 63 |
Monocyte | 757 | 767 | 418 | 349 |
pDC | 196 | 56 | 49 | 7 |
DE, differentially expressed; NK, natural killer cell; pDC, plasmacytoid dendritic cell; PBMC, peripheral blood mononuclear cell.
Secondly, we identified differentially expressed (DE) genes upon stimulation per cell type separately as well as in all cells together (bulk-like) by performing DE analysis with MAST [14]. This analysis identified a total of 2,384 DE genes in the individual cell types and 3,568 DE genes in the bulk-like sample (Table 1, S1 Table). However, the noisiness and sparseness of single-cell data could potentially introduce artifacts in the DE analysis, resulting in false-positives [15]. To determine the extent to which this occurs, we compared the DE genes identified in the scRNA-seq data with their differential response in a previously described bulk RNA-seq dataset generated from Candida-stimulated PBMCs isolated from 70 individuals [7]. This comparison showed that 97.3% of the DE genes from the bulk-like scRNA-seq sample (Fig 1A-I) and at least 96.8% of the DE genes from the individual cell types (Fig 1A-II-VII) could be replicated in the bulk-RNA seq data (S1 Table). Thus, the DE genes identified in scRNA-seq data reflect true biology rather than artifacts and can be used to uncover the cell type-specific immune response against Candida. However, please note that during this prolonged incubation of 24h, it is not possible to distinguish between direct and indirect responses upon Candida stimulation.
Candida induces large gene expression differences in CD4+ T cells, NK cells and monocytes
Continuing with the 2,384 DE genes identified in the individual cell types (Fig 1B, S2 Table), we found that 71% of these genes are being upregulated upon stimulation. The majority of these DE genes (1,364) are only found in one cell type, of which the largest part in CD4+ T cells, NK cells and monocytes (558, 468 and 304 DE genes, respectively). The remaining three cell types have very few uniquely identified DE genes, with 27, 5 and 2 DE genes for B cells, CD8+ T cells and pDCs, respectively. As the power to detect DE genes for a cell type is strongly correlated with the number of cells for that particular cell type (Pearson correlation = 0.71) (Panel A in S2 Fig), part of these differences can be attributed to differences in cell numbers (Table 1). However, even when taking this into account, a disproportionately large number of DE genes are specifically identified in the monocytes and NK cells (Panel B in S2 Fig).
To follow-up on these findings, we determined whether the connectivity between each of the major cell types changed upon stimulation with Candida. For this, we calculated for each cell type their potential to interact with cells from the same or another cell type by analyzing the expression of cell type-specific receptor and ligand pairs per condition (Candida-stimulated and RPMI control), using the computational framework CellPhoneDB [16]. This analysis revealed that especially the B cells (on average 1.67-fold increase) and NK cells (on average 1.62-fold increase) gain additional potential cell-cell interactions upon stimulation with Candida (Fig 1C, S3 Table).
Previous studies have reached a consensus that monocytes play an important role in candidemia [17,18], but the contribution of NK cells is less clear [19,20]. Interestingly, specifically in immunocompromised mice the depletion of NK cells increased the susceptibility to candidemia [21]. As in humans, candidemia mainly affects immunocompromised patients, we hypothesize that NK cells are likely to play an important role in the human candidemia response as well. Through the DE and ligand-receptor expression analysis we show that, in addition to monocytes, also NK cells are strongly activated and are increasingly connected to other cells. This provides extra evidence for their importance in the immune response against Candida.
In addition to these unique responses, 1,020 DE genes were identified across multiple cell types, of which a core of 41 DE genes was shared between all six cell types. Of these shared DE genes, 89.8% of effects have the same direction across all responding cell types (Fig 1D). Moreover, these shared DE genes showed the strongest differential effect upon stimulation (Fig 1D). Pathway analysis on the core set of 41 DE genes revealed strong enrichment of the interferon pathway (P = 10-22) (S2 Table). This is in line with previous findings in PBMC bulk expression data that showed strong differential expression of the interferon pathway upon 24h Candida stimulation [9]. Notably, when taking the average expression of all interferon pathway-associated genes per cell, the strength of upregulation of the interferon I pathway after Candida stimulation is consistent across all cell types (Fig 1E).
Identification of Candida-response QTLs using bulk RNA sequencing
In addition to identifying cell type-specific responses to Candida infection, we also studied the effect of genetic variants on gene expression levels before and after Candida stimulation using previously published bulk RNA-seq data from PBMCs [7]. The rather small sample size of this study limits its predictive power, in part by the large multiple testing burden of genome-wide eQTL analysis [22]. To reduce the multiple testing burden, we limited our single nucleotide polymorphism (SNP)-gene combinations to only the 16,990 top cis SNP-gene pairs identified in the largest eQTL meta-analysis to date [23], containing unstimulated whole blood samples of 31,684 individuals. However, by confining our analysis only to previously reported cis-eQTLs in unstimulated blood samples, we might miss out on eQTLs that only show up after stimulation. Nevertheless, if there is a weak effect without stimulation that is strengthened by Candida stimulation, restricting ourselves to previously identified top SNP-gene pairs will increase our chance to detect the eQTL effect. Using this approach, a total of 1,563 and 1,637 eQTLs were found in 72 Candida-stimulated and 75 RPMI control samples, respectively (Fig 2A, S4 Table). Whilst many (44%) of these eQTLs were found both before and after stimulation, the majority of eQTLs were condition-specific (Fig 2A). By subtracting per individual and per gene the Candida-stimulated expression from the RPMI control expression, we also tested whether certain SNPs affected the expression of a particular gene with different effect sizes before and after stimulation. This so-called response QTL analysis was performed in the 67 individuals for which both Candida-stimulated and RPMI control conditions were assessed and revealed 27 response QTLs (S4 Table). Subsequently, scRNA-seq data was used to pinpoint the potential cell type in which the response QTL effects manifest themselves (S3 Fig). Annotation of the cell type- and context-specificity of eQTLs may help to understand their involvement in human disease.
Prioritization of LY86 as a potential key driver gene for candidemia
Previously, it was shown that integrating multiple molecular datasets can help prioritize disease-relevant genes, cell types and pathways [9,10]. Therefore, as a next step, we took the GWAS summary statistics of a previously published candidemia cohort of 161 cases and 152 disease-matched controls [8] and overlaid this with our 27 response QTLs (Fig 2B). This revealed an enrichment of candidemia-susceptibility SNPs within the Candida-response QTL SNPs (ƛinflation = 1.49) (Fig 2C). The top enriched response QTL SNP was rs9405943 (P = 1.2 x 10−3, OR = 0.594), and was in near perfect linkage disequilibrium with rs2103635, the SNP showing the strongest association with candidemia in this region (P = 7 x 10−4, OR = 0.58) (r2 = 0.94) (S4 Fig). SNP rs9405943 showed a strong effect on expression levels of LY86 after Candida stimulation (β = 0.58, P = 1.5 x 10−7), but not in the RPMI control condition (β = 0.05, P = 0.68) (Fig 3A).
The expression of LY86 is strongly downregulated upon Candida stimulation in the bulk RNA-seq dataset (P = 7.2 x 10−28). Additionally, we see that the candidemia-risk allele A at rs9405943 is associated with stronger downregulation of LY86 after stimulation. This suggests that high expression of LY86 has a protective function against candidemia. Single-cell gene expression data shows that both B-cells and monocytes express LY86. However, only expression in monocytes is affected by the stimulation (P = 1.9 x 10−14) (Fig 3B and 3C), suggesting that this gene contributes to candidemia susceptibility through monocytes.
It is known that LY86 forms a complex with Toll-like receptor protein RP105 and is involved in several immune disorders [24–26]. Depending on the cell type, this complex has opposite regulatory effects on TLR4 signaling [27,28]; while TLR4 signaling is activated and stimulates proliferation and antibody production in B-cells, it is negatively regulated in myeloid cells. These opposite effects likely reflect the engagement of different cell type-specific co-receptors [28]. While previous studies have shown the importance of the RP105/LY86 complex in mediating the TLR4-mediated innate immune response against bacterial lipopolysaccharides (LPS) [29,30], its role in the anti-Candida response is unknown.
In monocytes, both increased signaling activity of TLR4 and absence of RP105 are associated with downregulation of the chemokine receptor CCR2, leading to their reduced migratory capacity [25,31]. Through complex formation with LY86, RP105 inhibits TLR4 signaling in monocytes [28]. Therefore, we hypothesize that the rs9405943 candidemia-risk allele A, which lowers LY86 expression in monocytes upon Candida stimulation, will decrease the migratory capacity of monocytes, which ultimately increases susceptibility to candidemia (Fig 3D). In line with this, in mice the trafficking of CCR2-dependent inflammatory monocytes has been shown to play an essential role in fungal clearance during systemic candidiasis in the first 48h of infection [18].
Of note, the TLR4 signaling pathway has been shown to be involved in the innate immune responses of several microbial and fungal infections [32–35]. In addition, a previous study, in which PBMCs from 8 individuals were stimulated for 24h with microbial and fungal pathogens, showed reduced expression of LY86 after stimulation with Mycobacterium Tuberculosis(-1.20-fold, P = 1.53 x 10−7), Borrelia (-1.34-fold, P = 7.31 x 10−13), Pseudomonas Aeruginosa (-1.31-fold, P = 9.45 x 10−9) and Streptococcus Pseudomoniae (-1.54-fold, P = 2.01 x 10−19), but not Aspergillus Fumigatus (-0.012-fold, P = 0.98) [7]. Altogether, this indicates that the differential regulation of LY86 in monocytes, as seen in response to Candida, could also affect the susceptibility towards other blood-based bacterial infections.
Functional validation of the role of LY86 in monocytes
To test our hypothesized mechanism of action (Fig 3D), we conducted experimental follow-up studies in THP-1 monocytes. As the candidemia risk allele in combination with Candida stimulation is associated with reduced expression of LY86 specifically in the monocytes (Figs 2C and 3A–3C), we used siRNA knockdown of LY86 to mimic this effect. 72h after siRNA treatment, we confirmed efficient knockdown of LY86 (14.3-fold lower expression, P = 0.001) by qPCR. In line with our hypothesis, the expression of CCR2 was also reduced (33.3-fold, P = 0.0006) upon knockdown of LY86 (Fig 3E, S5 Table). These 72h LY86 or control siRNA treated cells were then used in a migration assay to assess their migratory capacity towards the chemokine MCP-1 or serum-free medium as a control. After 3h incubation, we only observed migration towards MCP-1 and not the serum-free medium. Notably, the migratory capacity towards MCP-1 of the LY86 siRNA treated cells was reduced (2.0-fold, P = 0.01) as compared to control siRNA treated cells (Fig 3F, S5 Table). Summarized, these results indicate that reduced expression of LY86 can reduce the migratory capacity of monocytes, potentially through reduced expression of CCR2, and thereby, may increase the susceptibility to candidemia.
Final discussion and conclusion
In summary, we present an integrative approach of GWAS, bulk RNA-seq and scRNA-seq data to extract important knowledge about candidemia susceptibility. Such an integrative approach is valuable in the context of infectious diseases, such as candidemia, for which the limited size of patient cohorts limits the power of the GWAS. Otherwise, a GWAS alone would require much larger sample sizes in order to extract useful information from such studies. Moreover, a GWAS alone cannot explain how genetic variation affects disease or which cell type will be affected, and therefore, a systematic integration of different molecular datasets may be the only avenue to reveal this information. By combining these data layers, we corroborate the previously identified importance of the IFN pathway and of monocytes in Candida infections [9]. In addition, we provide new evidence for a strong response in NK cells against Candida and a potential novel role for the LY86 gene in candidemia susceptibility.
Our integrative approach is not limited to Candida infection, but can also be applied to gain a better insight into other infectious diseases for which the progress of disease understanding is hindered by small patient cohorts. We expect that in the near future, the cell type-specific and context-specific resolution of this integrated approach can be further improved as large-scale scRNA-seq datasets become readily available in many different individuals, stimulation conditions and diseases through large-scale consortia such as the single-cell eQTLGen (https://eqtlgen.org/single-cell.html) [36] and LifeTime consortium (https://lifetime-fetflagship.eu). Such increased resolution would allow reconstruction of personalized, disease-specific gene regulatory networks that could provide us with new insights that could guide new treatment opportunities [37].
Materials and methods
PBMC collection and Candida-stimulation
Whole blood from 6 individuals of the northern Netherlands population cohort Lifelines Deep [38] was drawn into EDTA-vacutainers (BD). PBMCs were isolated and maintained as previously described [39]. In short, PBMCs were isolated using Cell Preparation Tubes with sodium heparin (BD) and were cryopreserved until use in RPMI 1640 containing 40% FCS and 10% DMSO. After thawing and a 1h resting period, 50x104 cells were seeded in 200 μl RPMI1640 supplemented with 50 μg/mL gentamicin, 2 mM L-glutamine, and 1 mM pyruvate in a nucleon sphere 96-round bottom well plate. Cells were either stimulated or kept unstimulated for 24h with 1x106 heat-killed C. albicans blastoconidia (strain ATCC MYA-3573, UC 820) CFU/ml at 37°C in a 5% CO2 incubator. After 24h, cells were washed twice in medium supplemented with 0.04% bovine serum albumin. Cells were counted using a haemocytometer and cell viability was assessed by Trypan Blue.
Single-cell library preparation and sequencing
Three, sex-balanced sample pools were prepared each aimed to contain 1750 cells/donor from 6 donors (10,500 cells). One pool contained only unstimulated cells, one pool only stimulated cells and one pool contained a 50/50 mixture of both. Each sample pool was loaded into a different lane of a 10x chip (Single Cell A Chip Kit, 120236). The 10x Chromium controller (10x Genomics) in combination with v2 reagents was used to capture the single cells and generate sequencing libraries according to the manufacturer’s instructions (document CG00026) and as previously described [39]. Sequencing was performed using the Illumina HiSeq 4000 with a 75-bp paired-end kit, performed by GenomeScan (Leiden, the Netherlands).
Single-cell RNA-seq alignment, preprocessing and QC
Alignment, demultiplexing and cell type classification of the scRNA-seq data was performed as previously described [39], but now using the 2.3.0 version of Seurat [40]. After QC, 15,085 cells remained of which 7,160 were stimulated and 7,925 were unstimulated. The stimulated and unstimulated cells were combined into a single dataset using Canonical Correlation Analysis (CCA) [40], by taking the first 20 dimensions. Clusters were identified using the FindClusters function from Seurat, using the first 20 dimensions in the CCA space. Expression of known marker genes was assessed to assign cell types to each cluster, resulting in the identification of six major cell types.
Single-cell RNA-seq differential expression analysis
Differential expression (DE) between Candida-stimulated and RPMI control cells was calculated for each cell type separately and in a bulk-like analysis using the MAST implementation of the Seurat package [14]. All genes without expression in at least 1 cell were removed, leaving 20,236 genes. Bonferroni multiple testing correction was applied, yielding a significance threshold of 2.47e-06. Genes that were differentially expressed in all cell types (i.e. core genes) and each cell type individually were used as input for the ToppFun functional enrichment analysis using the REACTOME pathway [41]. P-values were calculated using the probability density function and were Bonferroni corrected.
Cell-to-cell interaction potential analysis
The potential of cell-to-cell communication through ligand/receptor pair interactions was studied using version 2 of CellPhoneDB [16]. This software uses the normalized expression data and the cell type classifications to see which cell types have expression of known ligands and receptors to estimate whether there is an interaction potential between cells of the same or different cell types. The analysis was performed on each cell type and condition (24h Candida-stimulated versus RPMI control) separately. CellPhoneDB was run using the default database of ligand-receptor interactions provided with the software and was run using default settings for p-value thresholds (0.05), expression threshold (expression in > = 10% cells) and permutations (1,000).
Bulk RNA-seq data on Candida-stimulated PBMCs
All bulk RNA-seq data from PBMCs was previously generated [7] in 70 individuals from the GONL cohort [42]. This data was generated from PBMCs that were stimulated for 24h with Candida or remained unstimulated (RPMI control condition). Details of the stimulation are similar to the scRNA-seq data on Candida-stimulated PBMCs as mentioned above, and have been previously described [9]. The differentially expressed genes upon stimulation were previously identified [7] through DESeq2 [43]. The differential expressed genes identified in the scRNA-seq data were compared with the differential response in this bulk RNA-seq data.
Bulk RNA-seq eQTL analysis
Of the same bulk RNA-seq cohort, eQTLs were identified in the data from 72 individuals and 75 individuals for Candida-stimulated and RPMI control conditions, respectively. The response QTLs were identified in the 67 individuals for which both conditions were assessed and genotype information is available. To calculate this, we subtracted per individual and per gene the Candida-stimulated expression from the RPMI control expression and tested whether certain SNPs affected the expression of a particular gene with different effect sizes before and after stimulation. All expression data were log2 transformed before being used in the 1.2.4F version of the QTL pipeline as described before [44]. To reduce the multiple testing burden, analysis was restricted to the list of 16,989 top SNP-gene combinations identified in the largest whole blood eQTL meta-analysis to date containing 31,684 whole blood samples [23]. This list of top SNP-gene combinations contains SNPs with minor allele frequencies (MAF) >0.01, Hardy-Weinberg P-values >0.0001, call rate >0.95, and MACH r2 > 0.5 within a 1Mb window of the gene. An FDR threshold of 0.05 was used as significance cut-off, using the permutation strategy described in Westra et al. with 100 permutations [45].
GWAS on candidemia susceptibility
The GWAS on candidemia susceptibility was previously described [8]. In short, this GWAS was performed in a cohort of 161 candidemia cases and 152 disease-matched controls of European ancestry whose demographic and clinical characteristics have been previously described [46]. DNA was genotyped using Illumina HumanCoreExome-12 v1.0 and HumanCoreExome-24 v1.0 BeadChip SNP chips. Genotypes were imputed using the human reference consortium reference panel [47] using the Michigan imputation server [48]. In total, 5,426,313 SNPs were tested for disease association using Fisher’s exact test with PLINK v1.9 [49]. The lambda inflation was calculated by taking the GWAS p-values for each of the 27 response-QTL SNPs, regardless of whether the GWAS p-value was significant.
siRNA treatment
Before starting the experiment, THP-1 monocytes were maintained in RPMI medium supplemented with 10% FBS and 1% Pen-Strep at 37°C in a humidified 5% CO2 incubator. 50,000 THP-1 monocytes were seeded in round-bottom 96-wells plates. During seeding, 1 μM Accell human LY86 siRNA SMARTpool (Dharmacon) or 1 μM Accell Green non-targeting siRNA control (Dharmacon) was delivered to these cells in 100 ul Accell delivery medium. After 24h, this procedure was repeated by adding an additional 100 ul to each well. After 72h, LY86 and CCR2 mRNA levels were quantified using qRT-PCR and migration rate was quantified using a migration assay.
Quantitative real-time PCR (qRT-PCR)
RNA was isolated using QIAzol lysis reagent according to manufacturer’s instructions. RNA was quantified using a Nanodrop 1000 spectrophotometer (Thermo Scientific). 400 ng RNA was reverse transcribed into cDNA using random hexamer primers with the RevertAid H Minus First Strand cDNA Synthesis Kit (Thermo Scientific) following manufacturer’s protocol. Each qRT-PCR reaction contained 500 nM of each primer pair (Table 2), 10 ng of cDNA and 1x iTaq universal SYBR green supermix (Bio-Rad). qRT-PCR reactions were conducted on the Quantstudio 7 Flex real time PCR (Thermo Fischer) for 10 min at 95 °C, followed by 40 cycles of 15 sec at 95 °C and 30 sec at 60 °C. GAPDH was used as housekeeping gene. Data and melting curves were analyzed using Quantstudio Real-time PCR software v1.3 and relative expression compared to controls was calculated using the ΔΔCt method [50]. Significance was calculated using an unpaired t-test.
Table 2. qRT-PCR primer sequences.
Target gene | Forward primer (5’-3’) | Reverse primer (5’-3’) |
---|---|---|
LY68 | TGTGGAAGAAGGAAAGGAGAGCA | GTACAGTTCCAGCAAAACCTGG |
CCR2 | AGTTGCTGAGAAGCCTGACA | TCTCTGTTCAGCTTGTGGCT |
GAPDH | CCACATCGCTCAGACACCAT | GCGCCCAATACGACCAAAT |
Migration assay
The Boyden Chamber transwell migration assay was used to determine the migration rate towards MCP-1 upon LY86 KD [51]. A polycarbonate membrane insert with a 5 μM pore size (Cell Biolabs) was placed in a well of a 24-wells plate filled with 500 μl Accell delivery medium supplemented with 0.5% BSA with or without 100 ng/ml human MCP-1 (Prospec). The insert was filled with 100 μl Accell delivery medium supplemented with 0.5% BSA and 100,000 THP-1 monocytes treated for 72h with LY68 siRNA or Green non-targeting siRNA. Cells were placed in a humidified incubator with 5% CO2 at 37 °C. After 3h, the number of migratory cells was quantified in the bottom well using a hemocytometer. Significance was calculated using an unpaired t-test.
Code availability
The original R code for Seurat [40] (https://github.com/satijalab/seurat), CellPhoneDB v2.0 [16] (https://github.com/Teichlab/cellphonedb) and our in-house eQTL pipeline [44] (https://github.com/molgenis/ systemsgenetics/tree/master/eqtl-mapping-pipeline) can be found at Github. All custom-made code is made available via GitHub (https://github.com/molgenis/scRNA-seq).
Ethics statement
The LifeLines DEEP study was approved by the ethics committee of the University Medical Center Groningen, document number METC UMCG LLDEEP: M12.113965. All participants signed an informed consent form before study enrollment. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Supporting information
Data Availability
A Seurat object [40], containing the processed single-cell RNA-seq data after QC and cell type assignment, is made available through the website accompanying our manuscript: https://eqtlgen.org/candida.html. All other relevant data is within the manuscript and Supporting Information files.
Funding Statement
M.W. and L.F. are supported by grants from the Dutch Research Council (NWO-Veni 192.029 to M.W. (https://www.nwo.nl/en/), ZonMW-VIDI 917.14.374 to L.F. (https://www.zonmw.nl/en/)), L.F. is supported by a European Research Council Starting Grant (Immrisk 637640 to L.F. (https://erc.europa.eu/)), L.F. is supported by the Oncode institute (https://www.oncode.nl/). V.K. was supported by a grant from the European Society of Clinical Microbiology and Infectious Diseases (2017, https://www.escmid.org/) and a Radboudumc Hypatia Grant (2018, https://www.radboudumc.nl/en/research/academic-and-scientific-training/talent-management/talent-programs/radboudumc-hypatia-track-and-grants/hypatia-tenure-track-grant). MGN was supported by an ERC Advanced Grant (#833247) (https://erc.europa.eu/), and a Spinoza grant of the Netherlands Organization for Scientific Research (https://www.nwo.nl/en/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Mavor AL, Thewes S, Hube B. Systemic fungal infections caused by Candida species: epidemiology, infection process and virulence attributes. Curr Drug Targets 2005. December;6(8):863–874. 10.2174/138945005774912735 [DOI] [PubMed] [Google Scholar]
- 2.Quindos G. Epidemiology of candidaemia and invasive candidiasis. A changing face. Rev Iberoam Micol 2014. Jan-Mar;31(1):42–48. 10.1016/j.riam.2013.10.001 [DOI] [PubMed] [Google Scholar]
- 3.Leroy O, Gangneux JP, Montravers P, Mira JP, Gouin F, Sollet JP, et al. Epidemiology, management, and risk factors for death of invasive Candida infections in critical care: a multicenter, prospective, observational study in France (2005–2006). Crit Care Med 2009 May;37(5):1612–1618. [DOI] [PubMed] [Google Scholar]
- 4.Moran C, Grussemeyer CA, Spalding JR, Benjamin DK Jr, Reed SD. Comparison of costs, length of stay, and mortality associated with Candida glabrata and Candida albicans bloodstream infections. Am J Infect Control 2010. Feb;38(1):78–80. 10.1016/j.ajic.2009.06.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Johnson MD, Plantinga TS, van de Vosse E, Velez Edwards DR, Smith PB, Alexander BD, et al. Cytokine gene polymorphisms and the outcome of invasive candidiasis: a prospective cohort study. Clin Infect Dis 2012. February 15;54(4):502–510. 10.1093/cid/cir827 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Delsing CE, Gresnigt MS, Leentjens J, Preijers F, Frager FA, Kox M, et al. Interferon-gamma as adjunctive immunotherapy for invasive fungal infections: a case series. BMC Infect Dis 2014. March 26;14:166-2334-14-166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Li Y, Oosting M, Deelen P, Ricano-Ponce I, Smeekens S, Jaeger M, et al. Inter-individual variability and genetic influences on cytokine responses to bacteria and fungi. Nat Med 2016. August;22(8):952–960. 10.1038/nm.4139 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Jaeger M, Matzaraki V, Aguirre-Gamboa R, Gresnigt MS, Chu X, Johnson MD, et al. A genome-wide functional genomics approach identifies susceptibility pathways to fungal bloodstream infection in humans. jid 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Smeekens SP, Ng A, Kumar V, Johnson MD, Plantinga TS, van Diemen C, et al. Functional genomics identifies type I interferon pathway as central for host defense against Candida albicans. Nat Commun 2013;4:1342 10.1038/ncomms2343 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Matzaraki V, Gresnigt MS, Jaeger M, Ricano-Ponce I, Johnson MD, Oosting M, et al. An integrative genomics approach identifies novel pathways that influence candidaemia susceptibility. PLoS One 2017. July 20;12(7):e0180824 10.1371/journal.pone.0180824 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Roadmap Epigenomics Consortium, Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, et al. Integrative analysis of 111 reference human epigenomes. Nature 2015. February 19;518(7539):317–330. 10.1038/nature14248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Svensson V, Vento-Tormo R, Teichmann SA. Exponential scaling of single-cell RNA-seq in the past decade. Nat Protoc 2018. April;13(4):599–604. 10.1038/nprot.2017.149 [DOI] [PubMed] [Google Scholar]
- 13.Blecher-Gonen R, Bost P, Hilligan KL, David E, Salame TM, Roussel E, et al. Single-Cell Analysis of Diverse Pathogen Responses Defines a Molecular Roadmap for Generating Antigen-Specific Immunity. Cell Syst 2019. February 27;8(2):109–121.e6. 10.1016/j.cels.2019.01.001 [DOI] [PubMed] [Google Scholar]
- 14.Finak G, McDavid A, Yajima M, Deng J, Gersuk V, Shalek AK, et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol 2015. December 10;16:278-015-0844-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wang T, Li B, Nelson CE, Nabavi S. Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data. BMC Bioinformatics 2019. January 18;20(1):40-019-2599-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Vento-Tormo R, Efremova M, Botting RA, Turco MY, Vento-Tormo M, Meyer KB, et al. Single-cell reconstruction of the early maternal-fetal interface in humans. Nature 2018. November;563(7731):347–353. 10.1038/s41586-018-0698-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Smeekens SP, van de Veerdonk FL, Joosten LA, Jacobs L, Jansen T, Williams DL, et al. The classical CD14(+)(+) CD16(-) monocytes, but not the patrolling CD14(+) CD16(+) monocytes, promote Th17 responses to Candida albicans. Eur J Immunol 2011. October;41(10):2915–2924. 10.1002/eji.201141418 [DOI] [PubMed] [Google Scholar]
- 18.Ngo LY, Kasahara S, Kumasaka DK, Knoblaugh SE, Jhingran A, Hohl TM. Inflammatory monocytes mediate early and organ-specific innate defense during systemic candidiasis. J Infect Dis 2014. January 1;209(1):109–119. 10.1093/infdis/jit413 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Romani L, Mencacci A, Cenci E, Spaccapelo R, Schiaffella E, Tonnetti L, et al. Natural killer cells do not play a dominant role in CD4+ subset differentiation in Candida albicans-infected mice. Infect Immun 1993. September;61(9):3769–3774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Whitney PG, Bar E, Osorio F, Rogers NC, Schraml BU, Deddouche S, et al. Syk signaling in dendritic cells orchestrates innate resistance to systemic fungal infection. PLoS Pathog 2014. July 17;10(7):e1004276 10.1371/journal.ppat.1004276 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Quintin J, Voigt J, van der Voort R, Jacobsen ID, Verschueren I, Hube B, et al. Differential role of NK cells against Candida albicans infection in immunocompetent or immunocompromised mice. Eur J Immunol 2014. August;44(8):2405–2414. 10.1002/eji.201343828 [DOI] [PubMed] [Google Scholar]
- 22.GTEx Consortium, Laboratory, Data Analysis &Coordinating Center (LDACC)-Analysis Working Group, Statistical Methods groups-Analysis Working Group, Enhancing GTEx (eGTEx) groups, NIH Common Fund, NIH/NCI, et al. Genetic effects on gene expression across human tissues. Nature 2017. October 11;550(7675):204–213. 10.1038/nature24277 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Võsa U, Claringbould A, Westra H, Bonder MJ, Deelen P, Zeng B, et al. Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis. bioRxiv 2018. Cold Spring Harbor Laboratory:447367. [Google Scholar]
- 24.Tada Y, Koarada S, Morito F, Mitamura M, Inoue H, Suematsu R, et al. Toll-like receptor homolog RP105 modulates the antigen-presenting cell function and regulates the development of collagen-induced arthritis. Arthritis Res Ther 2008;10(5):R121 10.1186/ar2529 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wezel A, van der Velden D, Maassen JM, Lagraauw HM, de Vries MR, Karper JC, et al. RP105 deficiency attenuates early atherosclerosis via decreased monocyte influx in a CCR2 dependent manner. Atherosclerosis 2015. January;238(1):132–139. 10.1016/j.atherosclerosis.2014.11.020 [DOI] [PubMed] [Google Scholar]
- 26.Chen X, Pan H, Li J, Zhang G, Cheng S, Zuo N, et al. Inhibition of myeloid differentiation 1 specifically in colon with antisense oligonucleotide exacerbates dextran sodium sulfate-induced colitis. J Cell Biochem 2019. May 19. [DOI] [PubMed] [Google Scholar]
- 27.Divanovic S, Trompette A, Atabani SF, Madan R, Golenbock DT, Visintin A, et al. Negative regulation of Toll-like receptor 4 signaling by the Toll-like receptor homolog RP105. Nat Immunol 2005. June;6(6):571–578. 10.1038/ni1198 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Schultz TE, Blumenthal A. The RP105/MD-1 complex: molecular signaling mechanisms and pathophysiological implications. J Leukoc Biol 2017. January;101(1):183–192. 10.1189/jlb.2VMR1215-582R [DOI] [PubMed] [Google Scholar]
- 29.Ogata H, Su I, Miyake K, Nagai Y, Akashi S, Mecklenbrauker I, et al. The toll-like receptor protein RP105 regulates lipopolysaccharide signaling in B cells. J Exp Med 2000. July 3;192(1):23–29. 10.1084/jem.192.1.23 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kimoto M, Nagasawa K, Miyake K. Role of TLR4/MD-2 and RP105/MD-1 in innate recognition of lipopolysaccharide. Scand J Infect Dis 2003;35(9):568–572. 10.1080/00365540310015700 [DOI] [PubMed] [Google Scholar]
- 31.Parker LC, Whyte MK, Vogel SN, Dower SK, Sabroe I. Toll-like receptor (TLR)2 and TLR4 agonists regulate CCR expression in human monocytic cells. J Immunol 2004. April 15;172(8):4977–4986. 10.4049/jimmunol.172.8.4977 [DOI] [PubMed] [Google Scholar]
- 32.Loures FV, Pina A, Felonato M, Araujo EF, Leite KR, Calich VL. Toll-like receptor 4 signaling leads to severe fungal infection associated with enhanced proinflammatory immunity and impaired expansion of regulatory T cells. Infect Immun 2010. March;78(3):1078–1088. 10.1128/IAI.01198-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Meier A, Kirschning CJ, Nikolaus T, Wagner H, Heesemann J, Ebel F. Toll-like receptor (TLR) 2 and TLR4 are essential for Aspergillus-induced activation of murine macrophages. Cell Microbiol 2003. August;5(8):561–570. 10.1046/j.1462-5822.2003.00301.x [DOI] [PubMed] [Google Scholar]
- 34.Netea MG, Gow NA, Joosten LA, Verschueren I, van der Meer JW, Kullberg BJ. Variable recognition of Candida albicans strains by TLR4 and lectin recognition receptors. Med Mycol 2010. November;48(7):897–903. 10.3109/13693781003621575 [DOI] [PubMed] [Google Scholar]
- 35.Mukherjee S, Karmakar S, Babu SP. TLR2 and TLR4 mediated host immune responses in major infectious diseases: a review. Braz J Infect Dis 2016. Mar-Apr;20(2):193–204. 10.1016/j.bjid.2015.10.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.van der Wijst MGP, de Vries DH, Groot HE, Trynka G, Hon CC, Nawijn MC, et al. Single-cell eQTLGen consortium: a personalized understanding of disease. arXiv 2019;190912550:1–26 10.7554/eLife.52155 [DOI] [Google Scholar]
- 37.van der Wijst MGP, de Vries DH, Brugge H, Westra HJ, Franke L. An integrative approach for building personalized gene regulatory networks for precision medicine. Genome Med 2018. December 19;10(1):96-018-0608-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Tigchelaar EF, Zhernakova A, Dekens JA, Hermes G, Baranska A, Mujagic Z, et al. Cohort profile: LifeLines DEEP, a prospective, general population cohort study in the northern Netherlands: study design and baseline characteristics. BMJ Open 2015. August 28;5(8):e006772-2014-006772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.van der Wijst MGP, Brugge H, de Vries DH, Deelen P, Swertz MA, Franke L. Single-cell RNA sequencing identifies celltype-specific cis-eQTLs and co-expression QTLs. Nat Genet 2018. 04/02;50(4):493–497. 10.1038/s41588-018-0089-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol 2018. April 2;36(5):411–420. 10.1038/nbt.4096 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chen J, Bardes EE, Aronow BJ, Jegga AG. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res 2009. July;37(Web Server issue):W305–11. 10.1093/nar/gkp427 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Genome of the Netherlands Consortium. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat Genet 2014. August;46(8):818–825. 10.1038/ng.3021 [DOI] [PubMed] [Google Scholar]
- 43.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 2014;15(12):550-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Zhernakova DV, Deelen P, Vermaat M, van Iterson M, van Galen M, Arindrarto W, et al. Identification of context-dependent expression quantitative trait loci in whole blood. Nat Genet 2017. print;49(1):139–145. 10.1038/ng.3737 [DOI] [PubMed] [Google Scholar]
- 45.Westra HJ, Peters MJ, Esko T, Yaghootkar H, Schurmann C, Kettunen J, et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat Genet 2013. October;45(10):1238–1243. 10.1038/ng.2756 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Kumar V, Cheng SC, Johnson MD, Smeekens SP, Wojtowicz A, Giamarellos-Bourboulis E, et al. Immunochip SNP array identifies novel genetic variants conferring susceptibility to candidaemia. Nat Commun 2014. September 8;5:4675 10.1038/ncomms5675 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.McCarthy S, Das S, Kretzschmar W, Delaneau O, Wood AR, Teumer A, et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 2016. October;48(10):1279–1283. 10.1038/ng.3643 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Das S, Forer L, Schonherr S, Sidore C, Locke AE, Kwong A, et al. Next-generation genotype imputation service and methods. Nat Genet 2016. October;48(10):1284–1287. 10.1038/ng.3656 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007. September;81(3):559–575. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 2001. December;25(4):402–408. 10.1006/meth.2001.1262 [DOI] [PubMed] [Google Scholar]
- 51.Boyden S. The chemotactic effect of mixtures of antibody and antigen on polymorphonuclear leucocytes. J Exp Med 1962. March 1;115:453–466. 10.1084/jem.115.3.453 [DOI] [PMC free article] [PubMed] [Google Scholar]