Skip to main content
PLOS Genetics logoLink to PLOS Genetics
. 2021 Jul 27;17(7):e1009684. doi: 10.1371/journal.pgen.1009684

An autoimmune disease risk variant: A trans master regulatory effect mediated by IRF1 under immune stimulation?

Margot Brandt 1,2,3, Sarah Kim-Hellmuth 1,2,4,5, Marcello Ziosi 1, Alper Gokden 1, Aaron Wolman 1, Nora Lam 1,6, Yocelyn Recinos 1,2,3, Zharko Daniloski 1,7, John A Morris 1,7, Veit Hornung 8, Johannes Schumacher 9, Tuuli Lappalainen 1,2,*
Editor: Xin He10
PMCID: PMC8345867  PMID: 34314424

Abstract

Functional mechanisms remain unknown for most genetic loci associated to complex human traits and diseases. In this study, we first mapped trans-eQTLs in a data set of primary monocytes stimulated with LPS, and discovered that a risk variant for autoimmune disease, rs17622517 in an intron of C5ORF56, affects the expression of the transcription factor IRF1 20 kb away. The cis-regulatory effect specific to IRF1 is active under early immune stimulus, with a large number of trans-eQTL effects across the genome under late LPS response. Using CRISPRi silencing, we showed that perturbation of the SNP locus downregulates IRF1 and causes widespread transcriptional effects. Genome editing by CRISPR had suggestive recapitulation of the LPS-specific trans-eQTL signal and lent support for the rs17622517 site being functional. Our results suggest that this common genetic variant affects inter-individual response to immune stimuli via regulation of IRF1. For this autoimmune GWAS locus, our work provides evidence of the functional variant, demonstrates a condition-specific enhancer effect, identifies IRF1 as the likely causal gene in cis, and indicates that overactivation of the downstream immune-related pathway may be the cellular mechanism increasing disease risk. This work not only provides rare experimental validation of a master-regulatory trans-eQTL, but also demonstrates the power of eQTL mapping to build mechanistic hypotheses amenable for experimental follow-up using the CRISPR toolkit.

Author summary

Although many genetic loci have been associated to disease, understanding how these variants impact molecular and cellular functions to impact disease risk have been challenging. Here, we first used blood cells from a large number of individuals and stimulated them in the laboratory with a proxy for bacterial infection. We identified that a genetic variant associated to autoimmune diseases also affects the expression of the nearby transcription factor IRF1 gene in early immune response, followed by expression change of other genes in late immune response. We then studied this effect in cell lines, using the CRISPR approach to silence the activity of the genomic element of this variant and cause mutations at that position. We found evidence that this autoimmune disease -associated variant is located in a genomic regulatory element that responds to immune stimulus and affects expression of IRF1 and a complex gene regulatory network. Thus, our characterization of genetic regulatory variation in the human population combined with experimental follow-up suggests a plausible, previously uncharacterized molecular mechanism that may underlie this genetic variant’s effect on immune disease risk.

Introduction

The discovery of tens of thousands of genetic loci associated to complex diseases and traits has introduced the challenge of characterizing the biological mechanisms that mediate these associations. Most GWAS loci include a large number of associated variants in linkage disequilibrium (LD) and are located in noncoding regions with likely gene regulatory functions. Therefore, it is typically unknown what is the causal variant, affected regulatory element, target gene in cis, and downstream molecular pathways that mediate genetic associations to complex disease phenotypes. Furthermore, the cellular context where these effects take place is often elusive, with increasing evidence that effects can be not only specific to cell types, but also cell states.

A key approach to answer these questions is expression quantitative trait locus (eQTL) analysis to discover associations between genetic variation and gene expression. Most widely applied in cis, eQTLs across diverse tissues have been shown to be strongly enriched for GWAS signals and have the ability to pinpoint potential target genes in GWAS loci [1]. Furthermore, response cis-eQTL mapping for immune cells with in vitro stimuli have provided powerful evidence of genetic regulatory effects that are not only specific to tissues but also to cell states, and disease associations that are driven by disrupted response to environmental stimuli [25]. Trans-eQTL mapping for distant, typically interchromosomal, genetic regulatory effects has the additional potential for elucidating regulatory pathways of the cell, and trans-eQTLs have been shown to be very strongly enriched for GWAS signals [69]. Unfortunately, robust discovery of trans-eQTLs has been challenging due to the large multiple testing burden, smaller effect sizes, and higher tissue-specificity as compared to cis-eQTLs [6]. While a large number of trans-eQTLs are mediated by a cis-eQTL, indicating potential biological mechanism, trans-eQTLs in humans rarely have strong master-regulatory effects on pathways of multiple genes [6,911].

Despite the power of eQTL characterization, the approach has limitations that open up possibilities for complementary experimental analysis. Association studies cannot provide definitive evidence for distinguishing causal regulatory variants from their LD proxies, creating the need for massively parallel screens and CRISPR assays to test whether specific genetic perturbations indeed affect gene regulation. Furthermore, experimental characterization and validation of eQTL associations has been sparse, which is a problem especially for trans-eQTLs that do not always replicate well between independent data sets [12]. Finally, with the functional genomics toolkit now including eQTL mapping and diverse experimental approaches in cellular models, understanding the transferability of ex vivo eQTL results and cellular models is poorly known. Understanding these aspects is a particularly burning question for experimental follow-up of disease-associated loci.

In this study, we mapped trans-eQTLs in a monocyte data set with LPS stimulus time points, with cis-eQTL discovery reported in earlier work [3]. We discovered an inflammatory bowel disease (IBD) GWAS variant rs17622517 that affects the expression of the transcription factor IRF1 in cis under early immune stimulus, and a large number of genes in trans under late stimulus. Using CRISPRi silencing and CRISPR genome editing, we show that the SNP locus indeed functions as an enhancer, identify the causal variant of this association, and demonstrate that the LPS-specific trans-eQTL signal can be recapitulated in a cellular model. Our results suggest that this common genetic variant affects inter-individual response to immune stimuli via regulation of IRF1, providing a strong hypothesis for the functional mechanism of the IBD risk association in this locus.

Results

eQTL discovery and characterization in cis and trans

We mapped response eQTLs in cis in primary monocytes under baseline and under the innate immune stimulus with LPS, using a data set of 134 donors with SNP genotyping and gene expression measured under baseline, and early (90 mins) and late (6h) LPS treatment that triggers the Toll-like receptor 4 (TLR4) pathway. The cis-eQTL discovery is described in [3,13], and here our focus was how early immune response cis-eQTLs may translate to later trans-eQTL effects. To this end, for the lead variants of the 126 cis-eQTLs at 90 min, we mapped trans-eQTLs for all the expressed probes at 6h. We discovered 47 probes with at least one trans-eQTL (trans-eGenes) at 5% false discovery rate (FDR) and 204 probes at 25% FDR, with 117 cis-eQTLs having at least one trans effect at 50% FDR (S1 Table and Fig 1A).

Fig 1. eQTL analysis.

Fig 1

A) Histogram showing the number of trans-eGenes at 0.5 FDR per each tested cis-eQTL locus with 6h LPS stimulation. The rightmost bar is the IRF1 locus; B) the rs17622517 cis-eQTL for IRF1, and C) and its trans-eQTL effect sizes (allelic fold change) under the three conditions; D) the association landscape for the IRF1 cis-eQTL signal; E) overlap of rs17622517 (blue line) with ENCODE transcription factor ChIP-seq peaks (grey) and motifs (green). Intensity of grey scale is proportional to the intensity of the ChIP-seq peak.

One locus stood out by being associated with a larger number of trans-eGenes than any other: 12 at 5% FDR and 232 at 50% FDR (S1 Fig). Interestingly, the trans-eQTL is a cis-eQTL for the nearby IRF1 only under early LPS response (Fig 1B), not for any other proximal gene (S2 Fig) and the trans-eGenes are enriched for IRF1 target genes, defined as having an IRF1 motif within 4 kb of their transcription start site (TSS) according to MSigDB (p = 1.8e-4). These trans-eQTLs are mostly active only at 6 hours after LPS stimulation (Fig 1C). This pattern supports a hypothesis where early LPS stimulation activates the cis-eQTL effect on IRF1 first, which then affects the expression of downstream direct and indirect targets of IRF1. The alternative allele of the lead variant rs17622517 is associated with upregulation of IRF1 upon early immune stimulus, and upregulation of 11 out of 12 of its most significant trans-eGenes as well (S3 Fig).

Next, we sought to further characterize the eQTL, its likely causal variant and its cis-regulatory mechanism. The lead variant rs17622517 is located 23 kb downstream of the IRF1 TSS in an intron of the gene C5ORF56 (Fig 1D). It has no LD-tagging variants at r2 > 0.2 in our data set nor in 1000 Genomes data, nor structural variants that are not extremely rare in gnomAD. The other neighboring variants that are also significantly associated with IRF1 in cis (S2 Table) represent an independent eQTL for IRF1 (S4A Fig), with a lead variant rs147386065 that has a uniform effect under baseline and the two stimulus timepoints (S4B Fig). The trans-eQTL signal is significant only for rs17622517, but its effect sizes for the 50% FDR trans-eGenes are correlated to much weaker effect sizes from rs147386065 (S4C Fig), suggesting that both genetic effects on IRF1 may contribute to downstream regulatory changes. For the lead eQTL, rs17622517 is a good candidate for the causal variant since it has no LD proxies, and its genomic context lends further support to this: In ENCODE data, the SNP locus overlaps open chromatin in many cell types, including monocytes [14], and a H3K27Ac peak in the GM12878 lymphoblastoid cell line suggests an active immune cell enhancer in this region. It also overlaps binding sites of numerous transcription factors in lymphoblastoid cell lines and is close to the motifs of many of them, including transcription factors activated in immune response, such as IRF4 and RELA, a subunit of NF-κB (Fig 1E).

Interestingly, this locus has a robust multi-trait genome-wide association signal for inflammatory bowel diseases (IBD; Crohn’s disease, and ulcerative colitis) and ankylosing spondylitis [1517]. At least three independent causal GWAS signals appear to exist in the locus, of which two match the two independent eQTLs in this locus and one of them is fine-mapped to rs17622517 [15,17]. The causal gene(s) in this locus have been unclear, with previous eQTL and other evidence pointing to several potential causal genes [1517]. While IRF1 is a plausible candidate as a key immune regulator with links to many ulcerative colitis-related genes [1820], functional data implicating IRF1 has been thus far lacking.

Our findings suggest that this GWAS association for IBD may be driven by a genetic effect that affects response to LPS via a cis-regulatory effect on IRF1 followed by trans-regulatory effect on its downstream pathway. With a good candidate for a causal variant in a putative enhancer, we next sought to characterize and validate these effects experimentally using enhancer and promoter silencing by CRISPRi and genome editing by CRISPR.

CRISPRi characterization of the enhancer function and downstream immune response

First, we had to select a cellular model. The monocyte-like THP1 cell line was used for some experiments, but since it is difficult to grow and edit, we chose to use as our primary model a modified HEK293 cell line HEK293/hTLR4A-MD2-CD14 (Invivogen) co-transfected with the human TLR4A, MD2 and CD14 genes, conferring NF-kB nuclear translocation in presence of Gram-negative lipopolysaccharide (LPS) [2123]. By RNA-sequencing after LPS stimulus under 4 concentrations and 7 timepoints we verified that the cell line responds to LPS stimulus and activates relevant pathways in a manner that is highly correlated to primary monocytes (S5 Fig). The transcriptional response appeared slightly delayed compared to primary monocytes, and thus we used 90 minutes and 12 hours post-LPS for early and late immune stimulus time points, respectively.

In order to study whether the rs17622517 locus indeed functions as an enhancer for IRF1, we first performed CRISPRi experiments in THP1 cell lines with the dCas9-KRAB-MeCP2 construct [24] (S6 Fig). We transfected the cells with gRNAs targeted to the variant locus to silence its activity (Fig 2A), the IRF1 promoter 14 bp downstream of the IRF1 TSS, and GFP as a control. We treated cells with LPS for 0h, 90m or 6h (with 4 to 14 replicates per condition), and performed qRT-PCR for IRF1. Its expression shows the expected upregulation with LPS in control samples (Wilcoxon p = 1.2x10-4 at 90mins and p = 0.03 at 6h compared to 0h) and promoter silencing shows strong repression of IRF1 (Fig 2B and S3 Table, Wilcoxon p <5x10-3 for all time points), indicating a successful repressive effect of the promoter gRNA. Enhancer silencing leads to significant upregulation of IRF1 at 0h LPS and suggestive downregulation of IRF1 at 90m (Fig 2B and S3 Table, Wilcoxon p = 0.022 and p = 0.16, respectively). Although somewhat surprising that we see a significant effect at baseline rather than LPS stimulation, the subtle change in effect direction between baseline and 90m LPS of the genetic effect is consistent with the stimulation-specific eQTL results (Fig 1B). This supports the eQTL data suggesting that the locus is indeed a context-specific enhancer of IRF1.

Fig 2. CRISPRi silencing.

Fig 2

A) Illustration of CRISPRi silencing of the IRF1 promoter (right) and the putative enhancer locus at rs17622517 (left); B) qRT-PCR of IRF1 with promoter and enhancer silencing in THP1 cells; C) principal component analysis of gene expression under the different LPS and CRISPRi conditions; D) The correlation of expression fold changes for promoter and enhancer silencing at 12h LPS, shown for 225 genes that are significantly differentially expressed by enhancer silencing.

Next, in order to study transcriptome-wide effect of this locus, we used HEK293/hTLR4A cell line that is easier to manipulate, and the same CRISPRi construct, gRNAs to target the enhancer and promoter loci, and LPS treatments for early and late immune response. We performed RNA-sequencing on the resulting 36 samples with a median of 17 million reads, and analyzed gene expression patterns. Principal component analysis showed that samples cluster both by LPS condition and gRNA (Fig 2C), indicating that both promoter and enhancer silencing have strong effects on the transcriptome. We observe again an upregulation of IRF1 expression with LPS (Wilcoxon p = 0.024 at 90mins and p = 0.038 at 12h compared to 0h in CRISPRi controls with GFP gRNA, S8 Fig). The fold change of 1.75 is comparable to that observed in the original primary monocyte data (1.23, Wilcoxon p = p<2.2−16, S7 Fig). Promoter silencing shows strong repression of IRF1 (Wilcoxon p = 7.4x10-7 across time points), but the effect of the enhancer silencing is not significantly different from the controls for IRF1 (Wilcoxon p = 0.8) or for other genes in the locus (P > 0.1) (S8 Fig). However, encouraged by the results in THP1 cells and the principal component analysis indicating a transcriptome-wide effect of the enhancer silencing, we hypothesized that we may be underpowered to detect its effect on IRF1 in cis in this experiment, but might capture its trans effects by counteracting the noise present in an individual genes by looking at many genes at once.

Thus, we performed transcriptome-wide differential expression analysis to detect the transcriptional effect of LPS timepoints, gRNAs and the interaction of these. First, we verified a robust LPS response in the CRISPRi controls, with 30 differentially expressed genes at 90 minutes (FDR 5%) and 476 genes at 12h compared to the 0h samples (S4 Table), with both gene sets being enriched for GO categories related to inflammatory response (S5 Table). Next, analyzing the promoter and enhancer silencing effect compared to controls at 12h, we detected 1016 differentially expressed genes for the promoter and 225 genes for the enhancer silencing (S4 Table), both having an enrichment of many cellular processes including pathways related to immune response, LPS binding and TLR4 activation (S5 Table). The differential expression of promoter and enhancer have a strong positive correlation (rho = 0.89, p<2.2e-16, Fig 2D), which suggests that the impact of enhancer may be mediated by downregulation of IRF1. The enhancer silencing effect was stronger under LPS stimulus than under baseline (S9 Fig), suggesting that the enhancer is activated under immune response. Neither enhancer nor promoter differentially expressed gene lists were significantly enriched for msigdb IRF1 targets (Fisher test p = 1 and 0.245, respectively), possibly because differentially expressed genes are expected to include direct and indirect targets and downstream pathway effects of IRF1. Altogether, these results show that repression of the enhancer affects the transcriptome, indicating that it is an active regulatory region. Together with the qRT-PCR results showing that this enhancer affects IRF1 and the transcriptome response being similar to the IRF1 promoter silencing, our results indicate that the rs17622517 locus is an enhancer of IRF1.

Detection of variant effects on gene expression by genome editing

Enhancer silencing does not necessarily reflect the effect that a genetic variant has on gene expression in cis and trans nor provide information whether a specific site has regulatory function. Thus, in order to test whether rs17622517 is indeed a functional eQTL variant and characterize its effect, we used CRISPR/Cas9 genome editing to introduce indels at the variant’s genomic locus in HEK293/TLR4 cells; SNP editing in the locus with homology-directed repair had too low efficiency to be applicable. From a population of edited cells with 50% non-homologous end joining (NHEJ) based on targeted sequencing, we isolated and grew monoclonal cell lines, and genotyped them by sequencing. We focused on 1–20 bp deletions, allowing a mix of different edited alleles in a given clone, and because the HEK293 cell line is triploid at this locus, clones were considered heterozygous if they have one or two edited alleles. Fifteen clones with initially promising genotypes were further analyzed by long-range PCR to exclude one clone with a large rearrangement at the locus [25] (S10A Fig) and one heterozygous clone was excluded due to a 50/50 allelic ratio suggesting that one allele had been lost. The 13 clones selected for functional follow-up include five homozygous reference (wild type) clones, three heterozygous clones, and five homozygous deletion clones (S10B Fig). Each of the 13 clones was exposed to LPS as above with 0h, 90 min, and 12h LPS treatment, after which RNA was extracted and sequenced, and gene expression was quantified. This was done in two replicates in order to reduce any potential noise especially in the stimulus step, and the reads from the replicates were combined for each sample, resulting in a median of 31 million reads per sample (S11 Fig).

The samples cluster strongly by LPS treatment in principal component analysis, as expected (Fig 3A and 3B). Differential expression analysis (S6 Table) confirmed the widespread LPS stimulus effect on gene expression of relevant pathways (S7 Table). IRF1 expression is induced upon stimulation at 90m and 12h compared to the controls, but it is not significantly differentially expressed between genotypes (log2 fold change 0.37, p = 0.29 heterozygous vs wild-type, log2 fold change -0.03, p = 0.93 homozygous vs wild-type, log2 fold change -0.40, p = 0.26 homozygous vs heterozygous, Fig 3C). However, again hypothesizing that genetic perturbations at this locus could have downstream effects that we are underpowered to see for IRF1, we next analyzed transcriptome-wide differential gene expression. We used an interaction model between genotype and condition to see if clone genotypes differ in their transcriptional response to LPS stimulation, discovering only up to 37 differentially expressed genes (5% FDR, S6 Table) with no apparent link to IRF1 targets. Testing differential expression between genotypes specifically at 12h, we saw the highest number of differentially expressed genes in comparison of homozygous and heterozygous clones (762 differentially expressed genes at 25% FDR, < 10 for other genotype comparisons). This discordance in numbers may be due to different effects of indels of slightly varying size and the cell lines potentially having other on-target or off-target mutations not detected by our genotyping assays (S10 Fig). Nevertheless, hypothesizing that these 762 genes may best reflect the impact of genomic disruption in this locus, we compared the gene expression response to that in the CRISPRi silencing experiment. We discovered a significant correlation of the fold changes with those from both promoter (rho = 0.21, p = 2.52e-8) and enhancer (rho = 0.22, p = 4.31e-9) silencing (Fig 3D and 3E). Correlations for the top 100 FC genes in other CRISPR genotype comparisons are shown in S12 Fig. These results suggest that disruption of genomic sequence at the rs17622517 locus can have functional effects that are partially similar to downregulation of IRF1, even though we are underpowered to detect effects on individual genes in cis and largely also in trans.

Fig 3. Genome editing with CRISPR.

Fig 3

A-B) principal component analysis of gene expression for the different genotypes and LPS conditions, with principal components 1 and 2 in (A) and 3 and 4 in (B); C) IRF1 expression (gene counts normalized to sample read depth) for the different genotypes and conditions; D-E) Correlation of gene expression fold changes at 12h LPS condition for genes that are differentially expressed in heterozygous lines compared to homozygous lines at 0.25 FDR (x-axis), versus expression fold change for promoter (D) and enhancer (E) silencing (y-axis).

Trans-eQTL effects in CRISPRi and CRISPR models

Having suggestive experimental evidence that rs17622517 is a functional eQTL variant and located in an enhancer that affects transcriptional immune response via IRF1, we sought to analyze whether the observed trans-eQTL effects in primary monocytes can be recapitulated in this experimental model. Thus, we analyzed trans-eGenes at a relaxed 50% FDR, and correlated their trans-eQTL effect sizes with expression fold change in CRISPRi and CRISPR experiments, all under the late LPS treatment. We observed a significant correlation of trans-eQTL effects with differential expression in the CRISPR cell lines for wild type/heterozygote (rho = 0.32, p = 3.7e-5, Fig 4A) and heterozygote/homozygote (rho = -0.27, p = 5.2e-4, S13B Fig) comparisons. The opposing direction of the correlations might be explained by the slight pattern of IRF1 expression of the genotype classes (Fig 3C), with the heterozygous clones having upregulation and homozygotes having downregulation of IRF1, compared to wild type at 90 minutes. Altogether, these correlations suggest that the trans-eQTL is a true association that can be captured in our cellular model (Fig 4B) despite the differences between primary monocytes and HEK/TLR4 cells and the SNP versus CRISPR-induced indels. We observed no correlation of the trans-eQTL effects with CRISPRi enhancer or promoter silencing (Figs 4C, 4D and S13C). This suggests that while CRISPRi can be very powerful for demonstrating enhancer function, it may be suboptimal for recapitulating eQTL effects driven by genetic variants.

Fig 4. Trans-eQTL effects captured by experimental data.

Fig 4

A) Correlation of trans-eQTL effect size with differential expression in CRISPR-edited heterozygous clones, B) Summary of the model supported by data of this study, with the rs17622517 C allele increasing the expression of IRF1 in cis under early immune stimulation, with downstream effects in trans under late immune stimulation. The C allele is also a risk allele for several autoimmune traits. C) Correlation of trans-eQTL effect size with differential expression after silencing of the enhancer locus with CRISPRi, and D) summarized for all the experimental comparisons.

Discussion

In this study, we discovered a condition-specific monocyte eQTL that affects the expression of IRF1 in cis and many genes in its pathway in trans, so that alternative allele carriers have an increased upregulation of IRF1 under immune stimulus. This is a plausible molecular mechanism for the inflammatory bowel disease association in this locus, with genetic predisposition for overactivation of IRF1 immune response (by the rs17622517 C allele) possibly contributing to autoimmune disease risk. This is further strengthened by two independent eQTLs for IRF1 having a GWAS signal for autoimmune traits, indicating convergence of genetic effects on autoimmune traits being mediated by IRF1 expression.

We took a CRISPR-based approach to characterize this association in detail in a cellular model. Using a combination of enhancer and promoter silencing with CRISPRi and genome editing, we showed that the locus at rs17622517 is functional, and likely affects the activity of an immune stimulus -specific enhancer for IRF1, located in an intron of C5ORF56. The effect on the expression of downstream genes in trans suggests recapitulation of the trans-eQTL association and shows how a genetic effect on a transcription factor such as IRF1 can affect entire downstream pathways.

Our work highlights both the power and challenges of CRISPR-based experimental analysis of functional effects of genetic variant. The eQTL mapping in primary monocytes under different treatments allowed us to build strong hypotheses of the mechanism that made the experimental follow-up practically feasible, demonstrating how observational functional genetics data helps to guide experimental work. Even though our modified HEK293 cell line is an imperfect model for events observed in primary monocytes, we were able to capture trans-eQTL effects in our cellular system. However, these effects do not fully recapitulate the trans-eQTL signal, and much more data will be needed to build a generalizable understanding of how well functional genetic effects can be extrapolated between cells in their physiological environment and in vitro.

Our results also highlight the limitations of CRISPR genome editing for functional follow-up: HDR efficiency is too low in many loci, including ours, for feasible analysis of specific SNPs, and interpretation of effects of different indels can be difficult. While editing the exact variant into a cell line may reflect directionally the same genetic effect as seen in the population, a deletion of a locus could feasibly have an effect in the same direction, opposite direction or not at all. This depends upon the specific mechanism of the genetic effect. For example, if a variant increases the affinity of a DNA binding protein by modifying its binding motif, that effect may not be replicated by a simple deletion of the variant locus. However, functional impact of indels does provide evidence that the specific SNP locus has a regulatory effect rather than being a nonfunctional LD proxy of another causal variant. Perhaps most importantly, we observed considerable variation between the different clones, possibly due to undesired and undetected genetic or epigenetic differences between the clones. This makes well-powered and robust analysis difficult without a very large number of clones from each genotype. A key approach in this study to mitigate the limitations of genome editing was to complement it with CRISPRi silencing. Furthermore, while both CRISPR and CRISPRi analyses had limited power to detect robust effects on IRF1 expression in cis, RNA-sequencing allowed us to leverage the power of the entire transcriptome, substantially improving our sensitivity to detect subtle regulatory effects.

The genetic eQTL effect on IRF1 and downstream gene expression in this locus manifests only under immune stimulus, and thus an individual’s transcriptional response to immune stimulus partially depends on the rs17622517 genotype. Such gene-environment interactions are common in cis [25,26], but few condition-specific master trans-regulatory variants such as the one characterized here have been well characterized before [2,27]. The association of this locus to inflammatory bowel disease, the known link between the IRF1 pathway and IBD [1517], and the link between IRF1 variants and eczema [28] and asthma [29,30] suggest that genetic IRF1 dysregulation affects disease risk. While it is likely that rs17622517 is driving both the autoimmune disease signal and the immune response eQTL due to limited linkage disequilibrium with surrounding variants, the independent overlapping eQTL makes colocalization analysis of the locus difficult. Additionally, the specific mechanistic link between the transcriptional IRF1 response described here and autoimmune disease is unknown. However, IRF1 is known to induce expression of type-I interferons [31] and pro-inflammatory cytokines, which can trigger an inflammatory response in a broad range of cell types [32]. Although this is a likely mechanism for IRF1’s role in autoimmune disease, further in vitro and/or in vivo experiments would be required to elucidate this disease mechanism and is a potential topic to be explored in future research. The role of IRF1 in TLR4 signaling in response to LPS has not been well characterized [3,33], and our findings indicate that IRF1 plays a role in LPS response in the innate immune system. However, IRF1 is expressed in many cell types and involved in other immune response pathways. Therefore, the effects described here and their relation to disease mechanism are not necessarily specific to LPS stimulation in monocytes, but could potentially be active in a wide range of immune cells and stimuli.

Altogether, our integrated analysis of genetic effects on gene expression in primary cells and in experimental models provides evidence that within the autoimmune disease risk locus near IRF1, rs17622517 is a functional variant that affects IRF1 expression in monocytes under LPS stimulation and leads to overactivation of its downstream regulatory pathway.

Methods

eQTL discovery

The IRF1 cis response eQTLs (reQTLs) were discovered in a previous immune-response eQTL study[3]. 134 donors of German descent were included in the eQTL study. Individuals were genotyped using Illumina’s HumanOmniExpress BeadChips comprising 730,525 SNPs, 579,090 of which passed QC and were used for analysis. Genotypes were phased with SHAPEIT2 and imputed with IMPUTE2 in 5 Mb chunks against the 1000 genomes phase 1 v3 reference panel. Primary monocytes were isolated from 134 donors and treated with LPS, MDP, IVT RNA, or no treatment. To assess early and late immune response, RNA expression was measured with a microarray chip after 90 min and 6 h. To detect a significant cis-reQTL, the effect size of association between genotype and expression of genes were compared between untreated and treated samples. rs17622517 was found to be a significant cis-reQTL variant for IRF1 at 90m after LPS treatment, i.e. under early immune stimulus. To explore the possibility of another genetic variant driving the association, 1000G CEU data was analyzed to see if any variants were in linkage disequilibrium >0.2 with rs17622517. Additionally, structural variants from Gnomad within 50Kb of rs17622517 were downloaded to see if any common structural variants were in proximity.

Trans-eQTL discovery was done with the MatrixEQTL R package, performing a linear regression between the genotype of the top variant for each of the 126 significant cis-reQTL at LPS 90 min and all expressed probes in monocytes at 6 hours (22,657). Benjamini-Hochberg correction was performed across p-values for all variants and probes. A significance threshold of FDR < 0.5 was used for downstream analyses. Since rs17622517 was found to be associated with many genes in trans at 6 hours, in addition to the cis association with IRF1 and GWAS association for autoimmune diseases, it was selected for further follow up.

Effect size of trans-eQTL associations was measured as allelic fold change calculated from linear regression summary statistics [34]. When multiple expression probes measure the same gene, their mean effect size was used. Effect sizes from probes for the same gene were highly correlated (rho = 0.81, p = 2.5e-5).

Enrichment of IRF1 targets

IRF1 targets were obtained from the molecular signatures database (MSigDB) gene set IRF1-01, which comprises genes which contain at least one IRF1 motif in the 4kb upstream and downstream from their transcription start site [35]. Trans-eGenes from the rs17622517 trans-eQTL (FDR < 0.5) and CRISPRi differentially expressed gene set (FDR < 0.05) were tested for enrichment in IRF1 target genes using a Fisher’s exact test comparing the number of IRF1 target genes in these sets compared to the background of all expressed genes. The background set of genes was determined from the eQTL analysis as the probes with a p of detection < 0.01 in at least 10 samples across all conditions.

Generation of THP1 CRISPRi cell line

Lentiviral particles were produced by transfecting HEK293FT (ThermoFisher) plated in 6-well plates at 80–90% confluency with 100uL Opti-MEM (ThermoFisher) pre-mixed with 0.55ug pMD2.G envelope plasmid (Addgene, 12259), 0.8ug psPAX2 packaging plasmid (Addgene, 12260), 1ug of transfer plasmid and 5.5ng of Polyethylenimine linear MW2000 (PEI) (Polyscience). Plasmids were mixed and incubated for 10 min, then the solution containing lentiviral particles was added to HEK293FT cells. We used two transfer plasmids to generate lentiviruses: KRAB-dCas9-MeCP2 (courtesy of the Sanjana lab). After 72 hours of incubation the media containing the lentivirus was harvested and collected and eventually concentrated with Lentivirus Precipitation Solution (Alstem Cell Advancem). THP1 cells were transduced at different virus concentrations in presence of 0.8ug/ml of Polybrene (Santa Cruz Biotechnology sc-134220). THP1 KRAB-dCas9-MeCP2 transduced cells were selected adding 10ug/ml of Blasticidin S.HCl (A.G.Scientific, B-1247) to the medium. Monoclonal cell lines where further selected and kept under selection at the same concentration of Blasticidin. sgRNA oligos were annealed and individually cloned by BsmBI digestion in pLentiGuidePuro vector (Addgene 52963) and verified by Sanger sequencing.

The following sgRNA sequences were inserted:

  • EGFP non targeting control (NT-Control): GGTGGTGCAGATGAACTTCA;

  • IRF1 promoter: GTCTTGCCTCGACTAAGGAG;

  • IRF1 enhancer: TTCTCTGTAGCCCTTGTATT.

Lentiviral production for the sgRNA lines was executed as above and THP1 KRAB-dCas9-MeCP2 monoclonal cell lines where transduced and selected in presence of blasticidin (10ug/ml) and puromycin (1ug/ml).

Western blot to confirm THP1 cell lines

THP1 KRAB-dCas9_MeCP2 polyclonal and monoclonal lines were tested for the correct presence of the protein by Western Blot. 10ug of protein for each sample were denaturated at 95C for 5 minutes in laemmli sample solution (BioRad 1610747). Protein was separated on 4–15% Mini-PROTEAN TGX Precast Protein Gels (Bio-Rad 451085DC). Protein was transferred on .45um Nitrocellulose membrane (Bio-Rad 1620117) and blocked in 5% milk in PBST for 1 hour. HA-tag present on the construct was detected by HA-Tag (6E2) primary antibody (Cell Signaling; Mouse mAb #2367; 1:2000) at expected size above 250kD. As an internal control, a primary antibody against GAPDH (Cell Signaling; Rabbit mAb #2118; 1:10000) was used. For detection, secondary Anti-Mouse IgG Polyclonal Antibody (IRDye 800CW; 1:10000) and Anti-Rabbit (IRDye 680RD; 1:10000) were used.

Expression of IRF1 by quantitative RT PCR

Cells were grown in absence of LPS (time 0) or in presence of 200ng/ml of ultrapure LPS from Escherichia coli (invivogen) for 90 minutes and 6 hours. Total RNA from THP1 cells was extracted using TRIzol (Invitrogen) and Direct-zol RNA MicroPrep (Zymo Research R2062) and quantified by NanoDrop. RNA was stored at −80°C. Samples were digested with ezDNase before reverse trascription (RT) executed with 0.5 μg of RNA using SuperScript IV VILO (Sigma Aldrich 11766050).

To determine IRF1 gene expression level, quantitative RT-PCR was performed using TaqMan Assays using IRF1 probe from Thermo Fisher Scientific (Hs00971965_m1, FAM). The expression of IRF1 was quantified by −ΔΔC T method and normalized to the multiplexed expression of GAPDH (Hs02786624_g1 VIC-MGB_PL). Four independent experiments were executed and run in technical triplicates in two quantitative PCR. Samples with GAPDH Ct above a threshold of 27 were excluded from the analysis as indication of low cDNA input or inconsistent PCR measurement and biological replicates were excluded if they had less than two technical replicates. Biological replicates included in the final analysis consisted of 6 0h control, 8 0h promoter, 7 0h enhancer, 14 90m control, 7 90m promoter, 12 90m enhancer, 5 6h control, 6 6h promoter and 4 6h enhancer. All qRT-PCR experiments were performed using ViiA7 System (Applied Biosystem).

HEK293 cell culture

HEK293/hTLR4A-MD2-CD14 Cells (Invivogen) were selected for functional follow up of the IRF1 trans-eQTL because of the ease of transfection of HEK293 cells and the addition of the TLR4 receptor, which is essential for cellular response to LPS. Cells were cultured in DMEM supplemented with 4.5 g/l glucose (Corning), 10% fetal bovine serum (Sigma-Aldrich), 1% penicillin/ streptomycin (Corning) and 1% L-glutaMAX (gibco). Cells were passaged using cell scraping to avoid damaging the cell surface receptors.

LPS concentration and time point optimization

To determine the optimal LPS concentration and harvesting timepoints for the HEK293-TLR4 cells, we tested their transcriptomic response to LPS under multiple conditions. HEK293-TLR4 cells were plated into three wells in 24-well plates with 180,000 cells per well. 24 hours later, 0 ng, 500 ng, 1000 ng or 2000 ng of LPS (Invivogen) per mL was added to the cells. RNA was extracted at three timepoints for the 500 and 2000 ng conditions (90m, 6h, 24h) and six timepoints for the 1000 ng condition (45m, 90m, 3h, 6h, 12h, 24h), by adding 500 uL of IBI Isolate DNA/RNA Reagent (IBI Scientific) directly to cells on the plate and stored at -80C until extraction.

RNA extraction and RNA-seq library preparation

RNA was extracted following the Direct-zol RNA MicroPrep kit (Zymo Research) manufacturer’s instructions. RNA was treated with DNAse I (Ambion) and enriched for mRNA using Dynabeads mRNA DIRECT Purification Kit (Thermo Fisher). cDNA was generated using a custom scaled-down modification of the SMART-seq protocol [36]. cDNA was synthesized from RNA input using Maxima H Minus Reverse Transcriptase (Thermo Fisher Scientific). It was then amplified using Kapa HiFi 2X Ready Mix (Kapa Biosystems) and cleaned using 0.9X Ampure beads (Beckman Coulter). Finally, cleaned cDNA samples were tagmented and indexed using the Nextera XT DNA Library Prep Kit (Illumina). Library size and tagmentation were confirmed using the TapeStation HS D1000 kit (Agilent). Libraries were pooled in equal molarity and sequenced with the NextSeq 550 High-Output kit (Illumina) with paired-end 75 bp reads.

RNA-seq data processing

Reads were first trimmed of adapters using trimmomatic, then aligned to the hg19 genome using STAR 2-pass mapping. Gene counts were calculated with FeatureCounts using Gencode v19 gene annotations.

CRISPRi of IRF1 promoter and enhancer locus

In order to determine whether the region of the eQTL variant regulates IRF1, and the effect of IRF1 perturbation in our cell line, we performed CRISPRi experiments targeting both the variant locus and the IRF1 promoter. HEK293-TLR4 cells were plated in three 24-well plates with 120,000 cells/well in 1 ml of DMEM. 24 hours later, the medium was replaced with 0.5 ml of OptiMem (Gibco) and cells were transfected with 1.5 ul of lipofectamine 3000 (Thermo Fisher Scientific), 200 ng of CRISPR-KRAB-MeCP2 vector (Addgene 110821) and 50 ng of gRNA gblock (IDT) (4 wells each received a gRNA targeting EGFP as a neutral control (‘GGTGGTGCAGATGAACTTCA’), 14 bp downstream of the IRF1 promoter (‘GTCTTGCCTCGACTAAGGAG’) or the enhancer (‘TTCTCTGTAGCCCTTGTATT’)). After 28 hours, 1 ug/mL of LPS (Invivogen) was added to the 90 m and 12 h samples and nothing to the control samples, with four biological replicates of each gRNA for each LPS treatment (36 total samples). Cells were collected at their respective time point with TRIzol reagent (Thermo Fisher Scientific) and stored at -80C before RNA extraction and RNA-sequencing.

Genome editing

In order to validate the cis and trans reQTL associations of rs17622517, we edited the HEK293-TLR4 cells using CRISPR/Cas9 genome editing. A gRNA (‘TTCTCTGTAGCCCTTGTATT’) was designed with an NGG PAM and a cut site 1 bp downstream from rs17622517. The gRNA was ordered as a single stranded oligo gblock from IDT and amplified using 2 50 uL reactions of Q5 High Fidelity 2X Master Mix (NEB). Cells were transfected with 0.5 ug gRNA gblock and 2.5 ug px458 plasmid (Addgene plasmid # 48138) containing spCas9 and GFP, using lipofectamine 3000 (Thermo Fisher Scientific). 24-hours later, cells then underwent fluorescence-activated cell sorting for GFP+ cells using a Sony SH800Z cell sorter to enrich for transfected cells. Efficiency of editing was tested using a T7E1 assay and electrophoresis gel to detect presence of NHEJ. The GFP+ cells were also sorted as single cells into 15 96-well plates and expanded into monoclonal cell lines.

Clones were genotyped by creating an amplicon library for each clone from gDNA using nextera primers capturing a 218 bp amplicon containing the variant locus. Indexing PCR was performed using primers specific to the constant sequence on the Nextera primers, resulting in dual-barcoded amplicons with Illumina adapters. Libraries were mixed in equal volume and sequenced on the MiSeq using 150 bp paired-end reads. Fastq files generated by the Illumina software were trimmed for adapter sequences and quality using trimmomatic. Reads were aligned to the genomic locus and categorized as no edit or NHEJ (with indels) using Edityper (Yahi et al. in prep). Since the cell line is triploid at this genomic locus, clones were considered heterozygous if they had NHEJ rate between 20–70%. Clones with less than 10% NHEJ were considered wild type and clones with greater than 90% NHEJ were considered homozygous edited. Five each of wild type, heterozygous and homozygous edited were selected for follow up. BAM files from the selected clones were visually inspected to confirm genotype. In addition, a 3,496 bp PCR followed by electrophoresis gel was performed on the clones in order to check for larger indels not captured by the shorter amplicon libraries.

LPS treatment of edited clones

To detect the reQTL effect of the edited locus, we performed LPS treatment followed by RNA-sequencing on each isolated edited and wild type cell line. Each clonal cell line was plated into three wells (one well each for control, 90m and 12h treatments) in 24-well plates with 180,000 cells per well. 24 hours later, 1ug/mL of LPS (Invivogen) was added to the 90 m and 12 h samples. RNA was extracted at the designated time point by adding 500 uL of IBI Isolate DNA/RNA Reagent (IBI Scientific) directly to cells on the plate and stored at -80C until extraction and RNA-sequencing. This procedure was done twice to reduce potential technical variability from the LPS treatment and sequencing, and after verifying that the results were generally consistent, the gene count matrices from the two runs were summed together for analysis.

Differential expression analysis

Differential expression analyses for the CRISPRi and CRISPR data were performed using the R package DEseq2 [37]. We first transformed and normalized the gene count matrices with vst(), and did principal component analysis. Differential expression was performed using interaction models. For CRISPRi we used ~gRNA + condition + gRNA:condition where gRNA denotes promoter, enhancer or control gRNA, and condition denotes 0, 90 min or 12 h LPS treatment. Additionally, we looked at the promoter vs control and enhancer vs control gRNA effect within 90m LPS samples and 12h LPS samples, and the effects of the LPS treatments within the control samples. For CRISPR, we used a nested interaction model (expression ~ genotype + genotype:clone + genotype:condition) accounting for the same clone undergoing different LPS treatments. Additionally, we looked at the effects of the two LPS treatments within the WT clones. Prior to p-value correction, genes were discarded if they were not annotated as protein coding or lncRNA or they did not have an average expression of greater than 5 read counts across samples. P-values for differential expression were corrected using Benjamini-Hochberg correction, using FDR<0.05 threshold.

Enrichment analysis of significantly differentially expressed genes was done using DAVID biological process gene ontology enrichment [38], using Benjamini-Hochberg corrected p-values and FDR<0.05 except for comparison between edited clone and CRISPRi differential expression, where FDR <0.25 was used.

Supporting information

S1 Table. LPS 6h Trans-eQTLs.

Significant LPS 90 min cis-reQTLs in monocytes were tested against all expressed probes and filtered for an FDR cutoff of 0.5.

(XLS)

S2 Table. IRF1 90m cis-eQTL.

Association between genotype of variants within 100 kb of IRF1 transcription start site and IRF1 expression in monocytes treated with LPS for 90 min.

(XLS)

S3 Table. IRF1 RT-qPCR in THP1 cells.

Relative quantification of IRF1 in RNA extracted from CRISPRi THP1 cells with promoter, enhancer or control gRNAs with and without LPS stimulation.

(XLSX)

S4 Table. Differential gene expression (FDR 0.05) for CRISPRi samples.

hTLR4 Cells were transfected with control, enhancer and promoter gRNAs and a CRISPRi construct. They were then treated with no LPS, LPS for 90 min and LPS for 12h. Differential expression analysis was performed for LPS treatment in CRISPRi control samples, gRNA condition in different LPS treatments, and the interaction between LPS treatment and gRNA condition.

(XLS)

S5 Table. Gene ontology enrichment for CRISPRi differentially expressed genes.

Gene ontology enrichment analysis was performed on gene sets from CRISPRi differential expression analysis.

(XLS)

S6 Table. Differential gene expression (FDR 0.05) for CRISPR edited clones.

hTLR4 Cells were transfected a CRISPR/Cas9 construct and a gRNA specific for the rs17622517 locus. Wild type, heterozygous or homozygous clones were isolated and then treated with no LPS, LPS for 90 min and LPS for 12h. Differential expression analysis was performed for LPS treatment in WT clones, and the interaction between genotype and LPS treatment.

(XLS)

S7 Table. Gene ontology enrichment for CRISPR edited differentially expressed genes.

Gene ontology enrichment analysis was performed on gene sets from CRISPR edited differential expression analysis.

(XLS)

S1 Fig. Trans-eQTL analysis with 126 cis-eQTL.

Histogram of the number of trans probes associated to each cis-eQTL SNP in monocyte trans-eQTL study at 0.05 FDR (A) and number of probes associated with each SNP at 0.5 FDR (B).

(TIFF)

S2 Fig

rs17622517 cis association with expression of genes within 1 MB in monocytes in control (A) and LPS 90 min stimulation (B).

(TIFF)

S3 Fig. Trans-eQTL associations for rs17622517.

Twelve most significant trans-eQTLs associated with rs17622517 under 6h LPS at FDR < 0.05.

(TIFF)

S4 Fig. Secondary eQTL signal in the IRF1 locus.

A) Locus zoom plot highlighting LD pattern for second top SNP in IRF1 cis-eQTL and B) cis-eQTL effect of rs147386065 on IRF1, C) trans-eQTL effect sizes of the significant rs17622517 trans-eQTLs (FDR < 50%) versus effect sizes of rs147386065 trans effects on the same genes.

(TIFF)

S5 Fig. Comparison of LPS response in HEK293-hTLR4 cells versus monocytes.

A) expression of IRF1 for the different conditions; B-C) number of rs17622517 trans-eGenes that have their maximum expression at each time point in primary monocytes using data from Kim-Hellmuth et al. 2017 (B), and in HEK293-TLR4 cells (C). The peak trans-eGene expression is at 6 hours in monocytes and at 12 hours in HEK293-TLR4 cells. D) Overlap of genes expressed in monocytes and HEK-hTLR4 cells. Monocyte expression was quantified with microarray (in Kim-Hellmuth, 2017) where expressed probes were defined as having a p value of detection less than 0.0001. HEK293-hTLR4 cells were quantified with RNA-seq and expressed genes were defined as having an average expression across samples of at least 5 reads. E) Correlation of the log2 fold change (log2FC) in HEK-hTLR4 versus monocytes for the top 100 differentially expressed genes (by absolute value of fold change) in HEK-hTLR4 cells. F) Correlation of log2FC in HEK-hTLR4 versus monocytes for the top 100 differentially expressed genes in monocytes.

(TIFF)

S6 Fig. Western blot confirming dKRAB-dCas9-MECP2 protein in THP1 cells.

Western blot with anti HA-tag of THP1 WT, THP1 KRAB-dCas9-MeCP2 polyclonal and monoclonal cell lines. Mouse GAPDH was used as loading control.

(TIFF)

S7 Fig. Effect of LPS stimulation on IRF1 in monocytes.

Expression change in IRF1 for 134 donors from Kim-Hellmuth, 2017), showing a mean fold change of 1.23 between control and LPS 90m samples (Wilcoxon p < 2.2−16).

(TIFF)

S8 Fig. Enhancer silencing effects on gene expression in cis.

Expression levels of the genes within 1Mb in HEK293-TLR4 cells with and without CRISPRi silencing of the rs17622517 locus. The lack of significant difference indicates that this putative enhancer does not have a strong effect on other genes in cis.

(TIFF)

S9 Fig. Enhancer and promoter silencing effects for different LPS time points.

Number of significantly differentially expressed genes in promoter (A) and enhancer (B) versus controls at the respective LPS condition.

(TIFF)

S10 Fig. Genotyping of edited clones.

A) Electrophoresis of long-range PCR products showed a large deletion in one clone (green) with that was excluded from further analysis; B) genotypes of clones selected for analysis.

(TIFF)

S11 Fig. Principal component analysis of the two replicate CRISPR experiments.

A) Principal components 1 and 2 for all clones across both experiments, highlighting condition and experiment. B) Principal components 3 and 4 for all clones under control conditions, highlighting clone identity.

(TIFF)

S12 Fig. Correlation of differential expression between CRISPR and CRISPRi.

Correlation of gene expression fold changes at 12h LPS condition for the top 100 differentially expressed genes (by fold change) in wild type versus heterozygous clones (A & C) and wild type versus homozygous clones (B & D) versus fold change for promoter (A & B) and enhancer (C & D) CRISPRi silencing. Correlation of 0.05 FDR differentially expressed genes in heterozygous versus homozygous clones versus fold change for promoter (E) and enhancer (F) CRISPRi silencing.

(TIFF)

S13 Fig. Correlation of trans-eQTL effect sizes and experimental perturbations.

Correlation of 0.5 FDR trans-eQTL effect size (allelic fold change) with differential expression in (A) wild-type versus CRISPR-edited homozygous clones, (B) CRISPR-edited heterozygous versus homozygous clones and (C) CRISPRi promoter silencing versus control samples. D) Correlation of 0.05 FDR trans-eQTL effect size with differential expression in CRISPR edited heterozygous clones.

(TIFF)

Acknowledgments

We thank Harm Wessel and Neville Sanjana for assistance with the THP1 cell line.

Data Availability

The RNA-sequencing fastq files and gene expression matrices are available through NCBI’s gene expression omnibus under series accession number GSE145487.

Funding Statement

This work was supported by National Institutes of Health grants R01MH106842 (T.L.), Roy and Diana Vagelos Precision Medicine Initiative Pilot Grant (T.L.), T32GM008224 (Y.R.), and Marie-Skłodowska Curie fellowship H2020 Grant 706636 (S.K-H.), American Heart Association Postdoctoral Fellowship 20POST35220040 (Z.D) and Banting Postdoctoral Fellowship from the Canadian Institutes of Health Research (J.A.M). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Gamazon ER, Segrè AV, van de Bunt M, Wen X, Xi HS, Hormozdiari F, et al. Using an atlas of gene regulation across 44 human tissues to inform complex disease- and trait-associated variation. Nat Genet. 2018. Jul;50(7):956–67. doi: 10.1038/s41588-018-0154-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Fairfax BP, Humburg P, Makino S, Naranbhai V, Wong D, Lau E, et al. Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression. Science. American Association for the Advancement of Science; 2014. Mar 7;343(6175):1246949–9. doi: 10.1126/science.1246949 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kim-Hellmuth S, Bechheim M, Pütz B, Mohammadi P, Nédélec Y, Giangreco N, et al. Genetic regulatory effects modified by immune activation contribute to autoimmune disease associations. Nat Commun. Nature Publishing Group; 2017. Aug 16;8(1):266. doi: 10.1038/s41467-017-00366-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lee MN, Ye C, Villani A-C, Raj T, Li W, Eisenhaure TM, et al. Common genetic variants modulate pathogen-sensing responses in human dendritic cells. Science. American Association for the Advancement of Science; 2014. Mar 7;343(6175):1246980–0. doi: 10.1126/science.1246980 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Nédélec Y, Sanz J, Baharian G, Szpiech ZA, Pacis A, Dumaine A, et al. Genetic Ancestry and Natural Selection Drive Population Differences in Immune Responses to Pathogens. Cell. 2016. Oct 20;167(3):657–669.e21. doi: 10.1016/j.cell.2016.09.025 [DOI] [PubMed] [Google Scholar]
  • 6.Consortium GTEx. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science; 2020. Sep 11; 369(6509):1318–1330. doi: 10.1126/science.aaz1776 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Small KS, Todorčević M, Civelek M, Moustafa JSE-S, Wang X, Simon MM, et al. Regulatory variants at KLF14 influence type 2 diabetes risk via a female-specific effect on adipocyte size and body composition. Nat Genet.; 2018. Sep;50(9):1342. [DOI] [PubMed] [Google Scholar]
  • 8.Võsa U, Claringbould A, Westra H-J, Bonder MJ, Deelen P, Zeng B, et al. Unraveling the polygenic architecture of complex traits using blood eQTL meta-analysis. BioRxiv; 2018. Oct 18;:1–57. [Google Scholar]
  • 9.Westra H-J, Peters MJ, Esko T, Yaghootkar H, Schurmann C, Kettunen J, et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat Genet. Nature Publishing Group; 2013. Sep 8;45(10):1238–43. doi: 10.1038/ng.2756 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Battle A, Mostafavi S, Zhu X, Potash JB, Weissman MM, McCormick C, et al. Characterizing the genetic basis of transcriptome diversity through RNA-sequencing of 922 individuals. Genome Res. 2014. Jan 1;24(1):14–24. doi: 10.1101/gr.155192.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Grundberg E, Small KS, Hedman ÅK, Nica AC, Buil A, Keildson S, et al. Mapping cis- and trans-regulatory effects across multiple tissues in twins. Nat Genet. Nature Publishing Group; 2012. Oct;44(10):1084–9. doi: 10.1038/ng.2394 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.GTEx Consortium. Genetic effects on gene expression across human tissues. Nature. Nature Publishing Group; 2017. Oct 11;550(7675):204–13. doi: 10.1038/nature24277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kim S, Becker J, Bechheim M, Kaiser V, Noursadeghi M, Fricker N, et al. Characterizing the genetic basis of innate immune response in TLR4-activated human monocytes. Nat Commun. Nature Publishing Group; 2014. Oct 20;5(1):5236. doi: 10.1038/ncomms6236 [DOI] [PubMed] [Google Scholar]
  • 14.Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, et al. The accessible chromatin landscape of the human genome. Nature. Nature Publishing Group; 2012. Sep 6;489(7414):75–82. doi: 10.1038/nature11232 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ellinghaus D, Jostins L, Spain SL, Cortes A, Bethune J, Han B, et al. Analysis of five chronic inflammatory diseases identifies 27 new associations and highlights disease-specific patterns at shared loci. Nat Genet. Nature Publishing Group; 2016. May;48(5):510–8. doi: 10.1038/ng.3528 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Huang H, Fang M, Jostins L, Umićević Mirkov M, Boucher G, Anderson CA, et al. Fine-mapping inflammatory bowel disease loci to single-variant resolution. Nature. Nature Publishing Group; 2017. Jul 13;547(7662):173–8. doi: 10.1038/nature22969 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.de Lange KM, Moutsianas L, Lee JC, Lamb CA, Luo Y, Kennedy NA, et al. Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease. Nat Genet. Nature Publishing Group; 2017. Feb;49(2):256–61. doi: 10.1038/ng.3760 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Shi W, Zou R, Yang M, Mai L, Ren J, Wen J, et al. Analysis of Genes Involved in Ulcerative Colitis Activity and Tumorigenesis Through Systematic Mining of Gene Co-expression Networks. Front Physiol. Frontiers; 2019;10:662. doi: 10.3389/fphys.2019.00662 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wu X-F, Ouyang Z-J, Feng L-L, Chen G, Guo W-J, Shen Y, et al. Suppression of NF-κB signaling and NLRP3 inflammasome activation in macrophages is responsible for the amelioration of experimental murine colitis by the natural compound fraxinellone. Toxicol Appl Pharmacol. 2014. Nov 15;281(1):146–56. doi: 10.1016/j.taap.2014.10.002 [DOI] [PubMed] [Google Scholar]
  • 20.Ciorba MA, Bettonville EE, McDonald KG, Metz R, Prendergast GC, Newberry RD, et al. Induction of IDO-1 by immunostimulatory DNA limits severity of experimental colitis. J Immunol. American Association of Immunologists; 2010. Apr 1;184(7):3907–16. doi: 10.4049/jimmunol.0900291 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Shimazu R, Akashi S, Ogata H, Nagai Y, Fukudome K, Miyake K, et al. MD-2, a molecule that confers lipopolysaccharide responsiveness on Toll-like receptor 4. J Exp Med. 1999. Jun 7;189(11):1777–82. doi: 10.1084/jem.189.11.1777 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Jiang Z, Georgel P, Du X, Shamel L, Sovath S, Mudd S, et al. CD14 is required for MyD88-independent LPS signaling. Nat Immunol. Nature Publishing Group; 2005. Jun;6(6):565–70. doi: 10.1038/ni1207 [DOI] [PubMed] [Google Scholar]
  • 23.Poltorak A, He X, Smirnova I, Liu MY, Van Huffel C, Du X, et al. Defective LPS signaling in C3H/HeJ and C57BL/10ScCr mice: mutations in Tlr4 gene. Science. American Association for the Advancement of Science; 1998. Dec 11;282(5396):2085–8. doi: 10.1126/science.282.5396.2085 [DOI] [PubMed] [Google Scholar]
  • 24.Yeo NC, Chavez A, Lance-Byrne A, Chan Y, Menn D, Milanova D, et al. An enhanced CRISPR repressor for targeted mammalian gene regulation. Nat Methods. Nature Publishing Group; 2018. Aug;15(8):611–6. doi: 10.1038/s41592-018-0048-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kosicki M, Tomberg K, Bradley A. Repair of double-strand breaks induced by CRISPR-Cas9 leads to large deletions and complex rearrangements. Nat Biotechnol. 2018. Sep;36(8):765–71. doi: 10.1038/nbt.4192 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Barreiro LB, Tailleux L, Pai AA, Gicquel B, Marioni JC, Gilad Y. Deciphering the genetic architecture of variation in the immune response to Mycobacterium tuberculosis infection. Proc Natl Acad Sci USA. 2012. Jan 24;109(4):1204–9. doi: 10.1073/pnas.1115761109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Quach H, Rotival M, Pothlichet J, Loh Y-HE, Dannemann M, Zidane N, et al. Genetic Adaptation and Neandertal Admixture Shaped the Immune System of Human Populations. Cell. 2016. Oct 20;167(3):643–656.e17. doi: 10.1016/j.cell.2016.09.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Kichaev G, Bhatia G, Loh P-R, Gazal S, Burch K, Freund MK, et al. Leveraging Polygenic Functional Enrichment to Improve GWAS Power. Am J Hum Genet. 2019. Jan 3;104(1):65–75. doi: 10.1016/j.ajhg.2018.11.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Demenais F, Margaritte-Jeannin P, Barnes KC, Cookson WOC, Altmüller J, Ang W, et al. Multiancestry association study identifies new asthma risk loci that colocalize with immune-cell enhancer marks. Nat Genet. Nature Publishing Group; 2018. Jan;50(1):42–53. doi: 10.1038/s41588-017-0014-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ferreira MA, Vonk JM, Baurecht H, Marenholz I, Tian C, Hoffman JD, et al. Shared genetic origin of asthma, hay fever and eczema elucidates allergic disease biology. Nat Genet. Nature Publishing Group; 2017. Dec;49(12):1752–7. doi: 10.1038/ng.3985 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Miyamoto M, Fujita T, Kimura Y, Maruyama M, Harada H, Sudo Y, et al. Regulated expression of a gene encoding a nuclear factor, IRF-1, that specifically binds to IFN-beta gene regulatory elements. Cell. 1988. Sep 9;54(6):903–13. doi: 10.1016/s0092-8674(88)91307-4 [DOI] [PubMed] [Google Scholar]
  • 32.Kröger A, Köster M, Schroeder K, Hauser H, Mueller PP. Activities of IRF-1. J Interferon Cytokine Res. Mary Ann Liebert, Inc; 2002. Jan;22(1):5–14. doi: 10.1089/107999002753452610 [DOI] [PubMed] [Google Scholar]
  • 33.Pan P-H, Cardinal J, Li M-L, Hu C-P, Tsung A. Interferon regulatory factor-1 mediates the release of high mobility group box-1 in endotoxemia in mice. Chin Med J. 2013. Mar;126(5):918–24. [PubMed] [Google Scholar]
  • 34.Mohammadi P, Castel SE, Brown AA, Lappalainen T. Quantifying the regulatory effect size of cis-acting genetic variation using allelic fold change. Genome Res. 2017. Nov;27(11):1872–84. doi: 10.1101/gr.216747.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011. Jun 15;27(12):1739–40. doi: 10.1093/bioinformatics/btr260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Picelli S, Faridani OR, Björklund AK, Winberg G, Sagasser S, Sandberg R. Full-length RNA-seq from single cells using Smart-seq2. Nat Protoc. Nature Publishing Group; 2014. Jan;9(1):171–81. doi: 10.1038/nprot.2014.006 [DOI] [PubMed] [Google Scholar]
  • 37.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. BioMed Central; 2014;15(12):550–21. doi: 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. Nature Publishing Group; 2009;4(1):44–57. doi: 10.1038/nprot.2008.211 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Scott M Williams, Xin He

5 Jun 2020

Dear Dr Lappalainen,

Thank you very much for submitting your Research Article entitled 'An autoimmune disease risk variant has a trans master regulatory effect mediated by IRF1 under immune stimulation' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. In particular, both reviewers raised the concern that the CRISPRi and CRISPR experimental results do not support the causal role of the SNP and the mediation by IRF1. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review again a much-revised version. We cannot, of course, promise publication at that time.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Xin He

Guest Editor

PLOS Genetics

Scott Williams

Section Editor: Natural Variation

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors map trans eQTLs in monocytes from 134 donors stimulated with LPS for 6h, restricting the analysis to SNPs previously identified as response cis-eQTLs at an earlier timepoint (90 mins LPS treatment). They identify a locus that is a cis-eQTL for IRF1 and is associated with a large number of IRF1 target genes in trans, as well as being a GWAS locus. They aim to validate this putative enhancer using CRISPRi and CRISPR in a modified HEK293 cell line, and present a model of how the risk allele impacts gene expression.

It is important to conduct these functional analyses to follow up eQTL studies and I appreciate the challenges with being able to design the perfect experiment to directly address the difficult problem of identifying causal SNPs and genes. However, my major concern is that the results do not support the strong conclusions made in the manuscript. The results presented are only suggestive of a role of IRF1 and rs17622517 in the nearby enhancer locus and additional work will be required to confirm they are the causal gene and SNP.

Major comments:

1. The results of the CRISPRi experiment do not support the strong conclusion that IRF1 is downregulated by the enhancer silencing

a. As stated on page 10 “The effect of the enhancer silencing is not significantly different from the controls for IRF1”. Given that downregulation of IRF1 was not observed I would suggest that the later sentence “The differential expression of promoter and enhancer have a strong positive correlation...which indicates that enhancer silencing downregulates IRF1” is too strong a conclusion and should be rephrased.

b. On page 10: “the response being similar to the IRF1 promoter silencing further provides strong evidence that it is indeed an enhancer of IRF1”. This evidence is suggestive rather than strong.

c. Figure 2c: Is the upregulation of IRF1 expression in response to LPS less than would be expected based on the previous microarray study and the response in the CRISPR experiment? While statistically significant, from the figure it looks like a very modest upregulation in expression.

d. On page 14 the authors state that “CRISPRi is suboptimal for recapitulating eQTL effects driven by genetic variants”. Could they expand on the motivation for using this technique for this eQTL validation experiment? Have the authors considered if any other experiments could be more informative for determining if the rs17622517 locus is an enhancer for IRF1 e.g. chromatin conformation capture?

e. As described in the points above, the following sentence in the abstract referring to the function as an IRF1 enhancer feels too strong given the results of the CRISPRi silencing experiment “Using CRISPRi silencing, we showed that the SNP locus indeed functions as an IRF1 enhancer with widespread transcriptional effects.”

2. On page 10, could the authors expand upon why they hypothesise that they are “underpowered to detect an effect on IRF1 in cis, but might capture its trans effects” given there is usually better power for the detection of cis effects?

3. The results of the CRISPR experiment do not support the conclusion that rs17622517 is the causal SNP driving the cis and trans eQTL effects. The experiment conducted provides additional supportive evidence for the role of that locus in the regulation of IRF1 expression, but it still doesn’t address the hypothesis that rs17622517 is the causal SNP.

a. Could the authors elaborate on what the correct interpretation of NHEJ is in the context of an eQTL? How should the reader interpret the deletion of the eQTL allele rather than a substitution?

b. On page 12 it is very hard to see the “potential slight genotype effect” particularly in fig 3a.

c. Although the correlations between CRISPR and CRISPRi are statistically significant, they are very weak and what looks like nearly half of the genes show opposite effects between the two experiments.

d. As described in the points above, I do not agree that the results presented support the sentence on page 13 “Having provided experimental evidence that rs17622517 is the causal eQTL variant”

e. Fig 4a shows a weak correlation between the trans-eQTL effect size and the CRISPR experiment which is being driven by a small number of genes. Given that such a relaxed FDR threshold of 0.5 has been used, could the authors comment on whether the genes that do show a good correlation between the experiments are those with the more significant trans eQTL FDR?

f. As described in the points above, the results presented do not support the statement in the abstract “CRISPR further indicated that rs17622517 is indeed a causal variant in this locus, and recapitulated the LPS-specific trans-eQTL signal”

Minor comments:

1. Could the authors clarify if the “strongest trans-eGenes” referred to on page 7 are those with the most significant eGenes rather than necessarily those with the greatest effect size?

2. Page 7: how were the tagging variants identified? Could this SNP be tagging a structural variant?

3. Beyond highlighting that the rs17622517 has a substantial number of trans-eGenes, Figure 1a is initially hard to interpret further. It may be more helpful to plot a histogram of how many cis SNPs have 1 trans-eGene, 2 trans-eGenes etc to highlight the rs17622517 locus but also give a better idea of how many eGenes are detected for the other SNPs.

4. It would be helpful to include the p value and effect size of the cis eQTLs in figure 1b.

5. What does the shading represent on figure 1e?

6. For supplementary figure 5 could the authors clarify in the figure legend what comparisons have been made for the differential expression due to enhancer and promoter silencing in? Is this compared to the same time point in controls?

7. Are the authors concerned that the differential expression lists aren’t enriched for IRF1 targets? Is it something else driving the difference in expression e.g. off target CRISPR effects, or is this related to the time points chosen?

8. Fig 4c is cited in the text before fig 4a. Fig 4d is cited in the text before fig 4b.

Reviewer #2: In « An autoimmune disease risk variant has a trans master regulatory effect mediated by IRF1 under immune stimulation », authors Margot Brandt et al. explore the trans-effect of genetic variation on the transcriptional response of primary monocyte upon LPS stimulation using a data set published previously in Kim-Hellmuth et al. (2017). They identified a risk variant for autoimmune disease, rs17622517, as a master regulator of the innate immune response upon late LPS stimulus through the regulation of IRF1 gene expression. To validate the role of this particular variant in such regulation, they performed CRISPR-based experiments in order to provide functional evidence of its causative effect.

Overall, the results presented by the authors are of interest since only few studies have attempted to develop an in vitro CRISPR-based approach in cellular models to characterize the functional effect of immune-responsive regulatory variants. However, I have major concerns about the conclusions highlighted by the authors.

The experimental work presented here suggest a trans-effect of the rs17622517 locus regulating immune response genes following LPS stimulation, but it is not sufficient to conclude that the effect is modulated by IRF1. Indeed, the authors did not show any direct evidence that the rs17622517 locus is an enhancer of IRF1 expression since the inhibition of this locus by CRISPRi has no effect on IRF1 expression (Wilcoxon p=O.8), as opposed to the inhibition of IRF1 promoter which exhibited a significant down-regulation of IRF1 (Wilcoxon p=7.5x10-7). The same remark can be made for the CRISPR/Cas9 genome editing experiments where the different deletions introduced in the rs17622517 locus have again no effect on IRF1 expression. In addition, the lack of power that they highlighted several times in the article, as well as the relaxed FDR thresholds that they use for the analysis indicate some weakness of the study and highlight the need to be replicated more deeply in order to have confident results. For all these reasons, the authors should be more cautious when claiming that the trans-effect of the rs17622517 locus is modulated by IRF1. The conclusions, as well as the title of this article, should be modified accordingly.

Moreover, the cellular model of HEK293-TLR4 cell line used here does not seem to be well suited to study IRF1 pathway. Although this cellular system is responsive to LPS stimulus, there is no enrichment of IRF1 target genes among differentially expressed genes upon stimulation, as opposed to the results obtained in the primary monocytes. A comparison of the transcriptional response upon stimulation between monocytes and HEK293-TLR4 cell line would be informative (Venn diagram to see the proportion of genes expressed in both cell-types, correlation of the transcriptional response for genes commonly expressed, GO enrichment for these genes…). More particularly, are the 12 trans-eGenes the most strongly associated at FDR=0.05 in monocytes expressed in HEK293 – TLR4 cells? If so, is there a correlation in their expression?

The CRISPR/Cas9 gene editing design used by the authors did not allow the discrimination of the trans-effect at the nucleotide resolution since they introduced deletions surrounding the rs17622517 locus and did not perform SNP editing. They delimited a DNA sequence where a regulatory element may be present and any nucleotide modification within this sequence, including rs17622517, may have a trans-effect. A comment on this issue in the article would be appreciable. In addition, the results regarding the analysis of Het and Hom clones are not convincing because there is no down-regulation of IRF1 expression depending on the genotype classes, as expected. The trans-effect with differential expression in Het and Hom clones showed also an opposing direction of the correlations, which is counter-intuitive, and the variability within genotype classes seems very high. The authors used ranges of NHEJ rate to determine the different genotype classes (NHEJ<10% for WT, NHEJ=20-70% for Het and NHEJ>90% for Hom). Did they choose these ranges arbitrary or this decision was supported by biological/statistical arguments? Moreover, how did the authors estimate the number of clones to test per genotype class? What was the transcriptional variation among clones within each class? Given the moderate effect size of rs17622517 locus and knowing the transcriptional variability among clones within each class, they should calculate the statistical power to detect a biological effect using 13 clones.

My general comment on this article is that the interpretation of the data should be more rigorous:

Depending on the biological hypothesis that the authors want to highlight, they considered different FDR thresholds, from 5% FDR up to 50% FDR. They should explain on which criteria they selected these thresholds and what are the consequences in terms of data significance. It reveals the clear lack of statistical power in their analyses, which results in a doubt about the robustness of the genetic associations.

In the figure Fig. 1a, the authors reported that the rs17622517 locus presented “substantially” a larger number of trans-eGenes with respect to other loci at a FDR=0.5. First they should compare groups using a FDR of 0.05 and second test if the difference is statistically significant. In general, the authors should avoid the terms “substantially” or “suggestive” when no statistical support is provided.

The authors mention that the lead variant rs17622517 had no LD-tagging variants but no information on the processing of the genotyping data has been reported in the article. What is the total number of SNPs considered in the analyses? Has SNP imputation been performed? Has the genetic sub-structure of the population been accounted for in the eQTL analysis? Has a fine mapping of the region been performed to make sure that rs17622517 was the best SNP associated with the disease, as well as with gene expression phenotypes?

Several replicates have been performed in this study but no analysis of the replicates have been reported by the authors in the article, knowing that such analysis is essential for the validation of the experimental settings. Please provide measures of technical variability and show that the biological effect that you expect is higher. Regarding the two replicates that were done for each of the 13 clones, the reads from the replicates were combined. Did the authors quantify the technical noise between replicates? Did they perform some quality checks to ensure that the sequencing data from the two replicates could be merged (correlations, variation between replicates…).

Regarding in the analyses ran in the Figure 3 d-e, the authors used the set of genes that were significantly differentially expressed in heterozygous lines compared to homozygote lines because the effect of the genetic mutations was best captured with respect to the other genotype comparisons. Please provide the biological sense of comparing the fold change of these genes with the fold change obtained in the CRISPRi promoter vs control or in the CRISPRi enhancer vs control. Such comparisons should be performed with the list of genes captured in Het vs WT or Hom vs WT, instead of Het vs Hom.

Minor comments:

When multiple expression probes measure the same gene, what was the correlation between probes for the trans-eGenes?

Is the background gene set used in the enrichment analysis for IRF1 target genes based on a pool of genes expressed in all conditions or only on gene expressed in the condition of interest?

Was the expression data corrected for technical variability?

Please add in the supplementary tables 3 and 5 the name of the genes using HUGO nomenclature.

In Supplementary Figure 6B, the orange line for gDNA should be replaced by the genomic sequence of the region.

Regarding the clones edited by CRISPR/Cas9, it would be appreciable to have their ID on the corresponding figures.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Helene Quach

Decision Letter 1

Scott M Williams, Xin He

28 Apr 2021

Dear Dr Lappalainen,

Thank you very much for submitting your Research Article entitled 'An autoimmune disease risk variant has a trans master regulatory effect mediated by IRF1 under immune stimulation' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a much-revised version. We cannot, of course, promise publication at that time.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer and the editorial comments below our signatures. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Xin He

Guest Editor

PLOS Genetics

Scott Williams

Section Editor: Natural Variation

PLOS Genetics

Both reviewers acknowledged the extra efforts made by the authors, but raised the concern of the strength of the evidence presented, particularly about the effect of the SNP on IRF1 expression. We would suggest the authors to better address the limitations of their evidence. Additionally, please change the title to: "An autoimmune disease risk variant: a trans master regulatory effect mediated by IRF1 under immune stimulation?"

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #2: In the revised version of “An autoimmune disease risk variant has a trans master regulatory effect mediated by IRF1 under immune stimulation”, the authors Margot Brandt et al. have significantly improved the quality of the manuscript by bringing new experimental data (i.e. generation of THP-1 CRISPRi cell lines and RT-PCR experiment to measure IFR1 gene expression) in order to support the trans-effect of rs17622517 on immune response following LPS activation, via the regulation of IRF1. They have also added supplementary analyses (i.e. comparison of LPS response in HEK293-hTLR4 cells versus monocytes) and technical information that allow a better understanding of the article.

The association between rs17622517 and IRF1 expression remains suggestive, preventing any clear demonstration of a trans-effect of rs17622517 following LPS activation, even in THP-1 cells, as shown in the new figure 2b. Regarding this new figure, enhancer silencing leads to significant upregulation of IRF1 at 0h LPS (t-test p= 0.018). Since we do not expect an effect of the enhancer at baseline on the basis of the eQTL mapping performed in primary monocytes (Figure 1b), please comment on your results. In addition, the authors used a t-test to show a suggestive down-regulation of IRF1 at 90 min LPS (t-test p=0.078), but nothing was specified about the normality of the distribution. According to the low number of samples, I think a Wilcoxon rank-sum test would be more appropriate. I suggest that the authors re-calculate the p-values accordingly. Finally, to appreciate the variation of IRF1 expression among THP-1 clones, the box plot should show the median value with the interquartile range and individual values should be added to the figure.

In general, the authors have made a lot of efforts to discuss the lack of power in their experimental settings preventing them to clearly demonstrate the causal role of rs17622517 in the inter-individual response to LPS stimuli via regulation of IRF1, and have softened their claims accordingly.

Reviewer #3: I am commenting on the revised version of this manuscript, not having given feedback on the original. In general, the authors have made substantial revisions in light of the comments on the original manuscript. They show several lines of evidence which, whilst individually weak, suggest that a variant near IRF1 is associated with IRF1 expression levels in early macrophage LPS response, and with differences of IRF1 targets in trans later in the same response. The CRISPRi data suggest this region, which harbors a variant associated to IBD, is an enhancer, as there are marginal effects on LPS-induced IRF1.

This work is a good cautionary tale in how hard it can be to unravel trans-eQTLs - both temporally and in terms of effect size. The authors have to compile multiple strands of individually incomplete evidence to arrive at their current narrative. I have a couple of things for them to consider:

- There is no co-localization analysis between the IBD GWAS data and at least the IRF1 cis-eQTL data, which might indicate a shared effect. In the title and discussion, the authors imply that this is likely a disease-relevant mechanism, but some acknowledgement of the limited resolution of fine-mapping should be made if no sharing can be shown.

- IRF1 is expressed in a wide variety of immune cell types. The possibility that this mechanism, if relevant to disease, may be acting in some other cell type deserves some acknowledgment.

- There is no discussion or data on whether this trans-eQTL actually changes anything about macrophage response to LPS. If no cellular phenotype can be detected, why would this effect be likely to have a broader physiological effect like influencing disease risk? I would suggest this merits at least some discussion as a limitation on the interpretation of the data presented.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #2: None

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: Yes: Helene Quach

Reviewer #3: No

Decision Letter 2

Scott M Williams, Xin He

25 Jun 2021

Dear Dr Lappalainen,

We are pleased to inform you that your manuscript entitled "An autoimmune disease risk variant: a trans master regulatory effect mediated by IRF1 under immune stimulation?" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Xin He

Guest Editor

PLOS Genetics

Scott Williams

Section Editor: Natural Variation

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #2: The authors Margot Brandt et al. took into account all my suggestions in their last version.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: Yes: Helene Quach

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-00488R2

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Acceptance letter

Scott M Williams, Xin He

22 Jul 2021

PGENETICS-D-20-00488R2

An autoimmune disease risk variant: a trans master regulatory effect mediated by IRF1 under immune stimulation?

Dear Dr Lappalainen,

We are pleased to inform you that your manuscript entitled "An autoimmune disease risk variant: a trans master regulatory effect mediated by IRF1 under immune stimulation?" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Katalin Szabo

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. LPS 6h Trans-eQTLs.

    Significant LPS 90 min cis-reQTLs in monocytes were tested against all expressed probes and filtered for an FDR cutoff of 0.5.

    (XLS)

    S2 Table. IRF1 90m cis-eQTL.

    Association between genotype of variants within 100 kb of IRF1 transcription start site and IRF1 expression in monocytes treated with LPS for 90 min.

    (XLS)

    S3 Table. IRF1 RT-qPCR in THP1 cells.

    Relative quantification of IRF1 in RNA extracted from CRISPRi THP1 cells with promoter, enhancer or control gRNAs with and without LPS stimulation.

    (XLSX)

    S4 Table. Differential gene expression (FDR 0.05) for CRISPRi samples.

    hTLR4 Cells were transfected with control, enhancer and promoter gRNAs and a CRISPRi construct. They were then treated with no LPS, LPS for 90 min and LPS for 12h. Differential expression analysis was performed for LPS treatment in CRISPRi control samples, gRNA condition in different LPS treatments, and the interaction between LPS treatment and gRNA condition.

    (XLS)

    S5 Table. Gene ontology enrichment for CRISPRi differentially expressed genes.

    Gene ontology enrichment analysis was performed on gene sets from CRISPRi differential expression analysis.

    (XLS)

    S6 Table. Differential gene expression (FDR 0.05) for CRISPR edited clones.

    hTLR4 Cells were transfected a CRISPR/Cas9 construct and a gRNA specific for the rs17622517 locus. Wild type, heterozygous or homozygous clones were isolated and then treated with no LPS, LPS for 90 min and LPS for 12h. Differential expression analysis was performed for LPS treatment in WT clones, and the interaction between genotype and LPS treatment.

    (XLS)

    S7 Table. Gene ontology enrichment for CRISPR edited differentially expressed genes.

    Gene ontology enrichment analysis was performed on gene sets from CRISPR edited differential expression analysis.

    (XLS)

    S1 Fig. Trans-eQTL analysis with 126 cis-eQTL.

    Histogram of the number of trans probes associated to each cis-eQTL SNP in monocyte trans-eQTL study at 0.05 FDR (A) and number of probes associated with each SNP at 0.5 FDR (B).

    (TIFF)

    S2 Fig

    rs17622517 cis association with expression of genes within 1 MB in monocytes in control (A) and LPS 90 min stimulation (B).

    (TIFF)

    S3 Fig. Trans-eQTL associations for rs17622517.

    Twelve most significant trans-eQTLs associated with rs17622517 under 6h LPS at FDR < 0.05.

    (TIFF)

    S4 Fig. Secondary eQTL signal in the IRF1 locus.

    A) Locus zoom plot highlighting LD pattern for second top SNP in IRF1 cis-eQTL and B) cis-eQTL effect of rs147386065 on IRF1, C) trans-eQTL effect sizes of the significant rs17622517 trans-eQTLs (FDR < 50%) versus effect sizes of rs147386065 trans effects on the same genes.

    (TIFF)

    S5 Fig. Comparison of LPS response in HEK293-hTLR4 cells versus monocytes.

    A) expression of IRF1 for the different conditions; B-C) number of rs17622517 trans-eGenes that have their maximum expression at each time point in primary monocytes using data from Kim-Hellmuth et al. 2017 (B), and in HEK293-TLR4 cells (C). The peak trans-eGene expression is at 6 hours in monocytes and at 12 hours in HEK293-TLR4 cells. D) Overlap of genes expressed in monocytes and HEK-hTLR4 cells. Monocyte expression was quantified with microarray (in Kim-Hellmuth, 2017) where expressed probes were defined as having a p value of detection less than 0.0001. HEK293-hTLR4 cells were quantified with RNA-seq and expressed genes were defined as having an average expression across samples of at least 5 reads. E) Correlation of the log2 fold change (log2FC) in HEK-hTLR4 versus monocytes for the top 100 differentially expressed genes (by absolute value of fold change) in HEK-hTLR4 cells. F) Correlation of log2FC in HEK-hTLR4 versus monocytes for the top 100 differentially expressed genes in monocytes.

    (TIFF)

    S6 Fig. Western blot confirming dKRAB-dCas9-MECP2 protein in THP1 cells.

    Western blot with anti HA-tag of THP1 WT, THP1 KRAB-dCas9-MeCP2 polyclonal and monoclonal cell lines. Mouse GAPDH was used as loading control.

    (TIFF)

    S7 Fig. Effect of LPS stimulation on IRF1 in monocytes.

    Expression change in IRF1 for 134 donors from Kim-Hellmuth, 2017), showing a mean fold change of 1.23 between control and LPS 90m samples (Wilcoxon p < 2.2−16).

    (TIFF)

    S8 Fig. Enhancer silencing effects on gene expression in cis.

    Expression levels of the genes within 1Mb in HEK293-TLR4 cells with and without CRISPRi silencing of the rs17622517 locus. The lack of significant difference indicates that this putative enhancer does not have a strong effect on other genes in cis.

    (TIFF)

    S9 Fig. Enhancer and promoter silencing effects for different LPS time points.

    Number of significantly differentially expressed genes in promoter (A) and enhancer (B) versus controls at the respective LPS condition.

    (TIFF)

    S10 Fig. Genotyping of edited clones.

    A) Electrophoresis of long-range PCR products showed a large deletion in one clone (green) with that was excluded from further analysis; B) genotypes of clones selected for analysis.

    (TIFF)

    S11 Fig. Principal component analysis of the two replicate CRISPR experiments.

    A) Principal components 1 and 2 for all clones across both experiments, highlighting condition and experiment. B) Principal components 3 and 4 for all clones under control conditions, highlighting clone identity.

    (TIFF)

    S12 Fig. Correlation of differential expression between CRISPR and CRISPRi.

    Correlation of gene expression fold changes at 12h LPS condition for the top 100 differentially expressed genes (by fold change) in wild type versus heterozygous clones (A & C) and wild type versus homozygous clones (B & D) versus fold change for promoter (A & B) and enhancer (C & D) CRISPRi silencing. Correlation of 0.05 FDR differentially expressed genes in heterozygous versus homozygous clones versus fold change for promoter (E) and enhancer (F) CRISPRi silencing.

    (TIFF)

    S13 Fig. Correlation of trans-eQTL effect sizes and experimental perturbations.

    Correlation of 0.5 FDR trans-eQTL effect size (allelic fold change) with differential expression in (A) wild-type versus CRISPR-edited homozygous clones, (B) CRISPR-edited heterozygous versus homozygous clones and (C) CRISPRi promoter silencing versus control samples. D) Correlation of 0.05 FDR trans-eQTL effect size with differential expression in CRISPR edited heterozygous clones.

    (TIFF)

    Attachment

    Submitted filename: response to reviewers.pdf

    Attachment

    Submitted filename: IRF1_response_R2.docx

    Data Availability Statement

    The RNA-sequencing fastq files and gene expression matrices are available through NCBI’s gene expression omnibus under series accession number GSE145487.


    Articles from PLoS Genetics are provided here courtesy of PLOS

    RESOURCES