Abstract
The phase III Thai RV144 vaccine trial showed an estimated vaccine efficacy (VE) to prevent HIV-1 infection of 31.2%, which has motivated the search for immune correlates of vaccine protection. In a recent report, several single nucleotide polymorphisms (SNPs) in FCGR2C were identified to associate with the level of VE in the RV144 trial. To investigate the functional significance of these SNPs, we utilized a large scale B cell RNA sequencing database of 462 individuals from the 1000 Genomes Project to examine associations between FCGR2C SNPs and gene expression. We found that the FCGR2C SNPs that associated with vaccine efficacy in RV144 also strongly associated with the expression of FCGR2A/C and one of them also associated with the expression of Fc receptor-like A (FCRLA), another Fc-γ receptor (FcγR) gene that was not examined in the previous report. These results suggest that the expression of FcγR genes is influenced by these SNPs either directly or in linkage with other causal variants. More importantly, these results motivate further investigations into the potential for a causal association of expression and alternative splicing of FCGR2C and other FcγR genes with the HIV-1 vaccine protection in the RV144 trial and other similar studies.
Introduction
The RV144 preventive HIV-1 vaccine efficacy trial tested the ALVAC-HIV-1 plus gp120 AIDSVAX B/E vaccine regimen in Thailand and demonstrated an estimated vaccine efficacy (VE) of 31.2% for prevention of HIV-1 infection [1]. Several follow-up studies were conducted to search for the correlates of risk for HIV-1 infections and to investigate the predictors and the mechanisms of the vaccine protection [2–6]. Among them, Li et al reported that FCGR2C (Fc fragment of IgG, low affinity IIc, receptor for (CD32)) polymorphisms associated with the HIV-1 vaccine protection [2]. By sequencing the external protein-coding domains and the transmembrane regions of the five low-affinity FcγR genes (FCGR2A, FCGR2B, FCGR2C, FCGR3A, FCGR3B), the study found that one FCGR2C tag SNP (rs114945036) in linkage disequilibrium with 3 other FCGR2C SNPs (rs138747765, rs78603008, and rs373013207) was associated with VE against HIV-1 subtype CRF01_AE. Individuals carrying CC had an estimated VE against HIV-1 subtype CRF01_AE with lysine at position 169 (169K) of 15%, while individuals carrying CT or TT exhibited a VE against CRF01_AE 169K HIV-1 of 91%. For VE against any HIV-1 subtypes, the individuals carrying CC had VE = 11% compared to VE = 64% for the individuals carrying CT or TT. Overall, the study provided a strong evidence for a role of FcγRs and Fc-mediated Ab function in the protection conferred by the RV144 vaccine regimen against HIV-1 acquisition. However, the functional significance of these associations, such as a causal relationship to alternative splicing or other changes in FcγR expression patterns or levels under different conditions, was not established [2].
Materials and Methods
RNA-seq read mapping, genotypes, expression quantification and eQTL mapping results reported by [7] were downloaded from EBI ArrayExpress (accession E-GEUV-1). As detailed in [7], the mRNA quantification was calculated at different levels. For exon quantification, overlapping exons of a gene were first merged into meta-exons. Transcripts and splice junctions were quantified by the Flux Capacitor approach [8]. Gene quantification was calculated as the sum of all transcript RPKMs (reads per kilobase per million) for each gene. Before eQTL analysis, expression quantifications were normalized by PEER correction [9] and transformed into standard normal distribution. eQTLs were mapped using a linear model in Matrix eQTL [10] and permutations were used for FDR estimation.
For FcγR genes with updated RefSeq annotation, the number of reads uniquely mapped to each gene in each sample was counted using HTSeq [11], and the same read mapping as in [7] was used. The RefSeq annotation was downloaded from UCSC genome browser (https://genome.ucsc.edu, accessed on July 1, 2015). The pairwise LD (both D’ and r2) between the SNP rs114945036 and other SNPs was individually estimated based on the genotypes of the 373 individuals from EUR using PLINK [12]. VCF files were first converted into PLINK format using VCFtools [13]. The similarities between RefSeq annotated FCGR2A and FCGR2C transcript sequences were calculated using Exonerate [14]. Additional statistical analyses including boxplot and Pearson correlation were performed using R (https://www.r-project.org).
Results
FCGR2C polymorphisms associate with FcγR gene expression in human B cells
To investigate the functional consequence of the relevant genetic variants [2], we analyzed a published large-scale B cell RNA sequencing dataset from the 1000 Genomes Project [7]. Lappalainen et al performed mRNA sequencing on lymphoblastoid cell line samples of 462 individuals in an attempt to uncover functional variants in humans at the genome scale [7]. They mapped cis- quantitative trait loci (QTLs) to transcriptome traits of protein coding genes and miRNA genes, separately in the European (EUR, n = 373) and Yoruba (YRI, n = 89) populations. We reasoned that this integrated transcriptome and genome sequencing dataset would allow us to investigate in detail the impact of individual SNPs [2] on gene expression. Interestingly, we found that three of the four SNPs described in [2], rs114945036, rs138747765 and rs78603008, were significantly associated (FDR < 0.05 by Benjamini-Hochberg method [15]) with the expression level of FCGR2A in the EUR populations (Fig 1). The reason that these significant associations were found only in the EUR population may have been due to the larger sample size (373 EUR vs. 89 YRI) in the eQTL dataset and a relatively higher minor allele frequency for all three SNPs in the EUR population (S1 Fig). Especially, all three SNPs were associated with changes in the expression of the last exon of FCGR2A. These three SNPs were in a complete linkage disequilibrium (LD) in the original RV144 study [2], and they were also in a strong LD in EUR (rs114945036 vs. rs138747765: D’ = 1, r2 = 0.83; rs114945036 vs. rs78603008: D’ = 1, r2 = 0.83, where D’ is a standard measure of LD).
FCGR2C polymorphisms likely directly associate with FCGR2C gene expression in human B cells
Since these three SNPs are relatively distant from FCGR2A (over 84 kb away from its transcriptional start site), we investigated if there were any functional variants which were not genotyped in [2] but were in linkage with these three FCGR2C SNPs. A search of all SNPs identified as having a significant association with the FCGR2A’s expression in [7] suggested that was not the case. S2 Fig shows that only a few SNPs that are within or near FCGR2A passed the significance cutoff of FDR < 0.05, and none of these were in LD with the FCGR2C tag SNP rs114945036. In contrast, in addition of the three FCGR2C SNPs, we identified several additional 1000 genomes SNPs in and around FCGR2C that significantly associated with FCGR2A expression, all of which were in linkage with rs114945036 (S1 Table and also see S2 Table). These observations raised the possibility that the newly identified SNPs in the FCGR2C region might directly associate with the expression of FCGR2C itself.
In order to gain understanding of why FCGR2C SNPs were associated with FCGR2A expression in this dataset, we undertook a close examination of the Gencode annotation of FcγR genes, on which the analysis published in [7] was based. This analysis discovered confounding errors in the annotation of FCGR gene transcripts that may indicate that FCGR2C and FCGR2A were not distinguished appropriately (Gencode Team, personal communication, July 3, 2015). For example, several transcripts with long introns read through multiple FcγR genes (S2 Fig), which could severely complicate the quantification of FCGR2C and other involved FcγR genes. Further, the last exon of FCGR2C, which is over 98% identical to the last exon of FCGR2A at the nucleotide level, was apparently missing in the Gencode annotation (S2 Fig). These results suggested the association of FCGR2C SNPs with the expression of FCGR2A could be due to the correlated expression between FCGR2A and FCGR2C or simply to the inability to distinguish these sequences using short read sequence data. To confirm this possibility, we re-calculated the number of reads uniquely mapped to each FcγR gene, but using the canonical RefSeq annotation of FcγR genes shown in S2 Fig. As expected, S3 Fig shows that the expression of FCGR2A and FCGR2C was highly correlated (Pearson correlation r = 0.9). Since there is a high sequence identity between FCGR2A and FCGR2C transcripts (overall ~96% nucleotide identity, S3 Table), it is possible that the correlated expression was driven by certain ambiguities in read assignments, or shared regulatory mechanisms of FcγR gene expression, or both. The former is unlikely because a mappability analysis across the FcγR region showed that there are many regions unique to each of these genes for which their RNA fragments can be unambiguously mapped (data not shown). The complexity of repeating the complete eQTL analysis with updated transcript annotation and the complications of quantifying genes of high sequence identity with short read sequencing data precluded us from directly establishing the association between FCGR2C SNPs and its expression. However, it is evident from these results that FCGR2C SNPs could also directly associate with the expression of FCGR2C itself.
FCRLA polymorphisms associate with FCRLA gene expression in human B cells
As shown in Fig 1, rs138747765, one of the FCGR2C SNPs identified in [2], was also significantly associated with the expression of FCRLA, another FcγR gene not examined in the previous report [2]. In addition, a close examination showed that several SNPs in the FCRLA gene had much stronger associations with the expression of FCRLA (Fig 2A). These SNPs were not in a linkage with the FCGR2C SNP rs138747765 (Fig 2B), and their much stronger associations and closer proximity to FCRLA is consistent with more direct roles as functional variants for FCRLA than rs138747765. Interestingly, the association of rs138747765 with the expression of FCRLA could also be partially explained by the correlation between the expression levels of FCGR2C and FCRLA (Fig 2C). These results suggest that the expression of individual FcγR genes could be influenced by multiple genetic variations within the FcγR region.
Discussion
Recently, Li et al reported that FCGR2C polymorphisms were associated with HIV-1 vaccine protection in the RV144 trial [2], but the functional significance of these polymorphisms was not established. Here, we analyzed a large scale B cell RNA sequencing dataset of 462 individuals, and found that the FCGR2C polymorphisms also associated with the expression of FcγR gene FCGR2A, and very likely FCGR2C itself. In addition, we found that one of these FCGR2C SNPs also associated with the expression of FCRLA, another FcγR gene in the region, which was not examined in the original report [2]. These results show that the expression of FcγR genes is influenced by these polymorphisms, either directly or in linkage with other causal variants.
This work further supports conclusions from previous studies of genetic correlates of vaccine efficacy in the RV144 trial. Three recent studies collectively pointed to a common mechanistic effect of three different gene groups by acting on antibody responses. HLA class I was implicated by a significant HLA A*02 association with vaccine efficacy [3]. That study suggested that the HLA A*02 allele could have influenced antibody production acting through an exongenous pathway linked to both HLA class I and class II antigen presentation [16]. The study by Prentice et al. [6] directly supported a role of HLA class II allelic variants in modulating HIV-1 vaccine-induced antibody responses. A role for the FcγR proteins in antibody responses is evident, and the present study may point to mechanisms affecting FcγR expression and consequent protein function.
Evidence of genetic variants from three distinct polymorphic gene groups collectively pointing to modulation of vaccine induced antibody responses in a single vaccine trial, highlights a major point relevant to vaccine design–host genetic variation can modulate vaccine efficacy. Genetic variants can affect antigen presentation directly as with MHC class I and class II, or by affecting gene expression levels or splicing patterns, thus affecting derivative protein function through quantitative or qualitative mechanisms, as suggested by this study. While HIV-1 vaccine trials undertaken thus far have each used single source vaccines, consideration of genetic factors may provide a novel avenue for HIV-1 vaccine design. The essential detail of host genetics in the RV144 trial that modulated specific responses and improved vaccine efficacy could not only be used to better understand the mechanisms underlying vaccine efficacy, but also in designing multiple variants of the vaccine itself. Modifications of individual vaccines to produce genetic ‘vaccine variants’ that were matched with knowledge of host genetics, could direct vaccines with the highest efficacy to appropriate genetic subpopulations. Indeed, for HIV-1 and other antigenically variable viruses, such an approach may not only be valid but also necessary.
Clearly, additional studies are necessary to establish the causality of these associations. For example, the gene expression data examined was measured in subjects from the European population and we cannot rule out that other genetic differences between EUR and the Thai population in which RV144 trial was conducted also influence expression levels. However, it has been observed that there is significant sharing of eQTL effects between Asian and European populations overall [17]. It was reported that the expression of a FCGR2C allele (FCGR2C-ORF) in B cells enhances humoral response to immunization in mice and to vaccination in a human anthrax vaccine trial [18], but no data was reported on the frequency of the specific allele in Asian populations and the RV144 population was of Thai origin. A close look at the dbSNP shows 100% of the rs759550223 as T–the other allele variant (FCGR2C-STP) at the same position as FCGR2C-ORF was found in all but one sample in the RV144 samples we analyzed [2]. The low frequency of the FCGR2C-ORF allele in East Asian population was also reported previously [19], suggesting that the associations observed here are less likely driven by this particular allele. It is known that FCGR2C is copy number variable, in part due to the FCGR3A or FCGR3B deletions [19–21]. It will be interesting to investigate if and how the deletion alleles contribute to the observed differences in gene expression, as hemizygotes may appear to be homozygotes for FCGR2C SNPs. Since the frequency of these deletion alleles is very low (~5%), this gene dosage effect may be limited.
It is not clear why FCGR2C SNPs and not FCGR2A polymorphisms were associated with RV144 vaccine efficacy, while changes in expression levels of FCGR2A were associated with those SNPs. However, considering the high homology between the FCGR2A and FCGR2C sequences, the RNA-seq data examined does not allow us to rule out changes in FCGR2C expression as well or instead of FCGR2A. Also, little is known about the regulation of the FCGR genes and considering their relative locations within the FCGR gene cluster there is no a priori reason to rule out an effect of FCGR2C localized SNPs on FCGR2A expression. Another potentially relevant factor is expression changes specific to cell subtypes, as FCGRs exhibit distinct expression patterns in different in lymphocyte subsets [22]. Interestingly, a recent comprehensive immunephenotyping analysis of 78,000 immune traits in 699 female twins reported that the genetic variations in the FcγR locus containing FCGR2A, FCGR2B, and FCRLA had the widest range of impacts on immune cell subset frequency and immune cell-surface protein expression levels [23]. The strongest association for this locus was between a SNP in the FCGR2A coding region (rs1801274) and the protein expression of CD32 (FcGR2a and/or FcRG2b) on the surface of inflammatory myeloid dendritic cells. But similar genetic control of CD32 protein expression was not observed in B cells [23]. Though the SNPs examined in [23] were distinct from the FCGR2C SNPs studied here, the observations reported together suggest a broad range of impacts on the phenotypes of different leukocyte subsets. At the same time, these results strongly argue that it would be informative to examine if altered expression of FCGR2C and other FcγR genes such as FCRLA associate with HIV-1 vaccine protection in the RV144 trial and other similar studies and across individual immune cell subsets. Also, it would be useful to re-evaluate the same RV144 ALVAC-HIV-1 plus gp120 HIV-1 vaccine regimen in animal models like humanized mice [24] and non-human primates, especially by matching animals with the corresponding genotypes described here and in [2]. This would firmly recapitulate the relevance of these polymorphisms in conferring protection, and offer new avenues for designing an improved vaccine regimen.
Supporting Information
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work was supported by the National Institutes of Health, Office of Research Infrastructure Programs (http://dpcpsi.nih.gov/orip) [R24OD011172, R24OD010445, P51OD010425 to MGK]; the National Institute of Allergy and Infectious Diseases (https://www.niaid.nih.gov)[1R21AI120713 to XP]; Federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services (https://www.niaid.nih.gov) [HHSN272201300010C, in part]. This work was also supported by Contract #694251 from the Henry Jackson Foundation for the Military HIV Research Program to the Statistical Center for HIV/AIDS Research and Prevention (http://www.hjf.org) [SSL and PBG]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Rerks-Ngarm S, Pitisuttithum P, Nitayaphan S, Kaewkungwal J, Chiu J, Paris R, et al. Vaccination with ALVAC and AIDSVAX to prevent HIV-1 infection in Thailand. N Engl J Med. 2009;361(23):2209–20. Epub 2009/10/22. 10.1056/NEJMoa0908492 [pii]. . [DOI] [PubMed] [Google Scholar]
- 2.Li SS, Gilbert PB, Tomaras GD, Kijak G, Ferrari G, Thomas R, et al. FCGR2C polymorphisms associate with HIV-1 vaccine protection in RV144 trial. J Clin Invest. 2014;124(9):3879–90. Epub 2014/08/12. 10.1172/JCI75539 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gartland AJ, Li S, McNevin J, Tomaras GD, Gottardo R, Janes H, et al. Analysis of HLA A*02 association with vaccine efficacy in the RV144 HIV-1 vaccine trial. J Virol. 2014;88(15):8242–55. Epub 2014/05/16. 10.1128/JVI.01164-14 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rolland M, Edlefsen PT, Larsen BB, Tovanabutra S, Sanders-Buell E, Hertz T, et al. Increased HIV-1 vaccine efficacy against viruses with genetic signatures in Env V2. Nature. 2012;490(7420):417–20. Epub 2012/09/11. 10.1038/nature11519 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Liao HX, Bonsignori M, Alam SM, McLellan JS, Tomaras GD, Moody MA, et al. Vaccine induction of antibodies against a structurally heterogeneous site of immune pressure within HIV-1 envelope protein variable regions 1 and 2. Immunity. 2013;38(1):176–86. Epub 2013/01/15. 10.1016/j.immuni.2012.11.011 S1074-7613(12)00552-3 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Prentice HA, Tomaras GD, Geraghty DE, Apps R, Fong Y, Ehrenberg PK, et al. HLA class II genes modulate vaccine-induced antibody responses to affect HIV-1 acquisition. Sci Transl Med. 2015;7(296):296ra112 Epub 2015/07/17. 10.1126/scitranslmed.aab4005 7/296/296ra112 [pii]. . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lappalainen T, Sammeth M, Friedlander MR, t Hoen PA, Monlong J, Rivas MA, et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 2013;501(7468):506–11. Epub 2013/09/17. 10.1038/nature12531 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Montgomery SB, Sammeth M, Gutierrez-Arcelus M, Lach RP, Ingle C, Nisbett J, et al. Transcriptome genetics using second generation sequencing in a Caucasian population. Nature. 2010;464(7289):773–7. Epub 2010/03/12. 10.1038/nature08903 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Stegle O, Parts L, Durbin R, Winn J. A Bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies. PLoS Comput Biol. 2010;6(5):e1000770 Epub 2010/05/14. 10.1371/journal.pcbi.1000770 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Shabalin AA. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics. 2012;28(10):1353–8. Epub 2012/04/12. 10.1093/bioinformatics/bts163 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31(2):166–9. Epub 2014/09/28. 10.1093/bioinformatics/btu638 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. Epub 2007/08/19. doi: S0002-9297(07)61352-4 [pii] 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–8. Epub 2011/06/10. 10.1093/bioinformatics/btr330 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Slater GS, Birney E. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics. 2005;6:31. Epub 2005/02/17. doi: 1471-2105-6-31 [pii] 10.1186/1471-2105-6-31 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. JRoyStatistSocSer. 1995;B(57):289–300. [Google Scholar]
- 16.Goodridge JP, Lee N, Burian A, Pyo CW, Tykodi SS, Warren EH, et al. HLA-F and MHC-I open conformers cooperate in a MHC-I antigen cross-presentation pathway. J Immunol. 2013;191(4):1567–77. Epub 2013/07/16. 10.4049/jimmunol.1300080 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Stranger BE, Montgomery SB, Dimas AS, Parts L, Stegle O, Ingle CE, et al. Patterns of cis regulatory variation in diverse human populations. PLoS Genet. 2012;8(4):e1002639 Epub 2012/04/26. 10.1371/journal.pgen.1002639 PGENETICS-D-11-00883 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Li X, Wu J, Ptacek T, Redden DT, Brown EE, Alarcon GS, et al. Allelic-dependent expression of an activating Fc receptor on B cells enhances humoral immune responses. Sci Transl Med. 2013;5(216):216ra175 Epub 2013/12/20. 10.1126/scitranslmed.3007097 5/216/216ra175 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Machado LR, Hardwick RJ, Bowdrey J, Bogle H, Knowles TJ, Sironi M, et al. Evolutionary history of copy-number-variable locus for the low-affinity Fcgamma receptor: mutation rate, autoimmune disease, and the legacy of helminth infection. Am J Hum Genet. 2012;90(6):973–85. Epub 2012/05/23. 10.1016/j.ajhg.2012.04.018 S0002-9297(12)00251-0 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Breunis WB, van Mirre E, Geissler J, Laddach N, Wolbink G, van der Schoot E, et al. Copy number variation at the FCGR locus includes FCGR3A, FCGR2C and FCGR3B but not FCGR2A and FCGR2B. Hum Mutat. 2009;30(5):E640–50. Epub 2009/03/25. 10.1002/humu.20997 . [DOI] [PubMed] [Google Scholar]
- 21.Hollox EJ, Hoh BP. Human gene copy number variation and infectious disease. Hum Genet. 2014;133(10):1217–33. Epub 2014/08/12. 10.1007/s00439-014-1457-x . [DOI] [PubMed] [Google Scholar]
- 22.Bruhns P. Properties of mouse and human IgG receptors and their contribution to disease models. Blood. 2012;119(24):5640–9. Epub 2012/04/27. 10.1182/blood-2012-01-380121 [pii]. . [DOI] [PubMed] [Google Scholar]
- 23.Roederer M, Quaye L, Mangino M, Beddall MH, Mahnke Y, Chattopadhyay P, et al. The genetic architecture of the human immune system: a bioresource for autoimmunity and disease pathogenesis. Cell. 2015;161(2):387–403. Epub 2015/03/17. 10.1016/j.cell.2015.02.046 S0092-8674(15)00247-0 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bournazos S, Klein F, Pietzsch J, Seaman MS, Nussenzweig MC, Ravetch JV. Broadly neutralizing anti-HIV-1 antibodies require Fc effector functions for in vivo activity. Cell. 2014;158(6):1243–53. Epub 2014/09/13. 10.1016/j.cell.2014.08.023 S0092-8674(14)01055-1 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.