Abstract
Background:
Control of the COVID-19 pandemic will rely on SARS-CoV-2 vaccine-elicited antibodies to protect against emerging and future variants; an understanding of the unique features of the humoral responses to infection and vaccination, including different vaccine platforms, is needed to achieve this goal.
Methods:
The epitopes and pathways of escape for Spike-specific antibodies in individuals with diverse infection and vaccination history were profiled using Phage-DMS. Principal component analysis was performed to identify regions of antibody binding along the Spike protein that differentiate the samples from one another. Within these epitope regions we determined potential escape mutations by comparing antibody binding of peptides containing wildtype residues versus peptides containing a mutant residue.
Results:
Individuals with mild infection had antibodies that bound to epitopes in the S2 subunit within the fusion peptide and heptad-repeat regions, whereas vaccinated individuals had antibodies that additionally bound to epitopes in the N- and C-terminal domains of the S1 subunit, a pattern that was also observed in individuals with severe disease due to infection. Epitope binding appeared to change over time after vaccination, but other covariates such as mRNA vaccine dose, mRNA vaccine type, and age did not affect antibody binding to these epitopes. Vaccination induced a relatively uniform escape profile across individuals for some epitopes, whereas there was much more variation in escape pathways in in mildly infected individuals. In the case of antibodies targeting the fusion peptide region, which was a common response to both infection and vaccination, the escape profile after infection was not altered by subsequent vaccination.
Conclusions:
The finding that SARS-CoV-2 mRNA vaccination resulted in binding to additional epitopes beyond what was seen after infection suggests protection could vary depending on the route of exposure to Spike antigen. The relatively conserved escape pathways to vaccine-induced antibodies relative to infection-induced antibodies suggests that if escape variants emerge, they may be readily selected for across vaccinated individuals. Given that the majority of people will be first exposed to Spike via vaccination and not infection, this work has implications for predicting the selection of immune escape variants at a population level.
INTRODUCTION
The future of the COVID-19 pandemic will be determined in large part by the ability of vaccine-elicited immunity to protect against current and future variants of the SARS-CoV-2 virus. Several vaccines have now been approved for use in multiple countries, including two that are based on mRNA technology: BNT162b2 (Pfizer/BioNTech) and mRNA-1273 (Moderna). In the United States, over half of adults are now vaccinated against SARS-CoV-2, the majority of whom have received one of these mRNA vaccines. While these vaccines have been shown to effectively guard against infection, severe disease, and death related to SARS-CoV-21–7, less is known about how effective they will be against emerging and future variants. Current surges in the Delta variant coupled with reports of reduced potency of vaccine elicited antibodies against this variant highlight this concerning ongoing dynamic8,9. Evidence from related endemic coronaviruses indicates that evolution in the Spike protein results in escape from neutralizing antibodies elicited by prior infection10, potentially contributing to why endemic coronaviruses can reinfect the same host11–13. Without immunity that is robust in the face of antigenic drift, continual updates of the vaccine to combat new SARS-CoV-2 variants will likely be necessary to provide optimal protection against symptomatic infection.
Prior infection with SARS-CoV-2 also provides some immunity against subsequent re-infection, and several studies have characterized the epitopes targeted by convalescent sera14–18. It is currently unknown whether SARS-CoV-2 infection and vaccination result in antibodies that bind to similar epitopes, an important point to consider given that most people have acquired antibodies through immunization and not infection. The Spike protein encoded by the mRNA in both SARS-CoV-2 vaccines is stabilized in the prefusion conformation by addition of two proline substitutions19. This change in sequence and fixed conformation of the Spike protein could result in altered antibody targeting when compared to antibodies elicited during infection, where Spike undergoes several conformational changes. It is also possible that differences in antibody specificity could be due to the amount of antigen or type of immune response stimulated in the context of infection versus vaccination. We know that vaccines drive higher neutralization titers and more Spike binding IgG antibodies than infection20–22, indicating some differences in the B cell response compared to infection. A recent study showed that antibodies against the receptor binding domain (RBD) of Spike differ between infected and vaccinated individuals; they are generally less sensitive to mutation and bind more broadly across the domain in the context of vaccination as compared to infection23.
Although the majority of the serum binding response in SARS-CoV-2 infected and vaccinated people is directed towards regions of the protein outside of the RBD epitopes15,23–25, few studies have examined the prevalence and escape pathways of these antibodies, especially in the setting of vaccination. Antibodies to linear epitopes in the S2 domain of Spike overlapping the fusion peptide (FP), and in the stem helix region just upstream of heptad repeat 2 (SH-H) region are found in serum from COVID-19 patients, and some studies suggest these antibodies may be neutralizing26,27. These non-RBD responses may also be important contributors to non-neutralizing antibody activities, which have been associated with protection and therapeutic benefit in experimental SARS-CoV-2 models and with vaccine protection28–31. Importantly, these epitopes lie in more conserved regions of Spike than RBD where functional constraints on variation may counter the selective pressure for viral escape.
To compare antibody immunity elicited by SARS-CoV-2 infection and vaccination, we used a high-resolution Spike-specific deep mutational scanning phage display library to profile the epitopes and sites of escape for serum antibodies from people who had been infected, vaccinated, or a combination of both. This approach, called Phage-DMS, identified four non-RBD antibody binding epitopes across all samples: the FP and SH-H region in the S2 subunit, and the N-terminal and C-terminal domains (NTD and CTD, respectively) in the S1 subunit of Spike. Antibodies to NTD and CTD were uniquely present in the setting of mRNA vaccination or severe infection, but mostly absent in mild COVID-19 cases. In vaccinated individuals, the magnitude of the response varied over time both to the CTD and SH-H epitopes. Other covariates, such as age, dose, and vaccine type had no significant differences in the binding profiles observed. Of particular relevance to protection against emerging variants, infection and vaccination appear to shape the pathways of escape differently in different epitopes. In the FP epitope, which is a dominant response after infection, the escape pathway was maintained after subsequent vaccination; in the SH-H epitope, infection resulted in antibodies with diverse pathways of escape, whereas vaccination induced a highly uniform escape profile across individuals. Overall, these findings indicate that vaccination induced a broader antibody response across the Spike protein but induced a singular antibody response at the SH-H epitope, which could favor variants that emerge with these mutations.
RESULTS
Samples from individuals with varying SARS-CoV-2 infection and mRNA vaccination histories profiled using high resolution Spike Phage-DMS library
We collected serum samples from two cohorts, termed the Moderna Trial Cohort and the Hospitalized or Ambulatory Adults with Respiratory Viral Infections (HAARVI) Cohort24,32. The Moderna Trial Cohort were participants in a Phase 1 trial and consisted of 49 individuals, 34 who received the 100 μg dose of mRNA-1273 (Moderna) and 15 who received the 250 μg dose. Serum samples were taken at days 36 and 119 post first dose (7 and 90 days post second dose, respectively32. Serum samples were taken at days 36 and 119 post first dose (7 and 90 days post second dose, respectively)32. The HAARVI Cohort included 64 individuals, 44 who had confirmed SARS-CoV-2 infection and 20 who had no reported infection; among this group, 44 were also vaccinated. Those with infection history were stratified by severity based on hospitalization status (39 non-hospitalized/mild vs. 5 hospitalized/severe) and serum was sampled at timepoints ranging from 8 to 309 days post symptom onset. Of these 44 individuals, 24 were also sampled after vaccination with two doses of either mRNA-1273 (Moderna, n=8) or BNT162b2 (Pfizer/BioNTech, n=15), with 23 from the non-hospitalized group and 1 from the hospitalized group. All 20 SARS-CoV-2 naïve individuals were sampled post-vaccination, with 18 having an additional sample taken pre-vaccination (0 to 98 days). Post-vaccination timepoints for all naïve and convalescent individuals ranged from 23 to 65 days after the first dose (5 to 42 days after the second dose, respectively). Figure 1 provides an illustration of the two cohorts and their respective samples’ infection and vaccination statuses. Additional details are available in Supplementary Table 1.
We used a previously described Spike Phage-DMS library to profile the epitopes bound by serum antibodies in the samples described above24. This library consists of peptides displayed on the surface of T7 bacteriophage that are 31 amino acids long, tiling across the length of Spike in one amino acid increments. Peptides in the library correspond to the wild-type Wuhan Hu-1 Spike sequence as well as sequences that contain every possible single amino acid mutation at the central position of the peptide. Serum samples were screened with this library by performing immunoprecipitation (IP) followed by sequencing of the pool of phage enriched by the serum antibodies as previously described24,33,34.
Serum antibodies bind to distinct epitopes in infected and vaccinated individuals
We first examined the wild-type peptides in the Spike Phage-DMS library that were enriched by each serum sample to determine the epitopes bound by antibodies in each sample from these cohorts (Figure 2A). The major targeted epitopes across all the cohorts were in the NTD, CTD, FP, and SH-H regions. Serum from non-vaccinated infected individuals who were not hospitalized mostly bound to immunodominant epitopes in the FP and SH-H, both of which are epitopes previously identified in infected individuals using Phage-DMS24. Samples from hospitalized/severe COVID-19 cases and vaccinated individuals also bound to the FP and SH-H regions, but additionally bound to epitopes within the NTD and CTD regions. In naïve serum samples there were antibodies that occasionally bound to the FP and SH-H peptides. These findings likely reflect that some individuals have preexisting cross-reactive antibodies that bind to these conserved regions between SARS-CoV-2 and endemic coronaviruses, as suggested by previous studies16,18.
A Principal Component Analysis (PCA) was used to further investigate differences between the infected and/or vaccinated groups. This analysis indicated that binding to epitopes in the NTD, CTD, FP, and SH-H regions were driving differences between samples (Figure 2B). To quantify differences in antibody binding between groups, for each sample we summed together the enrichment values within each identified epitope region and performed pairwise comparisons between non-hospitalized infected people and all other groups (Figure 2C). Most strikingly, we found non-trivial group differences in the magnitude of humoral responses to these major epitopes on the Spike protein. Specifically, antibodies from both hospitalized infected and vaccinated individuals had significantly higher binding to the NTD, CTD, and SH-H regions compared to non-hospitalized infected individuals. However, antibodies from non-hospitalized infected individuals displayed significantly higher binding to the FP epitope than samples from hospitalized or vaccinated individuals. There was no significant difference in any epitope binding in these four regions between vaccinated samples with and without prior infection (p>0.05, Mann-Whitney-Wilcoxon [M.W.W.]).
Effect of age, dose, vaccine type, and timepoint on epitope binding
In order to determine if there were covariates that contributed to differences in antibody binding, we examined the effect of participant age, vaccine dose and type, and timepoint post infection or vaccination on binding to the four epitopes identified above (Figure 3). For samples in the Moderna Trial Cohort, there was significantly decreased binding to the CTD epitope and SH-H epitope (p=0.008, p=0.011, Wilcoxon rank-sum test with Bonferroni correction) at the later timepoint post first dose (day 119) compared to the earlier timepoint (day 36) (Figure 3A). To examine the effect of dosage, we compared 100 ug and 250 ug mRNA-1273 groups for those between the age of 18 to 55, as that was the only age group included for the 250 ug dose (Figure 3B). There was no significant difference by vaccine dosage for any of the four epitope regions (NTD, CTD, FP, or SH-H). Participant age was also examined as a variable; there appeared to be a difference in epitope binding in the SH-H region, but this did not survive multiple testing correction (Figure 3C).
In infected individuals, the effect of time post symptom onset on epitope binding was examined using non-hospitalized infected individuals in the HAARVI Cohort, who were sampled between 26 and 309 days post symptom onset (Supplemental Figure 2A). Samples were binned into three groups: 0–60, 60–180, and 180–360 days post symptom onset. At all times post symptom onset there was no significant difference in binding to the four identified epitopes (p>0.05, M.W.W.). Individuals in the HAARVI Cohort were given either the Moderna mRNA-1273 or Pfizer/BioNTech BNT162b2 mRNA vaccine, and comparison of the epitope binding response between the two vaccine types revealed no significant differences in all epitope regions (Supplemental Figure 2B, p>0.05, M.W.W.).
Infection and vaccination shape pathways of escape
The Spike Phage-DMS library contains peptides with every possible single amino acid substitution in addition to the wild-type sequence, enabling us to assay the impact of mutations on antibody binding. The effect of site-specific substitutions in critical antibody binding regions not only provides a high-resolution picture of the likely epitope intervals, but also identifies mutations that confer escape within the binding region. The effect of each mutation on serum antibody binding was quantified by calculating its scaled differential selection value, a metric that reports log fold change of mutant binding affinity over wild-type binding affinity at any given site (see Methods)33. Site mutations that cause a loss of binding when compared to the wild-type peptide centered at that same site are reported as having negative differential selection values, whereas those that bind better than the wild-type peptide have positive differential selection values. In order for differential selection to be meaningful, however, we must ensure that we do not include weak or sporadic signals that may be due to non-specific binding. Accordingly, we set a threshold of summed wild-type peptide binding in any one region. By doing so, we lose samples in the analysis but can be confident in the results presented by samples passing this curation step (Supplemental Figure 3). For samples that passed this threshold, we compared the effect of prior infection and/or time post vaccination on the pathways of escape in each epitope region as follows. Plots depicting the effect of mutations for all samples are publicly explorable at https://github.com/matsengrp/vacc-dms-view-host-repo.
N-Terminal Domain (NTD) and C-terminal Domain (CTD)
We examined the sites of escape within the NTD and CTD epitope regions, focusing on vaccinated individuals from the Moderna Trial Cohort because these epitopes were notable targets of the vaccine response and not commonly found in infected individuals. Vaccination elicited antibodies with a strikingly uniform escape profile in the NTD epitope across samples (Figure 4A), with the majority of samples being sensitive to mutation at sites 291, 294–297, 300–302, and 304, which are in the very C terminal portion of NTD as well as the region between NTD and RBD. The CTD region appeared to consist of multiple epitopes, the dominant being located at the N-terminal region between positions 545 to 580 (termed CTD-N). Antibodies that bound to this dominant CTD epitope had a less uniform escape profile, but sites 561 and 562 were common sites of escape in most samples (Figure 4B). For antibodies to both the NTD and CTD-N epitopes, the pathways of escape tended to drift over time and were different at 119 days post vaccination as compared to 36 days post vaccination.
Fusion Peptide (FP)
Antibodies against the FP epitope are strongly stimulated after infection but are less strongly induced after subsequent vaccination (Figure 2). Thus, we investigated whether the pathways of escape for serum antibodies also changed after vaccination within samples from previously infected individuals in the HAARVI Cohort. The escape profiles of antibodies in paired samples that strongly bound to the FP epitope both after infection and after subsequent vaccination are shown as a logo plot (Figure 5A). The major sites of escape within the FP epitope for these samples were sites 819, 820, 822, and 823, and these sites of escape did not appear to change after vaccination although we noted there was more variability in the escape profiles after vaccination.
We next examined the pathways of escape for FP binding antibodies in vaccinated individuals from the Moderna Trial Cohort. In people with no prior infection, vaccination induced diverse pathways of escape in the FP region (Figure 5B). For example, for participant M10 escape was focused on sites 814, 816, and 818, whereas for participant M38 escape was focused on 819, 820, and 823. There appeared to be some differences in the escape profile at 119 days as compared to 36 days post vaccination, as exemplified by participants M15, M17, and M20. However, in general many of the major sites of escape were shared at both timepoints within each individual and as a group.
Stem Helix-Heptad Repeat 2 (SH-H)
In order to determine the effect of prior infection on the binding profiles of antibodies after vaccination within the SH-H epitope region, we explored the pathways of escape for paired samples from patients with prior infection in the HAARVI Cohort before and after vaccination as was done for the FP region. Samples from previously infected individuals with no vaccination history had diverse pathways of escape within the SH-H epitope (Figure 6A). For example, site 1149 was only sensitive to mutation for participant 217C, and site 1157 was only sensitive to mutations for participants 120C and 146C. In contrast, the samples from vaccinated individuals, regardless of infection history, tended to have a uniform pathway of escape. The most prominent and consistent sites of escape for vaccinated individuals, both with and without prior infection, were at sites 1148, 1152, 1155 and 1156. Of note, the pre-vaccination sample from an individual with prior infection requiring hospitalization (participant 6C) displayed an escape profile highly similar to those from vaccinated individuals, and this escape profile did not change after vaccination.
To see whether the pathways of escape changed over time after vaccination, we visualized the escape mutations within the SH-H epitope for the samples in the Moderna Trial Cohort at 36 and 119 days after the first dose of vaccine (Figure 6B). We saw a highly uniform pattern of escape for most samples at day 36 and 119, again with escape mainly occurring at sites 1148, 1152, 1155 and 1156. For some participants, such as M11, M34, and M35, the escape mutations appeared to drift over time, but the major sites of escape remained the same.
DISCUSSION
In this study, we comprehensively profiled the antibody response to the SARS-CoV-2 Spike protein, including pathways of escape from sera in individuals with diverse infection and vaccination histories. We identified four major targets of antibody responses outside of the core RBD domains, in the NTD, CTD, FP, and stem helix-HR2 regions. Vaccinated individuals as well as individuals with severe infection requiring hospitalization both had antibodies to these four epitope regions, whereas individuals with mild infection that did not require hospitalization preferentially targeted only FP and SH-H. In previously infected cases, the epitope binding patterns changed over time after vaccination, with decreased binding to both the CTD and SH-H epitopes. However, there was not uniform decay across all four epitopes, indicating that waning antibody titers may not occur for all epitopes equally. Other factors such as vaccine dose (100 ug or 250 ug), vaccine type (BNT162b2 or mRNA-1273), and participant age did not significantly affect the specificity of the antibody response.
We explored the pathways of escape for antibodies binding to these key regions in infected and vaccinated people. We defined for the first time the escape pathways for NTD and CTD-N binding antibodies, epitopes that were commonly found in vaccinated individuals but not in infected individuals. In the case of the NTD epitope, which was located at the C-terminal end of the NTD, escape mutations were uniform and consistent amongst vaccinees, while pathways of escape were more diverse for CTD-N antibodies. Individuals with antibodies that strongly bound the FP epitope had focused escape profiles, with the majority of escape occurring at sites 819, 820, 822, and 823, although the sample size of this group is small (N=3). Vaccination did not greatly alter the escape profile in previously infected individuals, nor did vaccination alone induce a strong or uniform response at the FP epitope. In contrast, antibodies that bind in the SH-H epitope region after infection have diverse pathways of escape, while after vaccination they appear to converge on a more uniform pathway of escape that includes mutations at sites 1148, 1152, 1155 and 1156. Interestingly, these are also the sites of contact for a cross-reactive HR2-specific antibody isolated from a mouse sequentially immunized with the MERS and SARS Spike proteins35. This hints that a singular antibody clonotype could be elicited when exposed to a stabilized Spike protein, dominating the response in the SH-H region.
We also observed some drift in the pathways of escape within a single person over time after vaccination. This mirrors findings from a recent study that examined sites of escape for RBD-specific antibodies in serum samples from the same Moderna Trial Cohort as used in this study23. Together these results suggest that the B cell response after vaccination with Spike mRNA continues to evolve over time. Multiple studies have demonstrated that SARS-CoV-2-specific B cells undergo continued somatic hypermutation in the months after infection, likely due to antigen persistence36,37. Spike antigen has been detected in the lymph nodes at least 3 months after vaccination with BNT162b2, and continued maturation of germinal center B cells could be a possible explanation for the changes in epitope binding we observed38. Alternatively, turnover of short-term plasma cells and memory B cells could account for loss of antibody binding to certain epitopes.
Our study has important limitations worth noting. Because the Spike Phage-DMS library displays 31 amino acid peptides, we are unable to detect antibodies that bind to conformational epitopes and/or glycosylated epitopes. This is demonstrated by the lack of observable binding to the RBD region, a domain with complex folding and known target of antibodies from infected and vaccinated individuals. However, prior studies of RBD epitopes have already been reported using an overlapping set of samples from the HAARVI Cohort, however, and together these results paint a more complete picture of epitopes across the Spike protein15,23. Finally, we only have 5 individuals within the hospitalized group and this small sample size limits our ability to make conclusions about epitope binding in those with severe infection.
Our finding that vaccinated individuals have a broader response across the Spike protein than infected individuals may have important implications for immune durability against future SARS-CoV-2 variants. Evidence suggests that a polyclonal antibody response that is resistant in the face of multiple mutations is necessary for long-lasting immunity against a mutating viral pathogen39. Thus, the polyclonal response to vaccination may provide greater protection from infection than the more focused response after infection. However, the number of epitopes targeted provides just one benchmark and the ability to escape at the population level could also be influenced by the diversity of individuals’ antibody responses at each epitope and thus the likelihood that a single escape mutation could be widely selected. At one S2-domain epitope region (SH-H) vaccination induced uniform sites of escape that may be due to a singular type of antibody that would allow escape by the same mutations for all vaccinated people. However, epitopes in the S2 domain tend to be in highly conserved regions with important functions that constrain the virus’ ability to mutate, making escape from these antibodies less likely than for RBD, where escape is already common. Indeed, mutations in the FP and SH-H epitopes are not arising in the global population of SARS-CoV-224, providing some suggestion that these regions may be constrained40,41. Overall, further studies of the functional capacity of these vaccine-elicited antibodies targeting epitopes outside of RBD are warranted, to provide a path towards a polyclonal response to epitopes across the full Spike protein. This comprehensive view may further the goal of a more universal coronavirus vaccine that eliminates the need for continual updates of the SARS-CoV-2 vaccine strain due to mutations in variable regions on Spike.
MATERIALS AND METHODS
Sample collection
Moderna Trial Cohort
We obtained post-vaccination serum samples via the National Institute of Allergy and Infection Disease that were taken as part of a phase I clinical trial testing the safety and efficacy of the Moderna mRNA-1273 vaccine (NCT04283461)32. All samples were de-identified and thus all work was approved by the Fred Hutchinson Cancer Research Center Institutional Review Board as nonhuman subjects research. Trial participants were given either 100 ug or 250 ug doses of the mRNA-1273 vaccine, and serum was sampled from all trial participants at 36 days and 119 days post vaccination. See Table 1 for detailed metadata related to each participant and serum sample.
HAARVI Cohort
We obtained plasma samples from individuals enrolled in the Hospitalized or Ambulatory Adults with Respiratory Viral Infections (HAARVI) study conducted in Seattle24. Individuals were either enrolled upon PCR confirmed diagnosis with SARS-CoV-2 infection or as control subjects prior to receiving vaccination with either BNT162b2 (Pfizer/BioNTech) or mRNA-1273 (Moderna). See Table 1 for detailed metadata related to each participant and plasma sample. For convenience, all plasma and serum samples in this study are referred to as serum. This study was approved by the University of Washington Institutional Review Board.
Spike Phage-DMS assay
The Spike Phage-DMS library used in this study contained 24,820 designed peptides that tile across the length of the Spike protein. Peptides are each 31 amino acids long and tile by 1 amino acid increments, and correspond to either the wild-type sequence or a sequence containing a single mutation. Serum samples were profiled using the Spike Phage-DMS library as previously described24. Following this method, the Spike Phage-DMS library was diluted in Phage Extraction Buffer (20 mM Tris-HCl, pH 8.0, 100 mM NaCl, 6 mM MgSO4) to a concentration of 2.964 × 109 plaque forming units/mL, which corresponds to approximately 200,000-fold coverage of each peptide. 10 uL of serum or plasma was added to 1 mL of the diluted library and incubated in a deep 96-well plate overnight at 4°C on a rotator. 40uL of a 1:1 mixture of Protein A and Protein G Dynabeads (Invitrogen) were added to each well and then incubated at 4°C for 4 hours on a rotator. Beads bound to the antibodyphage complex were magnetically separated and washed 3x with 400 uL wash buffer (150 mM NaCl, 50 mM Tris-HCl, 0.1% [vol/vol] NP-40, pH 7.5). Beads were resuspended in 40 uL of water and lysed at 95°C for 10 minutes. The diluted Spike Phage-DMS library was also lysed to capture the starting frequencies of peptides. All samples were run twice, once each with two independently generated Spike Phage-DMS libraries.
DNA from lysed samples were amplified and sequenced as previously described24. Two rounds of PCR were performed using Q5 High-Fidelity 2X Master Mix (NEB). For the first round of PCR, 10uL of lysed phage was used as the template in a 25 uL reaction using primers described in24. For the second round of PCR, 2 uL of the round 1 PCR product was then used as the template in a 50 uL reaction, with primers that add dual indexing sequences on either side of the insert. PCR products were then cleaned using AMPure XP beads (Beckman Coulter) and eluted in 50 uL water. DNA concentrations were quantified via Quant-iT PicoGreen dsDNA Assay Kit (Invitrogen). Equimolar amounts of DNA from the samples, along with 10X the amount of the input library samples, was pooled, gel purified, and the final library was quantified using the KAPA Library Quantification Kit (Roche). Pools were sequenced on an Illumina HiSeq 2500 machine using the rapid run setting with single end reads.
Sample curation and replicate structure
All sample IP’s and downstream analysis were run in duplicate across two separate phage display library batches to ensure reproducibility, with the exception of the four acute samples from hospitalized HAARVI participants which were run in singlicate. All results were cross checked with the set of batch replicates to ensure significance fell within one order of magnitude where applicable. For brevity, we present only figures resulting from the single complete set of batch-specific replicates, however, all figures using the second set of library batch replicates are available (see Code and Data Availability). Additionally, some samples were run with “in-line” technical replicates within the same batch. In the case with more than one technical replicate, we selected the sample with the highest reads mapped from each set of batch replicates for our downstream analysis.
Short read alignment and peptide counts processing
Samples were aliquoted and sequenced targeting 10X coverage of total sample reads to the peptide library reference. We demultiplexed samples using Illumina MiSeq Reporter software. Post sample demultiplexing, we used a Nextflow pipeline to process the peptide counts as well as alignment stats for all samples42. The tools and parameters describing the workflow are as follows. The index creation and short-read alignment step were done using Bowtie2. During alignment we allowed for zero mismatches in the default seed length of each read (20, very sensitive) after trimming 32 bases from the 3’ end of each 125bp read to match the 93 bp peptides in our reference library43. Samtools was subsequently used to gather sequencing statistics as well as produce the final peptide counts using the stats and idxstats modules. Finally, the pipeline collected all reference peptide alignment counts and merges them into a single xarray dataset coupling sample and peptide metadata with their respective count. Alignment stats for all replicates are seen in Supplementary Figure S6.
Each of the processing steps described here, as well as downstream analysis and plotting, were run using static and freely available Docker containers for reproducibility. We provide an automated workflow and the configuration scripts defining exact parameters. See Code and Data Availability section for more information.
Epitope binding region identification
Principal Component Analysis (PCA) via Singular Value Decomposition (SVD) was performed on each set of batch replicates using the scikit-learn package44. We first subset our dataset to only include wildtype peptide count enrichments from either infected or vaccinated individuals as input. This curation resulted in the matrix, X of size n × p with n biologically distinct replicates and p enrichment features across the spike protein. All enrichment values were calculated as a fold change in the frequency for any one sample enrichment over the library control enrichment at the same sites. Each feature was mean centered before performing the PCA such that the covariance matrix of X is equivalent to XTX/(n – 1). We can then use the eigendecomposition, X = USVT, to describe the data. The principal axes in feature space are then represented by the columns of V and represent the direction of maximum variance in the data. Figure S1 shows three facets of this decomposition; Figure S1A: the unit scaled sample “scores” represented by the columns to visualize sample relationship in principal component space; Figure S1B: component loadings (scaled by the square root of the respective eigenvalues in S); and Figure S1C: the first three principal axes/directions in feature space plotted as a function of the WT peptide feature location on the Spike protein. Together, these provide a visualization of key features in the data used in our downstream analysis. We chose our epitope regions as contiguous regions of nonzero value in the loadings in the first three principal axes.
Identifying high-resolution pathways of escape
In order to ensure reliable measurements of differential selection of single AA variants compared to the ancestral sequence variant, we threw out samples whose respective sum of wild-type enrichment was below a threshold set for each of the defined binding regions (Figure S4). Once curated, we computed the log-fold change in each of the 19 possible variant substitutions at each site. This metric was then scaled by the average of the wild-type sequence enrichment coupled with both the preceding and following wild-type peptide enrichments at any given site. To evaluate escape at each site, we then sum the differential selection metric as described for each variant at a site to examine a more complete picture of the data defining escape patterns in each sample group.
CODE AND DATA AVAILABILITY
We provide a fully reproducible automated workflow which ingests raw sequencing data and performs all analyses presented in the paper. The workflow defines and runs the processing steps within publicly available and static Docker software containers, including phippery and phip-flow described in the Methods section. The source code, Nextflow script, software dependencies, and instructions for re-running the analysis can be found at https://github.com/matsengrp/phage-dms-vacc-analysis.
The generalized PhIP-Seq alignment and count generation pipeline script can be found at https://github.com/matsengrp/phip-flow. A template and documentation for the alignment pipeline configuration is available at https://github.com/matsengrp/phip-flow-template. Finally, we provide a python API, phippery, to query the resulting dataset post-alignment that can be found at https://github.com/matsengrp/phippery.
All raw sequencing data was submitted to the NCBI SRA under PRJNA765705. Pre-processed enrichment data is available upon request. Additionally, differential selection data and more can be explored interactively using the dms-view toolkit available at https://github.com/matsengrp/vacc-dms-view-host-repo.
For more information regarding code and data availability, please email jgallowa@fredhutch.org. For original data from the NIH Moderna trial please see Jackson et al32, and for information on the HAARVI cohort please contact HYC.
STATISTICAL ANALYSIS
Estimates of significance presented between group continuous distributions of wild-type enrichment were reported using a Mann-Whitney Wilcoxon test with the exception of analysis that included only paired longitudinal samples - such as the comparison of 36-and-119 Days post-vaccination - in this case we used a Wilcoxon signed-rank test. Bonferroni correction was applied where applicable and adjusted P values < 0.05 were presented as significant. All statistical analysis were done using Python 3.6 and plotted using the statannot package found here https://github.com/webermarcolivier/statannot. The static Docker container used for all statistical analysis is publicly hosted at https://quay.io/repository/matsengrp/vacc-ms-analysis.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Kevin Sung, Thayer Fisher, and Noah Simon for providing advice regarding computational analyses. We thank Cassie Sather and the Genomics core facility for assistance with sequencing. We thank Laura Jackson (Kaiser Permanente), Chris Roberts, Catherine Luke, and Rebecca Lampley [National Institute of Allergy and Infectious Diseases (NIAID), NIH] for assistance with obtaining the mRNA-1273 phase 1 trial vaccine samples. We also thank all research participants and study staff of the Hospitalized or Ambulatory Adults with Respiratory Viral Infections (HAARVI) study. This work was supported by NIH grants AI138709 (PI Overbaugh) and AI146028 (PI Matsen). Julie Overbaugh received support as the Endowed Chair for Graduate Education (FHCRC). The research of Frederick Matsen was supported in part by a Faculty Scholar grant from the Howard Hughes Medical Institute and the Simons Foundation. Scientific Computing Infrastructure at Fred Hutch was funded by ORIP grant S10OD028685.
Competing Interests
H.Y.C. reported consulting with Ellume, Pfizer, The Bill and Melinda Gates Foundation, Glaxo Smith Kline, and Merck. She has received research funding from Gates Ventures, Sanofi Pasteur, and support and reagents from Ellume and Cepheid outside of the submitted work.
CITATIONS
- 1.Polack F. P. et al. Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine. N Engl J Med 383, 2603–2615, doi: 10.1056/NEJMoa2034577 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Keehner J. et al. SARS-CoV-2 Infection after Vaccination in Health Care Workers in California. N Engl J Med 384, 1774–1775, doi: 10.1056/NEJMc2101927 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Amit S., Regev-Yochay G., Afek A., Kreiss Y. & Leshem E. Early rate reductions of SARS-CoV-2 infection and COVID-19 in BNT162b2 vaccine recipients. Lancet 397, 875–877, doi: 10.1016/s0140-6736(21)00448-7 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Angel Y. et al. Association Between Vaccination With BNT162b2 and Incidence of Symptomatic and Asymptomatic SARS-CoV-2 Infections Among Health Care Workers. Jama, doi: 10.1001/jama.2021.7152 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Thompson M. G. et al. Interim Estimates of Vaccine Effectiveness of BNT162b2 and mRNA-1273 COVID-19 Vaccines in Preventing SARS-CoV-2 Infection Among Health Care Personnel, First Responders, and Other Essential and Frontline Workers - Eight U.S. Locations, December 2020-March 2021. MMWR Morb Mortal Wkly Rep 70, 495–500, doi: 10.15585/mmwr.mm7013e3 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Haas E. J. et al. Impact and effectiveness of mRNA BNT162b2 vaccine against SARS-CoV-2 infections and COVID-19 cases, hospitalisations, and deaths following a nationwide vaccination campaign in Israel: an observational study using national surveillance data. Lancet, doi: 10.1016/s0140-6736(21)00947-8 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Baden L. R. et al. Efficacy and Safety of the mRNA-1273 SARS-CoV-2 Vaccine. N Engl J Med 384, 403–416, doi: 10.1056/NEJMoa2035389 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Planas D. et al. Reduced sensitivity of SARS-CoV-2 variant Delta to antibody neutralization. Nature, doi: 10.1038/s41586-021-03777-9 (2021). [DOI] [PubMed] [Google Scholar]
- 9.Lopez Bernal J. et al. Effectiveness of Covid-19 Vaccines against the B.1.617.2 (Delta) Variant. N Engl J Med, doi: 10.1056/NEJMoa2108891 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Eguia R. T. et al. A human coronavirus evolves antigenically to escape antibody immunity. PLoS Pathog 17, e1009453, doi: 10.1371/journal.ppat.1009453 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Edridge A. W. D. et al. Seasonal coronavirus protective immunity is short-lasting. Nat Med 26, 1691–1693, doi: 10.1038/s41591-020-1083-1 (2020). [DOI] [PubMed] [Google Scholar]
- 12.Hendley J. O., Fishburne H. B. & Gwaltney J. M. Jr. Coronavirus infections in working adults. Eight-year study with 229 E and OC 43. Am Rev Respir Dis 105, 805–811, doi: 10.1164/arrd.1972.105.5.805 (1972). [DOI] [PubMed] [Google Scholar]
- 13.Schmidt O. W., Allan I. D., Cooney M. K., Foy H. M. & Fox J. P. Rises in titers of antibody to human coronaviruses OC43 and 229E in Seattle families during 1975–1979. Am J Epidemiol 123, 862–868, doi: 10.1093/oxfordjournals.aje.a114315 (1986). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hanrath A. T., Payne B. A. I. & Duncan C. J. A. Prior SARS-CoV-2 infection is associated with protection against symptomatic reinfection. J Infect 82, e29–e30, doi: 10.1016/j.jinf.2020.12.023 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Greaney A. J. et al. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host Microbe, doi: 10.1016/j.chom.2021.02.003 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Shrock E. et al. Viral epitope profiling of COVID-19 patients reveals cross-reactivity and correlates of severity. Science, doi: 10.1126/science.abd4250 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Li Y. et al. Linear epitope landscape of the SARS-CoV-2 Spike protein constructed from 1,051 COVID-19 patients. Cell Rep 34, 108915, doi: 10.1016/j.celrep.2021.108915 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Stoddard C. I. et al. Epitope profiling reveals binding signatures of SARS-CoV-2 immune response in natural infection and cross-reactivity with endemic human CoVs. Cell Rep 35, 109164, doi: 10.1016/j.celrep.2021.109164 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Corbett K. S. et al. SARS-CoV-2 mRNA vaccine design enabled by prototype pathogen preparedness. Nature 586, 567–571, doi: 10.1038/s41586-020-2622-0 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Goel R. R. et al. Distinct antibody and memory B cell responses in SARS-CoV-2 naïve and recovered individuals following mRNA vaccination. Sci Immunol 6, doi: 10.1126/sciimmunol.abi6950 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Prendecki M. et al. Effect of previous SARS-CoV-2 infection on humoral and T-cell responses to single-dose BNT162b2 vaccine. Lancet 397, 1178–1181, doi: 10.1016/s01406736(21)00502-x (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Edara V. V., Hudson W. H., Xie X., Ahmed R. & Suthar M. S. Neutralizing Antibodies Against SARS-CoV-2 Variants After Infection and Vaccination. Jama, doi: 10.1001/jama.2021.4388 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Greaney A. J. et al. Antibodies elicited by mRNA-1273 vaccination bind more broadly to the receptor binding domain than do those from SARS-CoV-2 infection. Sci Transl Med 13, doi: 10.1126/scitranslmed.abi9915 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Garrett M. E. et al. High-resolution profiling of pathways of escape for SARS-CoV-2 spike-binding antibodies. Cell 184, 2927–2938.e2911, doi: 10.1016/j.cell.2021.04.045 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Piccoli L. et al. Mapping Neutralizing and Immunodominant Sites on the SARS-CoV-2 Spike Receptor-Binding Domain by Structure-Guided High-Resolution Serology. Cell 183, 1024–1042.e1021, doi: 10.1016/j.cell.2020.09.037 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Poh C. M. et al. Two linear epitopes on the SARS-CoV-2 spike protein that elicit neutralising antibodies in COVID-19 patients. Nat Commun 11, 2806, doi: 10.1038/s41467-020-16638-2 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Li Y. et al. Linear epitopes of SARS-CoV-2 spike protein elicit neutralizing antibodies in COVID-19 patients. Cell Mol Immunol, doi: 10.1038/s41423-020-00523-5 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Schäfer A. et al. Antibody potency, effector function, and combinations in protection and therapy for SARS-CoV-2 infection in vivo. J Exp Med 218, doi: 10.1084/jem.20201993 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Tauzin A. et al. A single dose of the SARS-CoV-2 vaccine BNT162b2 elicits Fc-mediated antibody effector functions and T cell responses. Cell Host Microbe, doi: 10.1016/j.chom.2021.06.001 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Winkler E. S. et al. Human neutralizing antibodies against SARS-CoV-2 require intact Fc effector functions for optimal therapeutic protection. Cell 184, 1804–1820.e1816, doi: 10.1016/j.cell.2021.02.026 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ullah I. et al. Live imaging of SARS-CoV-2 infection in mice reveals neutralizing antibodies require Fc function for optimal efficacy. bioRxiv, doi: 10.1101/2021.03.22.436337 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Jackson L. A. et al. An mRNA Vaccine against SARS-CoV-2 - Preliminary Report. N Engl J Med 383, 1920–1931, doi: 10.1056/NEJMoa2022483 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Garrett M. E. et al. Phage-DMS: A Comprehensive Method for Fine Mapping of Antibody Epitopes. iScience 23, doi: 10.1016/j.isci.2020.101622 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Mohan D. et al. PhIP-Seq characterization of serum antibodies using oligonucleotide-encoded peptidomes. Nat Protoc 13, 1958–1978, doi: 10.1038/s41596-018-0025-6 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Sauer M. M. et al. Structural basis for broad coronavirus neutralization. Nat Struct Mol Biol 28, 478–486, doi: 10.1038/s41594-021-00596-4 (2021). [DOI] [PubMed] [Google Scholar]
- 36.Gaebler C. et al. Evolution of antibody immunity to SARS-CoV-2. Nature 591, 639–644, doi: 10.1038/s41586-021-03207-w (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Sakharkar M. et al. Prolonged evolution of the human B cell response to SARS-CoV-2 infection. Sci Immunol 6, doi: 10.1126/sciimmunol.abg6916 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Turner J. S. et al. SARS-CoV-2 mRNA vaccines induce persistent human germinal centre responses. Nature, doi: 10.1038/s41586-021-03738-2 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Greaney A. J., Welsh F. C. & Bloom J. D. Co-dominant neutralizing epitopes make anti-measles immunity resistant to viral evolution. Cell Rep Med 2, 100257, doi: 10.1016/j.xcrm.2021.100257 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Walls A. C. et al. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell 181, 281–292.e286, doi: 10.1016/j.cell.2020.02.058 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Jaroszewski L., Iyer M., Alisoltani A., Sedova M. & Godzik A. The interplay of SARS-CoV-2 evolution and constraints imposed by the structure and functionality of its proteins. PLoS Comput Biol 17, e1009147, doi: 10.1371/journal.pcbi.1009147 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Di Tommaso P. et al. Nextflow enables reproducible computational workflows. Nat Biotechnol 35, 316–319, doi: 10.1038/nbt.3820 (2017). [DOI] [PubMed] [Google Scholar]
- 43.Langmead B. & Salzberg S. L. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357–359, doi: 10.1038/nmeth.1923 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Pedregosa F. et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011). [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
We provide a fully reproducible automated workflow which ingests raw sequencing data and performs all analyses presented in the paper. The workflow defines and runs the processing steps within publicly available and static Docker software containers, including phippery and phip-flow described in the Methods section. The source code, Nextflow script, software dependencies, and instructions for re-running the analysis can be found at https://github.com/matsengrp/phage-dms-vacc-analysis.
The generalized PhIP-Seq alignment and count generation pipeline script can be found at https://github.com/matsengrp/phip-flow. A template and documentation for the alignment pipeline configuration is available at https://github.com/matsengrp/phip-flow-template. Finally, we provide a python API, phippery, to query the resulting dataset post-alignment that can be found at https://github.com/matsengrp/phippery.
All raw sequencing data was submitted to the NCBI SRA under PRJNA765705. Pre-processed enrichment data is available upon request. Additionally, differential selection data and more can be explored interactively using the dms-view toolkit available at https://github.com/matsengrp/vacc-dms-view-host-repo.
For more information regarding code and data availability, please email jgallowa@fredhutch.org. For original data from the NIH Moderna trial please see Jackson et al32, and for information on the HAARVI cohort please contact HYC.