Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Oct 28.
Published in final edited form as: Nat Microbiol. 2022 Apr 28;7(5):640–652. doi: 10.1038/s41564-022-01105-z

Daily longitudinal sampling of SARS-CoV-2 infection reveals substantial heterogeneity in infectiousness

Ruian Ke 1, Pamela P Martinez 2,3,4, Rebecca L Smith 4,5,6, Laura L Gibson 7, Agha Mirza 8, Madison Conte 8, Nicholas Gallagher 9, Chun Huai Luo 9, Junko Jarrett 9, Ruifeng Zhou 10, Abigail Conte 10, Tongyu Liu 2, Mireille Farjo 2, Kimberly KO Walden 11, Gloria Rendon 11, Christopher J Fields 11, Leyi Wang 12, Richard Fredrickson 12, Darci C Edmonson 4, Melinda E Baughman 4, Karen K Chiu 4, Hannah Choi 4, Kevin R Scardina 4, Shannon Bradley 4, Stacy L Gloss 4, Crystal Reinhart 4, Jagadeesh Yedetore 4, Jessica Quicksall 4, Alyssa N Owens 13, John Broach 14,15, Bruce Barton 16,17, Peter Lazar 16, William J Heetderks 18, Matthew L Robinson 8, Heba H Mostafa 9, Yukari C Manabe 8,10, Andrew Pekosz 10, David D McManus 19, Christopher B Brooke 2,4,*
PMCID: PMC9084242  NIHMSID: NIHMS1789713  PMID: 35484231

Abstract

The dynamics of SARS-CoV-2 replication and shedding in humans remain poorly understood. We captured the dynamics of infectious virus and viral RNA shedding during acute infection through daily longitudinal sampling of 60 individuals for up to 14 days. By fitting mechanistic models, we directly estimated viral expansion and clearance rates, and overall infectiousness for each individual. Significant person-to-person variation in infectious virus shedding suggests that individual-level heterogeneity in viral dynamics contributes to superspreading. Viral genome loads often peaked days earlier in saliva than in nasal swabs, indicating strong tissue compartmentalization and suggesting that saliva may serve as a superior sampling site for early detection of infection. Viral loads and clearance kinetics of Alpha (B.1.1.7) and previously circulating non-variant of concern viruses were mostly indistinguishable, indicating that the enhanced transmissibility of this variant cannot be simply explained by higher viral loads or delayed clearance. These results provide a high-resolution portrait of SARS-CoV-2 infection dynamics and implicate individual-level heterogeneity in infectiousness in superspreading.


Transmission of SARS-CoV-2 by pre-symptomatic and asymptomatic individuals has been a major contributor to the explosive spread of this virus 15. Recent epidemiological investigations of community outbreaks have indicated that transmission of SARS-CoV-2 is highly heterogeneous, with a small fraction of infected individuals (often referred to as “superspreaders”) contributing a disproportionate share of forward transmission 68. Transmission heterogeneity has also been implicated in the epidemic spread of several other important viral pathogens, including measles and smallpox 9. Numerous behavioral and environmental explanations have been offered to explain transmission heterogeneity, but the extent to which the underlying features of the infection process within individual hosts contribute towards the superspreading phenomenon remains unclear. Addressing this gap in knowledge will inform the design of more targeted and effective strategies for controlling community spread.

Viral infection is a highly complex process in which viral replication and shedding dynamics are shaped by the complex interplay between host and viral factors. Recent studies have suggested that the magnitude and/or duration of viral shedding in both nasal and saliva samples correlate with disease severity, highlighting the potential importance of viral dynamics in influencing infection outcomes 1013. Variation in viral load has also been suggested to correlate with transmission risk 14. In addition to implications for pathogenesis and transmission, defining the contours of viral shedding dynamics is also critical for designing effective surveillance, screening, and testing strategies 15. To date, studies aimed at describing the longitudinal dynamics of SARS-CoV-2 shedding have been limited by (a) sparse sampling frequency, (b) failure to capture the early stages of infection when transmission is most likely, (c) absence of individual-level data on infectious virus shedding kinetics, and (d) biasing towards the most severe clinical outcomes 1621. This is also true for viruses beyond SARS-CoV-2, as the dynamics of natural infection in humans have not been described in detail for any acute viral pathogen.

Here, we capture the longitudinal viral dynamics of mild and asymptomatic early acute SARS-CoV-2 infection in 60 people by recording daily measurements of both viral RNA shedding (from mid-turbinate nasal swabs and saliva samples) and infectious virus shedding (from mid-turbinate nasal swabs) for up to 14 days. We reveal a striking degree of individual-level heterogeneity in infectious virus shedding between individuals, thus providing a partial explanation for the central role of superspreaders in community transmission of SARS-CoV-2. We also directly compare the shedding dynamics of Alpha (B.1.1.7) and previously circulating, non-Alpha viruses, revealing no significant differences in nasal or saliva shedding. Altogether, these results provide a high-resolution, multi-parameter empirical profile of acute SARS-CoV-2 infection in humans and implicate person-to-person variation in infectious virus shedding in driving patterns of epidemiological spread of the pandemic.

Description of cohort and study design

During the Fall of 2020 and Spring of 2021, all faculty, staff, and students at the University of Illinois at Urbana-Champaign were required to undergo at least twice weekly RTqPCR testing for SARS-CoV-2 22. We leveraged this large-scale, high-frequency screening program to enroll symptomatic, pre-symptomatic, and asymptomatic SARS-CoV-2-infected individuals. We enrolled university faculty, staff, and students who reported a negative RTqPCR test result in the past seven days and were either (a) within 24 hours of a positive RTqPCR result, or (b) within five days of exposure to someone with a confirmed positive RTqPCR result. These criteria ensured that we only enrolled people within the first days of infection.

We collected both nasal and saliva samples daily for up to 14 days to generate a high-resolution portrait of viral dynamics during the early stages of SARS-CoV-2 infection. Participants also completed a daily online symptom survey. Our study cohort was primarily young (median age: 28; range: 19–73), non-Hispanic white, and skewed slightly towards males (Table S1). All infections were either mild or asymptomatic, and none of the participants were ever hospitalized for COVID-19. All participants in this cohort reported that they had never been previously infected with SARS-CoV-2, and none were vaccinated against SARS-CoV-2 at the time of enrollment.

Early SARS-CoV-2 viral dynamics vary significantly between individuals

To examine viral dynamics at the individual level, we plotted Ct/CN values from both saliva and nasal swab samples (the RTqPCR assay used for nasal swab samples reports CN values, an objective measure of the cycle number of the maximal rate of PCR signal increase, rather than Ct values. CN values and Ct values are equivalent in suitability for quantitative estimates 23), Quidel SARS Sofia 2 antigen fluorescent immunoassay (FIA) results, and viral culture data from nasal swabs, as a function of time relative to the lowest observed CN values (Fig 1A, Extended data Fig 1). In many cases, we captured both the rise and fall of viral genome shedding in nasal and/or saliva samples. A comparison between individuals revealed substantial heterogeneity in shedding dynamics, with obvious differences in the duration of detectable infectious virus shedding, clearance kinetics, and the temporal relationship between shedding in nasal and saliva compartments. Further, 9 out of 60 individuals had no detectable infectious virus in nasal samples (Fig 1A, Extended data Fig 1).

Figure 1: SARS-CoV-2 viral dynamics captured through daily sampling.

Figure 1:

(A) Temporal trends for the saliva RTqPCR (teal dots), nasal swab RTqPCR (navy blue dots), nasal swab viral culture (red crosses), and positive nasal swab antigen test results (dark mustard shaded area). The left Y axis indicates Ct values for saliva RTqPCR assay (covidSHIELD) and CN values for nasal swab RTqPCR assay (Abbott Alinity). The right Y axis indicates results of viral culture assays, where day of culture positivity indicates the day of incubation at which > 50% of Vero-TMPRSS2 cells infected with the sample were positive for cytopathic effect. The vertical dotted line shows the day at which the lowest nasal CN value is observed. Horizontal dashed line indicates limit of detection of RTqPCR and viral culture assays. The title of each plot corresponds to the participant ID for the top 30 individuals with the most data points (the remaining 30 participants are shown in figure S1). Single and double asterisks next to the participant ID indicates B.1.1.7 and P.1 variants respectively. (B) Individual Ct (for saliva) and CN (for nasal swabs) values from samples plotted based on concurrent results from the viral culture assay. Negative indicates samples for which viral culture assay showed no viral growth after 5 days. The boxplot shows first and third quartiles (interquartile range IQR), where the horizontal line is the median value, and the whiskers are 1.5 times the IQR. (C) Plot showing antigen FIA results from days where participants tested either positive or negative by viral culture. The text inside the bars indicates the percentage of antigen FIA results that were positive when concurrent viral culture sample was positive or negative.

Generally, earlier positivity results in the viral culture assay (which suggests higher infectious viral loads) were associated with lower CN values in nasal samples (Fig 1B). This is unsurprising, as both nasal viral genome load and viral infectivity were assayed using the same sample. Saliva Ct values tended to be higher than matched nasal samples, likely due in part to the lower molecular sensitivity of the specific saliva RTqPCR assay used which does not include an RNA extraction step 24. For both sample types, the relationship between viral culture results and Ct/CN values was not absolute, as several nasal swab samples with CN values greater than 30 also tested positive for infectious virus. These data indicate that caution must be exercised when using a simple Ct/CN value cutoff as a surrogate for infectious status.

We also assessed the relationship between antigen FIA and viral culture results and found that participants tested positive by antigen FIA on 93% of the days on which they also tested positive by viral culture (Fig 1C). This finding is consistent with earlier cross-sectional studies examining the relationship between antigen tests positivity and infectious virus shedding 25,26.

While the symptom profiles self-reported by study participants varied widely across individuals, all cases were mild and did not require medical treatment (Extended data Fig 2). To determine whether any specific symptoms correlated with viral culture positivity, we compared the reported frequencies for each symptom on days where individuals tested viral culture positive or negative (Extended data Fig 3). Muscle aches, runny nose, and scratchy throat were significantly more likely to be reported on days when participants were viral culture positive, suggesting these specific symptoms as potential indicators of infectious status. No other symptoms examined exhibited a clear association with viral culture status. Self-reported symptom data from this study may be partially skewed by having been collected after participants were notified of their initial positive test result or potential exposure.

Within-host mechanistic models capture viral dynamics in nasal and saliva samples

To better quantify the specific features of viral dynamics within individuals, we implemented five within-host mechanistic models based on models developed previously for SARS-CoV-2 and influenza infection (see Supporting Text for details) (Figs 2A, Extended data Fig 4) 2729. We fit these models to viral genome loads derived from the observed Ct/CN values using a population mixed effect modeling approach (see Supporting Text for details). The viral dynamics in nasal and saliva samples were distinct from each other in most individuals, indicating strong compartmentalization of the oral and nasal cavities. We thus fit the models to data from nasal and saliva samples separately. For each sample type, viral genome loads from 4 individuals remained very low or undetectable throughout the sampling period (Extended data Fig 1), suggesting that these individuals either (a) were enrolled late during infection despite having a recent negative test result, or (b) exhibited highly irregular shedding dynamics. Because we were primarily interested in early infection dynamics, data from these individuals were excluded. Altogether, we selected data from 56 out of 60 individuals for each sample type for model fitting. Adding the excluded individuals did not change the main conclusions (analysis not shown).

Figure 2: Model fits quantify heterogeneity in viral dynamics and discordance in genome shedding between nasal and saliva samples.

Figure 2:

(A) Diagrams outlining structures of the refractory cell model and the immune effector cell model that best fit the nasal swab and saliva RTqPCR data, respectively. In the refractory cell model, target cells (T) are infected by viruses (V) at rate β. Infected cells first become eclipse cells (E) and do not produce viruses; at rate k, eclipse cells become productively infected cells (I) producing both viruses and interferon (F) at rate π and s, respectively. They die at rate δ. Binding of interferons with target cells induces an antiviral response that turns target cells into cells refractory to infection (R). The rate of induction of the antiviral response is Φ. Refractory cells can revert to target cells at rate ρ. In the effector cell model, we assume that over the course of infection, immune effector cells (X) that clear infected cells are activated and recruited. This leads to an increase in infected cell death rate from δ1 to δ1+δ2. (B) Model fits to nasal sample (navy blue) and saliva (teal) RTqPCR results from the same subset of individuals shown in figure 1A. Includes last recorded negative saliva RTqPCR result prior to study enrollment. Dotted lines represent the limit of detection (LoD) for the RTqPCR assays. Dots on the dotted lines denote measurements under the LoD. (C, D) Distributions of the exponential viral growth rates, days from infection to peak viral genome load, and days from peak to undetectable viral genome loads predicted by the refractory cell model (nasal data; panel C) and the immune effector model (saliva data; panel D) across 56 individuals in this cohort. (E) Association between age and the estimated strength of the antiviral immune response (Φ) based on nasal sample data. The Y axis is shown on a log 10 scale. Associations are examined using a standard linear regression analysis. The R-squared value and the p-value are reported. (F) Distribution of the differences in the estimated times of peak viral genome loads between the saliva and nasal samples. Bars colored teal and navy blue represent estimated saliva peaks that occur at least 0.5 day earlier and later than nasal samples, respectively. The gray bar indicates the number of individuals that have similar timing in the peaks.

To identify factors that might partially explain the observed variation in individual-level dynamics, for each model we tested whether the age of the participants or the infecting viral genotype (i.e. non-B.1.1.7 vs. B.1.1.7) co-vary with any of the estimated model parameters in the model fitting. A total of 114 model variations were tested (see Supporting text for the testing strategy). We compared the relative abilities of these model variations to capture the RTqPCR data using the corrected Akaike Information Criterion (AICc) and found that in general, the refractory model and the effector cell model best describe data from nasal and saliva samples, respectively (Table S2, S3). In the refractory model (Fig 2A), we assumed that target cells can be rendered refractory to infection through the activity of soluble immune mediators released by infected cells, such as interferon 30. In the best-fit immune effector cell model (Fig 2A), we assumed that innate and adaptive immune cells are activated and recruited to eliminate infected cells, leading to increased viral clearance 28. See Tables S4, S5 and S6 for estimated values of the population and individual parameters and the fixed parameter values, respectively. Overall, these models described the observed Ct/CN values in both nasal and saliva samples very well (Fig 2B).

The frequent longitudinal sampling of participants during early infection provided a unique opportunity to precisely quantify viral load kinetics during the viral expansion phase, prior to the peak in genome shedding. We estimated the mean early exponential expansion rate r before peak viral load (growth rate, for short) to be 4.4/day (SD: ±0.5/day) in the nasal compartment. The growth rate is 8.8/day (SD: ±1.8/day) in the saliva compartment, i.e. much higher than in the nasal compartment (Fig 2C-D).

Viral clearance kinetics clearly differed between nasal and saliva samples (Fig 2B-D). For nasal samples, viral genome loads decreased relatively quickly after peak. This decrease is mostly driven by lose of productively infected cells, and we estimated an average death rate of productively infected cells at 2.5/day (SD: ±0.4/day); however, the viral decline slowed down over time. In saliva, post-peak viral genome loads declined initially at a slower rate than the decline in nasal samples. Consequently, we estimated a much smaller average death rate of productively infected cells in saliva during this phase, at 0.4/day (SD: ±0.3/day). However, our model suggested the existence of a second clearance phase with a more rapid decline occurring 1–2 weeks after infection, potentially due to the onset of effector cell and/or neutralizing antibody responses. Overall, we estimate that it takes on average 4.9/day (SD: ±0.5/day) and 3.9/day (SD: ±0.8/day) from infection to peak viral loads, in the nasal and the saliva compartments, respectively (Fig 2C-D). The average period from peak to undetectable genome viral load is 22.3 days (SD: ±8.3 days) and 14.9 days (SD: ±3.2 days) in the nasal and the saliva compartments, respectively.

Interestingly, the model predicts a significant correlation (p<0.01) in nasal samples between age and the Φ parameter which describes the effectiveness of the anti-viral immune response in rendering target cells refractory to infection (Fig 2E). This suggests that innate immune responses are less effective at limiting SARS-CoV-2 in the nasal compartment of older individuals within our cohort, consistent with prior studies describing dysregulation of innate immunity to viral infection in aged individuals 3133. There was no significant correlation between age and either growth rate or clearance rate in nasal samples (Extended data Fig 5).

Overall, we noted a surprising degree of discordance in viral dynamics between nasal and saliva samples for many participants. In most individuals (46 out of 54 analyzed), viral genome shedding peaked at least 1 day earlier in saliva than in nasal samples (Fig 2F). In contrast, the peak in nasal shedding only preceded the saliva peak by at least 1 day in 4 individuals.

Significant heterogeneity in the infectious potential of individuals

We next examined the duration of infectious virus shedding in nasal samples, as a surrogate for the infectious potential of an individual. There exists a large variation in the number of days for which an individual tested positive for cell culture on nasal swabs (Fig 3A). 9 out of 60 individuals tested negative by viral culture throughout the sampling period, whereas 1 individual tested positive for 9 days (Fig 3A). We found a weak positive correlation between the duration of viral culture positivity and participant age (Fig 3B). Of note, many study participants were viral culture positive on the first day of sample collection, suggesting that we failed to capture the onset of viral culture positivity for these individuals and thus may be underestimating the duration of infectious virus shedding for a subset of study participants.

Figure 3: Substantial heterogeneity in infectious virus shedding between individuals.

Figure 3:

(A) The distribution in the numbers of days in which participants tested positive by viral culture on nasal swabs following study enrollment. (B) Association between age of study participants and the number of days of positive viral culture using a standard linear regression analysis. The R-squared value and the p-value are reported. (C) Relationship between the CN value in nasal samples and the probability of the sample being viral culture positive, summarized across all individuals. Individual samples were binned based on their CN values. Dots indicate the observed percentage of positive samples within a bin that were viral culture positive. The solid line and the shaded area are the mean and the 90% confidence intervals, respectively, of the trajectories generated using the best-fit parameters of the saturation model (see extended data Fig 7 for individual fits). (D) The relationship between infectious virus shed (in arbitrary units or a.u.) and CN values by the saturation model for 56 individuals in our analysis. (E) Distribution of the estimated total cumulative level of infectious virus (in arbitrary units) shed from the nasal passage by each participant over the course of infection. Solid line shows the best-fit gamma distribution with a shape parameter of 1.6. (F) Association between age and the estimated total infectious virus (in arbitrary units) shed by each individual. The R2 value and the p-value from a linear regression (dashed line) are shown in panels (B) and (F).

To better quantify the infectious potential of each individual, we first used the viral culture data as a measure for intrinsic infectiousness (infectiousness for short below) to characterize how infectiousness depends on the viral genome load. We fitted three alternative models as proposed before 27 to the paired nasal RTqPCR and viral culture data collected from each individual using a non-linear mixed effect modeling approach (see Extended data Fig. 6 for a workflow and Supporting Text for more details). Comparing models using the AIC scores, we found that the relationship is best described by a saturation model where the infectious virus load is a Hill-type function of the viral genome load (Fig 3C, Extended data Fig 7, and Table S7). See Table S8 for the best-fit parameter values.

Using the best-fit model, we estimated the infectiousness of each individual over the course of infection (Extended data Fig 8). Note that the dataset only allows us to estimate a quantity that is a constant proportion of the infectious virus load (rather than its absolute value) across time and between individuals, thus we report the predicted values in arbitrary units (a.u.) as a relative measure of infectiousness. Our model predicts that infectious virus shedding increases sharply when nasal CN values fall below 22, and that the average amount of infectious virus shed is zero for CN values above 29 (Fig. 3D). Importantly, there exists a high level of heterogeneity in infectiousness across different individuals that is not fully explained by differences in viral genome load (Fig. 3D). For example, at nasal CN values around 13, infectious virus shedding reaches values higher than 20 a.u. in 3 individuals; while in 11 individuals it is below 4 a.u. This suggests that viral Ct/CN values are not precisely predictive of infectiousness.

We next estimated the total infectiousness of each individual by integrating the area under the infectious virus load curve over the course of infection. This approach again revealed a large degree of heterogeneity in individual-level infectiousness, with a more than 57-fold difference between the highest and lowest estimated infectiousness (104.0 and 1.8 a.u., respectively) (Fig 3E). We found that a Gamma distribution with a shape parameter of 1.6 describes the distribution of individual infectiousness well (Fig 3E). These data suggest that the previously reported heterogeneity in secondary transmission rates 6,7 is likely to arise from a combination of heterogeneity in contact structure and heterogeneity in intrinsic infectiousness34. This emphasizes the potential for a small subset of individuals that exhibit high intrinsic infectiousness to function as ‘superspreaders’ if they have frequent and/or high-risk contacts during the infectious period. Finally, we observed a significant correlation between age and total infectiousness (p<0.01; R2 = 0.21, Fig 3F).

Analysis of B.1.1.7 (Alpha) viral dynamics

Finally, we asked whether infection with the B.1.1.7 (Alpha) variant of concern (VOC) was associated with any significant differences in viral dynamics that could potentially explain the enhanced transmissibility of this genotype 3537. Previous studies have suggested that B.1.1.7 infection may result in higher peak viral loads or prolonged shedding compared with previously circulating genotypes 3840. Within our cohort, 16 out of 60 individuals were infected with B.1.1.7.

Both the empirical data and our model analysis (Fig 4A, C) suggest that the overall viral genome shedding dynamics in both nasal and saliva samples are indistinguishable between B.1.1.7 and non-B.1.1.7 infections (none of the latter were VOC genotypes except for a single P.1 (Gamma) infection; see Table S9). Comparison of parameter estimates in nasal samples suggested a slightly slower growth rate and time to peak for B.1.1.7 versus non-B.1.1.7 (Fig 4B), however it is not clear whether this difference is biologically meaningful (Fig 4A). Most importantly, we estimate that there is no significant difference between B.1.1.7 and non-B.1.1.7 viruses in total infectiousness in the nasal compartment (Fig 4B). Previously, we have shown that the area under the logarithm of genome viral loads, denoted as AUC(log), may serve as a surrogate for infectiousness 27. Here, we calculated AUC(log) from predicted viral load trajectories in the saliva compartment in each individual, and found no difference between B.1.1.7 and non-B.1.1.7 viruses (Fig 4D). These data indicate that other mechanisms not reflected in viral shedding dynamics drive the increased transmissibility of the B.1.1.7 (Alpha) variant.

Figure 4: Comparison of viral dynamics between B.1.1.7 and non-B.1.1.7 viruses.

Figure 4:

(A, C) Viral genome load of B.1.1.7 infections (purple) and non-B.1.1.7 infections (pink) over time in (A) nasal and (C) saliva samples, as measured by RTqPCR (Dots). Ribbons indicate 90% confidence intervals of predicted CN and Ct value trajectories respectively using population parameters estimated from modeling analysis (Table S4). (B, D) Comparison of estimated values for the indicated summary statistics of viral dynamics between individual B.1.1.7 and non-B.1.1.7 infections. (n=14 and 42 for B.1.1.7 and non-B.1.1.7 infections, respectively). The boxes of the boxplots start in the first quartile and ends in the third quartile of the data (e.g., interquartile range IQR), and the line inside the box represents the median. The whiskers represent 1.5 x the IQR, and the closed circles represent outliers. The p-values for the Wilcoxon rank-sum test are reported. Note that because age covaries with the total infectious virus shed (Fig. 3F), comparison of the total infectious virus shed after adjusting for age is shown in panel (B) (see Methods).

Discussion:

This study describes the results of daily multi-compartment sampling of viral dynamics within dozens of individuals newly infected with SARS-CoV-2 and provides a comprehensive, high-resolution description of viral shedding and clearance dynamics in humans.

Superspreading, in which a small subset of infected individuals are responsible for a disproportionately large share of transmission events, has been identified as a major driver of community spread of SARS-CoV-2, SARS-CoV, and many other acute viral pathogens 6,7,9. Superspreading is believed to arise from heterogeneity in both (1) contact structure between individuals arising from behavioral and environmental factors, and (2) the intrinsic infectiousness of individuals 9,34,41. While heterogeneity in contact structure has been studied extensively 4245, the extent of heterogeneity in infectiousness arising from individual-level viral dynamics remains unknown. Although several studies attempted to quantify this 20,34, the lack of empirical measure of viral genome load and infectious virus shedding dynamics during early infection, i.e. a critical period for SARS-CoV-2 transmission, prevents precise estimation.

To address this question, we empirically quantified infectious virus shedding through daily longitudinal sampling of individuals infected with SARS-CoV-2. The substantial heterogeneity in infectious virus shedding that we observed among individuals indicates that superspreading is likely driven by individual-level variation in specific features of the infection process, in addition to behavioral and environmental factors. We also found that heterogeneity in infectious virus shedding was only partly explained by individual-level heterogeneity in viral genome load dynamics, suggesting that additional factors such as variation in the timing and magnitude of the neutralizing antibody response might contribute 46. Our results here suggest caution in assessing the infectiousness of an individual using viral genome load data alone. Further, the absence of clear viral genetic correlates of infectiousness within this dataset suggests the existence of specific host determinants of superspreading potential. While we identified age as a significant correlate of infectiousness, additional determinants likely exist. Defining these correlates could aid future efforts to mitigate community spread of the virus by helping identify individuals with elevated risk of becoming superspreaders.

Our finding that viral shedding often peaks earlier in saliva versus the nasal compartment, sometimes by several days, corroborates a recent study of four individuals 47 and has several important implications. First, saliva screening may be a more effective sample type than nasal swabs for detecting infected individuals prior to or early in the infectious period 48. Early detection and isolation of infected individuals is absolutely critical for breaking transmission chains 15. Moreover, early viral shedding from the oral cavity may contribute to the high prevalence of pre-symptomatic SARS-CoV-2 transmission. We were unable to directly assess viral infectivity in saliva, so it remains unclear whether the earlier peaks in viral RNA shedding that we observed in saliva reflect earlier shedding of transmission-competent virus. The earlier detection of virus in saliva also raises questions about the initial site of SARS-CoV-2 infection. A recent study demonstrated that both salivary glands and oral mucosal epithelium can support SARS-CoV-2 replication, suggesting that infection could be initiated within the oral cavity 49. Alternatively, if infection is initiated in the nasopharynx or soft palette, viral RNA might be detectable in saliva prior to detection in the mid-turbinate swabs used in this study. The discordance in shedding dynamics between oral and nasal samples that we observed in many participants is consistent with a significant degree of compartmentalization between these adjacent but distinct tissue sites, as has been observed in animal models of influenza virus infection 50,51.

The specific mechanisms driving the enhanced transmissibility of the B.1.1.7 variant remain poorly understood. Recent studies have identified alterations in the structural conformation of the spike protein and enhanced antagonism of innate immunity by B.1.1.7 as potential contributors 52,53. Contrary to previous clinical studies, we observed no significant differences in either peak viral loads or clearance kinetics between B.1.1.7 and non-B.1.1.7 viruses as measured in either nasal swabs or saliva. Our results are consistent with studies demonstrating the absence of a growth advantage for B.1.1.7 in primary human respiratory epithelial cells 54,55. Similarly, a recent longitudinal study of RNA shedding observed no significant differences in mean peak viral RNA loads, clearance kinetics, or infection duration of the Alpha and Delta variants compared with non VOCs 39. If the timing of symptom onset differs between B.1.1.7 and non-B.1.1.7 infections, it could potentially explain why cross-sectional analyses of viral loads might register lower Ct values for B.1.1.7 samples. These data suggest that the enhanced transmissibility of the B.1.1.7 variant may also be driven by features not reflected in shedding dynamics, e.g., enhanced environmental stability or a lower infectious dose threshold.

This study has several limitations that must be considered. First, our study cohort was limited to faculty, students, and staff of the University of Illinois at Urbana-Champaign and did not include anyone who was hospitalized for COVID-19. The limited demographic and clinical profile of this cohort means that our results may not reflect the dynamics that occur during severe and lethal infections and/or in populations not well represented in our study. Second, there are multiple potential sources of technical variation that could contribute to noise in our experimental measurements. These include variability in sample collection quality and the potential for sub-genomic viral RNA to be detected in our RTqPCR assays. While we took steps to minimize variation in sample collection quality, including having all sample collections remotely observed by trained study staff, it is possible that some of the sample-to-sample variation we observe is due to differences in sample quality. Finally, it must be noted that the results of viral culture assays performed on nasal swabs may not perfectly correlate with the actual transmission potential of an individual.

Altogether, our data provide a high-resolution view of the longitudinal viral dynamics of SARS-CoV-2 infection in humans and implicate individual-level heterogeneity in viral shedding as playing a critical role in community spread of this virus.

Methods:

This study was approved by the Western Institutional Review Board, and all participants provided informed consent.

Participants

All on-campus students and employees of the University of Illinois at Urbana-Champaign are required to submit saliva for RTqPCR testing every 2–4 days as part of the SHIELD campus surveillance testing program. Individuals testing positive were instructed to isolate and were eligible to enroll in this study for a period of 24 hours following receipt of their positive test result. Close contacts of individuals who test positive (particularly those co-housed with them) are instructed to quarantine and were eligible to enroll for up to 5 days after their last known exposure to an infected individual. All participants were also required to have received a negative saliva RTqPCR result 7 days prior to enrollment.

Individuals were recruited via either a link shared in an automated text message providing isolation information sent within 30 minutes of a positive test result, a call from a study recruiter, or a link shared by an enrolled study participant or included in information provided to all quarantining close contacts. In addition, signs were used at each testing location and a website was available to inform the community about the study.

Participants were required to be at least 18 years of age, have a valid university ID, speak English, have internet access, and live within 8 miles of the university campus. After enrollment and consent, participants completed an initial survey to collect information on demographics and health history and were provided with sample collection supplies. Participants who tested positive prior to enrollment or during quarantine were followed for up to 14 days. Quarantining participants who continued to test negative by saliva RTqPCR were followed for up to 7 days after their last exposure. All participants’ data and survey responses were collected in the Eureka digital study platform. All study participants were asked if they had previously tested positive for SARS-CoV-2 or vaccinated against SARS-CoV-2. All participants included in this cohort reported no prior SARS-CoV-2 infection and were unvaccinated at time of enrollment.

Sample collection

Each day, participants were remotely observed by trained study staff collecting:

  1. 2 mL of saliva into a 50mL conical tube.

  2. 1 nasal swab from a single nostril using a foam-tipped swab that was placed within a dry collection tube.

  3. 1 nasal swab from the other nostril using a flocked swab that was subsequently placed in a collection vial containing 3mL of viral transport media (VTM). Swab and VTM manufacturer were same throughout study.

The order of nostrils (left vs. right) used for the two different swabs was randomized. For nasal swabs, participants were instructed to insert the soft tip of the swab at least 1 cm into the indicated nostril until they encountered mild resistance, rotate the swab around the nostril 5 times, leaving it in place for 10–15 seconds. After daily sample collection, participants completed a symptom survey. A courier collected all participant samples within 1 hour of collection using a no-contact pickup protocol designed to minimize courier exposure to infected participants.

Saliva RTqPCR

After collection, saliva samples were stored at room temperature and RTqPCR was run within 12 hours of initial collection in a CLIA-certified diagnostic laboratory. The protocol for the covidSHIELD direct saliva-to-RTqPCR assay used has been detailed previously 24. In brief, saliva samples were heated at 95°C for 30 minutes, followed by the addition of 2X Tris/Borate/EDTA buffer (TBE) at a 1:1 ratio (final concentration 1X TBE) and Tween-20 to a final concentration of 0.5%. Samples were assayed using the Thermo Taqpath COVID-19 assay.

Antigen testing

Foam-tipped nasal swabs were placed in collection tubes, transported with cold packs, and stored at 4°C overnight based on guidance from the manufacturer. The morning after collection, swabs were run through the Sofia SARS antigen FIA on Sofia 2 devices according to the manufacturer’s protocol.

Nasal swab RTqPCR

Collection tubes containing VTM and flocked nasal swabs were stored at −80°C after collection and were subsequently shipped to Johns Hopkins University for RTqPCR and virus culture testing. After thawing, VTM was aliquoted for RTqPCR and infectivity assays. One ml of VTM from the nasal swab was assayed on the Abbott Alinity per manufacturer’s instructions in a College of American Pathologist and CLIA-certified laboratory.

Calibration curve for nasal swab RTqPCR assay

Calibration curve for Alinity assay was determined using digital droplet PCR (ddPCR) as previously described 56. Nasal swab samples previously quantified using the Alinity assay were stored in a −80°C freezer between initial quantification and extraction for calibration curve. Samples were extracted simultaneously using the Perkin Elmer Chemagic 360 automated extraction platform with a sample input and eluate volumes of 300 uL and 60 uL respectively. RNA eluates were stored at −80°C. Digital droplet RT-PCR was performed following the BIO-RAD EUA assay package insert (https://www.fda.gov/media/137579/download). A master mix was prepared per sample using the reagents provided in the ddPCR Supermix for Probes kit as follows: 5.5 uL SuperMix (BIO-RAD), 2.2 uL reverse transcriptase (BIO-RAD), 1.1 uL dithiothreitol (BIO-RAD), 1.1 uL of CDC triplex SARS-CoV-2 primer and probe mix (IDT), and 7.1 uL of nuclease-free water. 17 uL of master mix was transferred to a 96-well PCR plate and combined with 5 uL of RNA in eluate. The plate was then loaded on to the QX-200 automated droplet generator (BIO-RAD). The droplet-containing plate was then heat-sealed with foil in a plate sealer (BIO-RAD) and placed on a C1000 Touch thermal cycler (BIO-RAD) to perform reverse-transcription and amplification. Droplet were read by the QX-200 droplet reader (BIO-RAD). Data were analyzed using the QuantaSoft Analysis Pro 1.0 software.

Virus culture from nasal swabs

Vero-TMPRSS2 cells were grown in complete medium (CM) consisting of DMEM with 10% fetal bovine serum (Gibco), 1 mM glutamine (Invitrogen), 1 mM sodium pyruvate (Invitrogen), 100 U/ml of penicillin (Invitrogen), and 100 μg/ml of streptomycin (Invitrogen) 57. Viral infectivity was assessed on Vero-TMPRSS2 cells as previously described using infection media (IM; identical to CM except the FBS is reduced to 2.5%) 26. When a cytopathic effect was visible in >50% of cells in a given well, the supernatant was harvested. The presence of SARS-CoV-2 was confirmed through RTqPCR as described previously by extracting RNA from the cell culture supernatant using the Qiagen viral RNA isolation kit and performing RTqPCR using the N1 (and N2 SARS-CoV-2-specific primers and probes in addition to primers and probes for human RNaseP gene (Using the CDC research use only 2019-Novel Coronavirus (2019-nCoV) Real-time RT-PCR primer and probes sequences) using synthetic RNA target sequences to establish a standard curve 58.

Viral genome sequencing and analysis

Viral RNA was extracted from 140 uL of heat inactivated (30 minutes at 95°C, as part of protocol detailed in 24) saliva samples using the QIAamp viral RNA mini kit (QIAGEN). 100ng of viral RNA was used to generate cDNA using the SuperScript IV first strand synthesis kit (Invitrogen). Viral cDNA was then used to generate sequencing libraries using the Swift SNAP Amplicon SARS CoV2 kit with additional coverage panel and unique dual indexing (Swift Biosciences) which were sequenced on an Illumina Novaseq SP lane. Data were run through the nf-core/viralrecon workflow (https://nf-co.re/viralrecon/1.1.0), using the Wuhan-Hu-1 reference genome (NCBI accession NC_045512.2). Swift v2 primer sequences were trimmed prior to variant analysis from iVar version 1.3.1 (https://genomebiology.biomedcentral.com/articles/10.1186/s13059-018-1618-7) retaining all calls with a minimum allele frequency of 0.01 and higher. Viral lineages were called using the Pangolin tool (https://github.com/cov-lineages/pangolin) version 2.4.2, pango version 1.2.6, and the 5/19/21 version of the pangoLEARN model; based on the nomenclature system described in 59.

Statistics & Reproducibility

Details of statistical analysis methods are below. No statistical method was used to predetermine sample size. For some analyses, a small number of individuals were excluded for reasons detailed in the results section where relevant. The experiments were not randomized and the investigators were not blinded to allocation during experiments and outcome assessment.

Statistical analyses

The difference in the distribution of a parameter of interest between the non-B.1.1.7 infection group and the B.1.1.7 infection group is assessed using a univariate analysis and the p-value is calculated using the Wilcoxon rank-sum test. The comparison of infectious virus shedding between the non-B.1.1.7 infection group and the B.1.1.7 infection group is performed using a multivariate analysis with age as an additional variate. The predicted levels of infectious viral shedding after adjusting for age by assuming age is at 28, i.e. the median age of the cohort, are shown in Fig. 4C.

Figure generation

All figures except for 2A were generated using r studio. Figure 2A was generated using Microsoft Powerpoint.

Overview of model construction and parameter estimation

The goal of the quantitative analyses is to use mathematical models to characterize viral shedding dynamics based on both viral genome loads (as measured by RTqPCR) and the presence or absence of infectious virus (as measured by viral culture assay). Analyzing the model results, we quantify the individual-level heterogeneity in both viral genome shedding dynamics and individual infectiousness. See Extended data Fig 6 for an overview of the analysis workflow.

First, we performed experiments to derive the calibration curves for transforming the Ct/CN values from RTqPCR to viral genome loads (“Viral genome load calibration from Ct/CN values” section below). Note that due to the nature of the RTqPCR assays and the noise in sampling, the viral genome loads derived using the calibration curves represent a proxy for the actual quantities. Nonetheless, this approach is the best approach available to derive viral genome loads for the purpose of viral dynamic modeling and is widely used in understanding SARS-CoV-2 dynamics21,60.

Second, we constructed viral dynamic models and fit these models to viral genome loads (“Viral dynamics models” section below). We estimated key parameters governing infection processes in the nasal and the saliva-associated compartments, such as the viral exponential growth rate before peak viral genome load and the viral clearance rate. This allows us to characterize the individual level heterogeneity in infection kinetics.

Third, we constructed mathematical models to describe how the amount of infectious virus shed relates to changes in the viral genome load, as measured by RTqPCR (“Modeling infectiousness of an individual” section below). We fit the models to viral culture assay data. Using the best model and the predicted viral genome load kinetics from the viral dynamic model, we predicted the extent of infectious virus shedding, i.e. the infectiousness, for each individual, and thus quantified the individual-level heterogeneity in infectiousness.

Viral genome load calibration from Ct/CN values

Viral genome load calibration – nasal samples

To calculate viral genome loads from CN values reported for the nasal samples, we performed calibration curve experiments to empirically define the relationship between CN values obtained from the RTqPCR assay used on nasal swab samples, and absolute viral genome loads within samples, as quantified by digital droplet PCR. We quantified viral genome loads for 62 nasal samples with CN values ranging between 17 and 38. For each sample, absolute copy numbers of viral genomes were measured using two different N-gene-specific primer sets (N1 and N2). To account for technical noise between samples, we also determined the concentration of the host RNAse P (RP) transcript as a control (see Table S10). We then normalized the copy numbers of N1 and N2 targets by dividing by their corresponding RP target numbers, and then multiplied the mean of the RP concentration across all samples. Note that the unit of these measurement is /mL. This is because nasal swab samples were each collected in 3mL of viral transport media (VTM).

Plotting the logarithm of the normalized viral genome loads against the associated CN values shows a clear linear relationship, justifying the use of a linear regression below. Linear regression lines with similar coefficients were used as calibration curves in other studies21,60. We also note that the noise in genome viral loads is high when CN values are high (e.g. >33), likely a reflection of increased noise when signal is low26. However, this high level of variation at high CN values will not impact on the conclusion of our study, because the range of viral loads relevant to transmission is much higher (>106 copies/mL; see Fig 3D in the main text).

We then performed a linear regression on the measured CN values and the log10 viral genome loads (Extended data Fig 9). This leads to the following formula for the relationship between CN values and viral genome load:

log10V=11.350.25CN

where V and CN denote the viral genome load and the CN value, respectively. Note that, because of the high number of data points we measured, the level of uncertainty in the regression line is minimal (Extended data Fig 9).

Viral genome load calibration – saliva samples

Unlike the nasal samples, we were not able to measure the calibration curve using saliva samples taken from participants. To quantify the efficiency of the RTqPCR assay used on saliva samples, we used data from calibration experiments in which saliva samples obtained from healthy donors were spiked with SARS-CoV-2 genomic RNA. More specifically, 0.9 mL of saliva from a healthy donor were spiked with 0.1mL of 1.8×10^8, 5.4×10^5 or 6.0×10^4 RNA copies/mL. For samples spiked with 1.8×10^8 RNA copies/mL, 10-fold serial dilutions were performed until the final concentration was 1.8×10^4 RNA copies/mL. A total of 24 samples were collected and Ct values of the N gene were then measured (Table S11).

As above, we plotted the logarithm of the viral loads against the Ct values (Extended data Fig 10). The plot shows a clear linear relationship, justifying the use of a linear regression below. We then performed a linear regression on the measured CN values and the log10 viral genome loads (Extended data Fig 10). This leads to the following formular for the relationship between CN values and viral genome load:

log10V=14.240.28Ct

where V and Ct denote the viral genome load and the Ct value, respectively. As for the nasal calibration curve, the level of uncertainties in the regression line is minimal (Extended data Fig 10).

Note that a major difference between samples spiked with viral genomes with samples taken from infected individuals is that the latter is likely to be noisier because of variation in the sample collection process. However, the two approaches should not differ substantially in assessing the efficiency of the RT-PCR protocol. The impact of the noise in the nasal sample can be minimized by taking a large number of samples over a wide range of CN values, as we have done for the nasal samples. Therefore, the calibration curves we derived above represent an accurate translation of Ct/CN values to viral load.

Viral dynamic models

We constructed viral dynamic models to describe the dynamic changes in viral genome load. The viral genome load patterns in nasal and saliva samples are distinct from each other in many individuals, suggesting compartmentalization of infection dynamics in these two sample sites. Therefore, we use the models below to describe data collected from these two compartments separately. See Fig 2A and Extended data Fig 4 for schematics of these models.

The target cell limited (TCL) model

We first constructed a within-host model based on the target cell limited (TCL) model used for other respiratory viruses, such as influenza61 and recently SARS-CoV-227,29,62. We keep track of the total numbers of target cells (T), cells in the eclipse phase of infection (E), i.e. infected cells not yet producing virus, productively infected cells (I) and viruses (V). The ordinary differential equations are:

dTdt=βVTdEdt=βVTkEdIdt=kEδIdVdt=πIcV [S1]

In this model, target cells are infected by virus with rate constant β. Cells in the eclipse phase become productively infected cells at per capita rate k. Productively infected cells die at per capita rate δ. We use V to describe viruses measured in nasal or saliva samples and it is a proportion of the total virus in the compartment under consideration. Therefore, the rate π is the product of the viral production rate per infected cell and the proportion of virus that is sampled. See Ke et al.27 for a detailed derivation. Viruses are cleared at per capita rate c.

Refractory cell model

We extend the TCL model by including an early innate response, i.e. the type-I/III interferon response, where interferons are secreted from infected cells and bind to receptors on uninfected target cells, stimulating an antiviral response that renders them refractory to viral infection. Note that this is the best model to describe the viral genome load dynamics as measured by RTqPCR from nasal samples.

We keep track of interferon (F) and cells refractory to infection (R), in addition to other quantities in the TCL model. The full ODEs for target cells, refractory cells and interferon are

dTdt=βVTϕFT+ρRdRdt=ϕFTρRdEdt=βVTkEdIdt=kEδIdVdt=πIcVdFdt=sIμF [S2]

In this model, the impact of the innate immune response is to convert target cells into refractory cells at rate ϕFT, where ϕ is a rate constant. Refractory cells can become target cells again at rate ρ. Interferon is produced and cleared at rates s and μ, respectively.

For simplicity and due to a lack of empirical data on interferon responses in our study, we simplify the model by making the quasi-steady-state assumption that the interferon dynamics are much faster than the dynamics of infected cells and assume that dFdt=0. Thus sI=μF or F=sμI.

Let Φ=ϕsμ, so that the ODEs for the innate immunity model become:

dTdt=βVTΦIT+ρRdRdt=ΦITρRdEdt=βVTkEdIdt=kEδIdVdt=πIcV [S3]

Viral production reduction model

In addition to making target cells refractory to infection, the impact of interferons may include reducing virus production from infected cells. We include this action of interferons in the viral production reduction model. As above, we make the quasi-steady-state assumption that the interferon dynamics are much faster than the dynamics of infected cells and assume that F is proportional to I. The ODEs for the model are:

dTdt=βVTdEdt=βVTkEdIdt=kEδIdVdt=π1+γIIcV [S4]

where γ is a constant representing the effect of interferon in reducing viral production.

Immune effector cell model

Over the course of infection, immune effector cells are activated and recruited to kill infected cells. These immune effector cells include innate immune cells such as macrophages and natural killer cells, as well as cells developed during the adaptive immune response, such as cytotoxic T lymphocytes and antibody-secreting B cells. To consider the impact of these immune effector cells, we develop a model, i.e. the effector cell model, based on a previous model for influenza infection28. In this model, we assume that the death rate of infected cells is δ1 at the beginning of the infection. This may reflect the cytotoxic effects of viral infection. After time t1, the death rate of infected cells increases by δ2, where δ2 models the killing of infected cells by immune effector cells. The ODEs for the model are:

dTdt=βVTdEdt=βVTkEdIdt=kEδtIdVdt=πIcVδt=δ1t<t1δ1+δ2tt1 [S5]

Note that this is the best model to describe the viral genome load dynamics as measured by RTqPCR from saliva samples.

Combined model

In the full model, we combine the refractory cell model and the immune effector cell model to consider both the immediate interferon response and the immune effector response. The ODEs for the model are:

dTdt=βVTΦIT+ρRdRdt=ΦITρRdEdt=βVTkEdIdt=kEδtIdVdt=πIcVδt=δ1t<t1δ1+δ2tt1 [S6]

Choice parameter values:

Total target cell numbers T0

We calculate the total numbers of target cells in the nasal and saliva compartments by multiplying the total number of epithelial cells in these two compartments by the fraction of epithelial cells expected to be targets for SARS-CoV-2 infection.

For the total number of epithelial cells in the nasal compartment, we use the estimate from Baccam et al., i.e. 4×108 cells61. This is calculated from the estimate that the surface area of nasal turbinates is 160 cm2 63, and the surface area per epithelial cell is 2 × 10−11 to 4 × 10−11 m2/cell61. For the saliva compartment, the total surface area of the mouth was estimated to be 214.7 cm2 64. Therefore, we estimate that the total number of epithelial cells in the mouth is approximately 4×108×214.7/160=5.4×108 cells.

Hou et al. estimated that the fraction of cells that express angiotensin-converting enzyme 2 (ACE2), i.e. the receptor for SARS-CoV-2 entry, on cell surface is approximately 20% in the upper respiratory tract (URT)65. Therefore, in our model, the initial numbers of target cells are calculated as 4×108×20%=8×107 cells and 5.4×108×20%=1.08×108 cells, in the nasal and the saliva compartment, respectively.

Note that these estimates are approximations using available best-estimates in the literature. For a standard viral dynamics model, the number of initial target cells and the virus production rate are unidentifiable and only their product is identifiable66. Thus, if the actual number of target cells differs from what we estimated here, an increase in the initial number of target cells will lead to corresponding decrease in the estimate of the virus production rate, and vice versa.

The initial number of infected cells (E0)

We assume that one cell in the compartment of interest is infected at the start of infection, E0=1 cell, consistent with Refs. 27 and 67. The small number of infected cells is also consistent with a recent work that estimated from sequencing data that the transmission bottleneck is small for SARS-CoV-2 and there are likely 1–3 infected cells at the initiation of infection6870. Note that, in an earlier work, we showed that changes in the number of initially infected cells between 1–5 in the model do not substaintially change the inference results27.

The initial viral growth rate, r

For all the models above, the initial growth of the viral population before peak viral genome load is dominated by viral infection. This means that the immune responses considered in our models act to change the viral growth trajectory substantially only at later time points71. Thus, we derive an approximation to the initial viral growth rate using the TCL model only (Eqn. S1). This approximation also represents a good approximation for other models.

We first make two simplifying assumptions commonly used in analyzing initial dynamics of the viral dynamic models72,73. First, because at the initial stage of infection, the number of infected cells is orders of magnitude lower than the number of target cells, we assume that the number of target cells is at a constant level, T0. Second, the dynamics of viruses are much quicker than the dynamics of infected cells. For example, the rate of viral clearance is in the time scale of minutes and hours, whereas the death of productively infected cells is in days. Therefore, we make the quasi-steady state assumption, dVdt0, such that the concentrations of viruses are always in proportion to the concentration of productively infected cells, i.e.πIcV. This gives VπcI.

With these two assumptions, Eqn. S1 becomes a system of linear ODEs with 2 variables, E and I:

dEdt=βπcIT0kEdIdt=kEδI [S7]

The Jacobian matrix, J, for this system of ODEs is:

J=kβπcT0kδ

The initial growth rate r is the leading eigenvalue of the Jacobian matrix of the ODE system. We calculate the eigenvalues, λ, for the Jacobian matrix above from JλI=0, where I is the identity matrix, and get:

λ=12k+δ±k+δ2+4kδR01, where R0=βπδcT0.

Then, the leading eigenvalue, i.e. the initial growth rate r, is:

r=12k+δ+k+δ2+4kδR01 [S8]

Model fitting strategy

Fitting viral dynamic models to the viral genome load data

We took a non-linear mixed effect modeling approach to fit the viral dynamic models to viral genome load data from all individuals simultaneously. All estimations were performed using Monolix (Monolix Suite 2019R2, Antony, France: Lixoft SAS, 2019. lixoft.com/products/monolix/). We allowed random effects on the fitted parameters (unless specified otherwise). All population parameters except for the starting time of simulation, t0, are positive and therefore, we assume that they follow log-normal distributions. For t0, we assume a normal distribution because t0 can be positive or negative.

The parameters β and π in the viral dynamic models strongly correlate with each other when the models are fitted to viral genome load data66. We tested three choices in handling this correlation in fitting all 5 viral dynamic models: 1) a correlation is assumed between parameter β and π in Monolix, 2) parameter β has fixed effect only (i.e. its value is set to be the same across all individuals), and 3) parameter π has fixed effect only.

To test if the age of the individuals and/or the infecting viral genotype (categorized as either non-B.1.1.7 or B.1.1.7) explains the heterogenous patterns in the viral genome load trajectories across the cohort, we tested whether they covary with any of the fitted parameters in the model by setting the two variables as a continuous and a categorical covariate, respectively, in Monolix.

The assumptions on parameters β and π and the choice of parameters that covariate with age or viral strain of infection led to a large number of model choices for fitting. Therefore, we took the following strategy to ensure that we identified the best model and parameter combinations to describe the data:

  • First, we tested the three assumptions about parameters β and π in the five viral dynamic models without any covariate and selected the best assumption for further analysis based on their AICc scores.

  • Second, using the best assumption, we tested the model by including the age of the individuals as a continuous covariate of all fitted parameter values with a random effect first. We then took an iterative approach to test whether if the covariate should be removed from any of the parameters in the model using the Pearson’s correlation test in Monolix. The parameter(s) that has a non-significant p-value (p-value>0.05) or with the lowest p-value is removed from next round of parameter fitting. We iterated the process until all parameters were removed.

  • Then, the best model variant with the lowest AICc score was selected for analysis on whether parameter estimates differed in individuals infected by different viral strains. As before, we take an iterative approach. We first set the viral strain, i.e. non-B.1.1.7 or B.1.1.7 as a categorical covariate of all fitted parameter values with a random effect in the model. We then tested whether the covariate should be removed from any of the parameters in the model using the Analysis of Variance (ANOVA) in Monolix. The parameter(s) that has a non-significant p-value (p-value>0.05) or with the lowest p-value is removed from next round of parameter fitting. We iterated the process until all parameters were removed.

  • Finally, the model variant with the lowest AICc score was selected as the best model.

Predicting viral genome load trajectories for non-B.1.1.7 and B.1.1.7 strains

We randomly sampled 5000 sets of parameter combinations from the distribution specified by the best-fit population parameters (Table S4). For the effector cell model for the saliva compartment, β and π are strongly correlated. We thus applied formulations such that the correlations between the two parameter values are preserved in the random sampling in accordance with the estimated correlation coefficient. We simulated the best-fit model using the 5000 sets of parameter combinations for each of the strain. The median and the 5th and 95th quantile of viral genome loads at each time points are reported.

Modeling infectiousness of an individual

We model how infectiousness depends on the viral genome load in an individual similar to the framework proposed in Ke et al.27. Specifically, we first use the viral culture data collected in this study to infer how the level of infectious virus shed relates to viral genome loads as measured by RTqPCR. From this model, we predict how the level of infectious virus shedding changes over time in each individual and how the overall infectiousness of the infection varies among the participants.

Relationship between viral genome load and infectious viruses

We first consider three alternative models describing how the amount of infectious virus in a sample is related to viral genome load (derived from the CN values): the ‘linear’ model, ‘power-law’ model and the ‘saturation’ model. In these models, due to the nature of stochasticity in sampling, we assume the number of infectious viruses that was in the sample for cell culture experiment to be a random variable, Y, that follows a Poisson distribution, and Vinf represents the expected number of infectious viruses, i.e.Vinf=EY.

  1. The linear model

    We assume that Vinf, is proportional to the viral genome load, V, in the sample:
    Vinf=EY=AV [S9]
    where A is a constant.
  2. The power-law model

    We assume that Vinf, is related to the viral genome load, V, by a power function:
    Vinf=EY=BVh [S10]
    where B and h are constants.
  3. 3. The saturation model

    We assume that Vinf, is related to the viral genome load, V, by a Hill function
    Vinf=EY=VmVhVh+Kmh [S11]
    where Vm and Km are constants and h is the Hill-coefficient.

Probability of cell culture being positive

If each infectious virus has a probability ϱ to establish infection such that the cell culture becomes positive, the number of viruses that successfully establish an infection in cell culture is Poisson distributed with parameter λ=EYϱ=Vinfϱ. Thus, the probability of one or more viruses successfully infecting the culture so that it tests positive is

ppositive=1expλ=1expVinfϱ [S12]

Substituting the expressions of Vinf from the three models above, we get the following expressions for ppositive from the three models (note that we use the subscripts ‘1’, ‘2’ and ‘3’ to denote the three models for Vinf):

ppositive,1=1expVinfϱ=1expDV [S13]

where D=Aϱ.

ppositive,2=1expVinfϱ=1expGVh [S14]

where G=Bϱ.

ppositive,3=1expVinfϱ=1expJVhVh+Kmh [S15]

where J=Vmϱ.

Note that from the expressions above, it becomes clear that we will not be able to estimate parameters A, B and Vm in the three models, because they appear as products with the unknown parameter ϱ in the equations. This means that the viral culture data does not allow us to estimate the absolute number of infectious viruses in a sample or given a viral genome load; instead, we are able to estimate a quantity that is a constant proportion of the actual number of infectious viruses over time and across individuals. Therefore, we report estimations of infectious viruses in an arbitrary unit (a.u.). These estimates represent relative measure of infectiousness. Two estimates measured at different time points and/or from different individuals can be compared with each other using this method.

Model fitting using a population effect modeling approach

For each sample, the viral genome load and the cell culture positivity were measured. Using this data, we estimate parameter values in the three models by minimizing the negative log-likelihood of the data.

More specifically, the likelihood of the mth observation being positive or negative in cell culture is calculated as:

pi,m=ppositive,iVm,ifthekthobservationispositive1ppositive,iVm,ifthekthobservationisnegative [S16]

where Vm is the viral genome load of the mth observation.

Because we have the paired nasal RTqPCR and viral culture data for each individual, we fit the three mathematical models using a non-linear mixed effect modeling approach. Again, all estimations were performed using Monolix (lixoft.com/products/monolix/). We allowed random effects on the fitted parameters (unless specified otherwise). All population parameters with a random effect are assumed to follow log-normal distributions.

To find the best-model explaining the data, we tested models with different combination of parameters with either a random effect or without a random effect (Table S7). The model with the lowest AIC score is selected as the best model.

Note that for each of the three models, we tested a model variation where all parameters in the models have fixed effects only, i.e. a single set of parameters is used to explain viral culture data from every individual. In this case, there is no heterogeneity in parameter values across individuals. The resulting AIC scores are significantly worse than the best-fit model assuming random effects on parameters (see Table S7). This indicates that there is a substantial level of individual heterogeneity in the relationship between infectious viruses shedding and viral genome loads (as shown in Fig 3D).

Calculating confidence intervals of the cell-culture-positivity curve (in Fig 3C)

Similar as the procedures performed for predicting the confidence intervals of viral genome load trajectories, we randomly sampled 5000 sets of parameter combinations from the distribution specified by the best-fit population parameters of the best model, i.e. the saturation model assuming Km only has a fixed effect (Table S8). More specifically, we sampled parameters from a log-normal distribution for J and h, with their means and standard deviations at the best-fit values. Using the parameter combinations, we generated curves of probability of cell-culture positivity at CN values ranging between 10 and 40. The median and the 5th and 95th quantile of viral genome loads at each CN values are reported.

Extended Data

Extended Data Fig. 1. Remainder of individual plots.

Extended Data Fig. 1

Plots of longitudinal assay results from study participants not shown in Figure 1A. Single asterisk next to the participant ID indicates B.1.1.7 variant infection, while double asterisks indicate P1 variant infection.

Extended Data Fig. 2. Individual-level symptom data.

Extended Data Fig. 2

Self-reported symptom data from study participants, overlaid with viral culture status. Participants were asked to complete a survey through the Eureka digital study platform inquiring about the presence or absence of the indicated set of symptoms each day after sample collection.

Extended Data Fig. 3. Comparison of symptoms and viral culture status.

Extended Data Fig. 3

Plots show the proportions of either viral culture negative or viral culture positive days for which participants reported the indicated symptoms. The p-values for the Wilcoxon rank-sum test are reported. Data are only shown for individuals who reported the indicated symptom at least once.

Extended Data Fig. 4. Model structures.

Extended Data Fig. 4

Diagrams showing the structures of the additional three models (not shown in Figure 2A) considered for describing viral load data. See Supporting Text for descriptions of the models.

Extended Data Fig. 5. Model parameter estimates as a function of age.

Extended Data Fig. 5

Plots showing the relationship between age and the indicated model parameter estimates for (A) the refractory cell model (nasal data) and the (B) the immune effector cell model (saliva data). Linear regressions were performed on the data. R2 values and p-values are shown.

Extended Data Fig. 6. Analysis workflow.

Extended Data Fig. 6

Diagram indicating how empirical RTqPCR and viral culture data were used to generate estimations of individual level viral dynamics and infectiousness.

Extended Data Fig. 7. The saturation model accurately predicts the cell culture positivity data.

Extended Data Fig. 7

Lines denote the predicted probability of cell culture being positive. Dots denotes cell culture positivity data, where a dot is at 1 or 0 when the cell culture is positive or negative, respectively.

Extended Data Fig. 8. Individual infectiousness plots.

Extended Data Fig. 8

Estimated infectiousness over time plotted for individual study participants. Dashed lines indicate inferred peak in infectiousness.

Extended Data Fig. 9. The relationship between genome viral load (y-axis; on a log10 scale) and CN value of the nasal samples.

Extended Data Fig. 9

The black line, i.e. the center of the error band, represents the linear regression calibration curve. The shading around the black line shows the standard error for the regression.

Extended Data Fig. 10. The relationship between genome viral load (y-axis; on a log10 scale) and Ct value of the saliva samples.

Extended Data Fig. 10

The black line, i.e. the center of the error band, represents the linear regression calibration curve. The shading around the black line shows the standard error for the regression.

Supplementary Material

Peer Review File
raw data
supplemental tables

Acknowledgments:

We wish to thank Shumon Ahmed, Carly Bell, Nate Bouton, Callie Brennen, Justin Brown, Coleco Buie, Emmaline Cler, Gary Cole, Trey Coleman, Alastair Dunnett, Lauren Engels, Savannah Feher, Kelsey Fox, Lexi Freeman, Yesenia Gonzalez, Montez Harris, Darcy Henness, Dan Hiser, Ayeshah Hussain, Daryl Jackson, Junko Jarrett, Michael Jenkins, Kalombo Kalonji, Syntyche Kanku, Steven Krauklis, Mary Krouse, Elmore Leshoure, Joe Lewis, Maggie Li, Angel Lopez, Guadalupe Lopez, Emily Luna, Chun Huai Luo, Colby Mackey, Skyler McLain, Yared Berhanu Melesse, Madison O’Donnell, Savanna Pflugmacher, Denver Piatt, Skyler Pierce, Gina Quitanilla, Ameera Samad, MacKenzie Scroggins, Monique Settles, Macie Sinn, Pete Varney, Evette Vlach, Raeshun Williams-Chatman, and Todd Young for their efforts supporting recruitment, enrollment, logistics, and/or sample collection and processing. We also thank Jeffrey Olgin, Noah Peyser, and Xochitl Butcher for assistance with the Eureka platform, Michelle Lore for assistance with REDcap, Melanie Loots for assistance with administration, Gillian Snyder for assistance in development of study protocols and logistics, and Erin Iturriaga and Jue Chen for study protocol development. We would also like to thank Avidan Neumann for helpful input and suggestions for improving the analyses. Finally, we are grateful to Alvaro Hernandez and Chris Wright of the DNA Services Lab within the Roy J. Carver Biotechnology Center for assistance in establishment of a SARS-CoV-2 genomic sequencing protocol. Vero-TMPRSS2 cells were provided by the National Institute of Infectious Diseases, Japan. Sofia 2 devices and associated supplies were provided to Carle Foundation Hospital by Quidel, however Quidel played no role in the design of the study or the interpretation or presentation of the data.

Funding.

This work was supported by the National Heart, Lung, and Blood Institute at the National Institutes of Health [3U54HL143541–02S2] through the RADx-Tech program to DDM, LLG, and CBB. The views expressed in this manuscript are those of the authors and do not necessarily represent the views of the National Institute of Biomedical Imaging and Bioengineering; the National Heart, Lung, and Blood Institute; the National Institutes of Health, or the U.S. Department of Health and Human Services. RK and CBB were further supported by the Defense Advanced Research Projects Agency INTERCEPT program through contracts R-00676–19-0 (to RK) and W911NF-17–2-0034 (to CBB).

Footnotes

Code availability:

The computer codes for the mathematical analyses in this paper are available both at https://github.com/BROOKELAB/Viral-dynamics-modeling and DOI: 10.5281/zenodo.6311388.

Competing interests. CBB and LW are listed as inventors on a patent application for the saliva RTqPCR test used in this study. The remaining authors declare no competing interests.

Supplementary information is available for this paper.

Data availability:

All raw data used is included as a supplemental table. Raw sequence data files can be found under BioProject ID PRJNA809434.

References:

  • 1.He X et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat Med 26, 672–675 (2020). [DOI] [PubMed] [Google Scholar]
  • 2.Ferretti L et al. The timing of COVID-19 transmission 10.1101/2020.09.04.20188516 (2020) doi: 10.1101/2020.09.04.20188516. [DOI] [Google Scholar]
  • 3.Szablewski CM et al. SARS-CoV-2 Transmission and Infection Among Attendees of an Overnight Camp — Georgia, June 2020. MMWR Morb. Mortal. Wkly. Rep 69, 1023–1025 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Long Q-X et al. Clinical and immunological assessment of asymptomatic SARS-CoV-2 infections. Nat Med 26, 1200–1204 (2020). [DOI] [PubMed] [Google Scholar]
  • 5.Li R et al. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2). Science 368, 489–493 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sun K et al. Transmission heterogeneities, kinetics, and controllability of SARS-CoV-2. Science 371, eabe2424 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Adam DC et al. Clustering and superspreading potential of SARS-CoV-2 infections in Hong Kong. Nat Med 26, 1714–1719 (2020). [DOI] [PubMed] [Google Scholar]
  • 8.Bi Q et al. Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. The Lancet Infectious Diseases 20, 911–919 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lloyd-Smith JO, Schreiber SJ, Kopp PE & Getz WM Superspreading and the effect of individual variation on disease emergence. Nature 438, 355–359 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Néant N et al. Modeling SARS-CoV-2 viral kinetics and association with mortality in hospitalized patients from the French COVID cohort. Proc Natl Acad Sci USA 118, e2017962118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Silva J et al. Saliva viral load is a dynamic unifying correlate of COVID-19 severity and mortality 10.1101/2021.01.04.21249236 (2021) doi: 10.1101/2021.01.04.21249236. [DOI] [Google Scholar]
  • 12.The Massachusetts Consortium for Pathogen Readiness et al. SARS-CoV-2 viral load is associated with increased disease severity and mortality. Nat Commun 11, 5493 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zheng S et al. Viral load dynamics and disease severity in patients infected with SARS-CoV-2 in Zhejiang province, China, January-March 2020: retrospective cohort study. BMJ m1443 (2020) doi: 10.1136/bmj.m1443. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Marks M et al. Transmission of COVID-19 in 282 clusters in Catalonia, Spain: a cohort study. The Lancet Infectious Diseases S1473309920309853 (2021) doi: 10.1016/S1473-3099(20)30985-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Larremore DB et al. Test sensitivity is secondary to frequency and turnaround time for COVID-19 screening. Sci. Adv 7, eabd5393 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Kim JY et al. Viral Load Kinetics of SARS-CoV-2 Infection in First Two Patients in Korea. J Korean Med Sci 35, e86 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wölfel R et al. Virological assessment of hospitalized patients with COVID-2019. Nature 581, 465–469 (2020). [DOI] [PubMed] [Google Scholar]
  • 18.Lescure F-X et al. Clinical and virological data of the first cases of COVID-19 in Europe: a case series. The Lancet Infectious Diseases 20, 697–706 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Young BE et al. Epidemiologic Features and Clinical Course of Patients Infected With SARS-CoV-2 in Singapore. JAMA 323, 1488 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Jones TC et al. Estimating infectiousness throughout SARS-CoV-2 infection course. Science eabi5273 (2021) doi: 10.1126/science.abi5273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kissler SM et al. Viral dynamics of acute SARS-CoV-2 infection and applications to diagnostic and public health strategies. PLoS Biol 19, e3001333 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Ranoa DRE et al. Mitigation of SARS-CoV-2 Transmission at a Large Public University 10.1101/2021.08.03.21261548 (2021) doi: 10.1101/2021.08.03.21261548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Shain EB & Clemens JM A new method for robust quantitative and qualitative analysis of real-time PCR. Nucleic Acids Research 36, e91–e91 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Ranoa DRE et al. Saliva-Based Molecular Testing for SARS-CoV-2 that Bypasses RNA Extraction 10.1101/2020.06.18.159434 (2020) doi: 10.1101/2020.06.18.159434. [DOI] [Google Scholar]
  • 25.Monel B et al. Release of infectious virus and cytokines in nasopharyngeal swabs from individuals infected with non-B.1.1.7 or B.1.1.7 SARS-CoV-2 variants 10.1101/2021.05.20.21257393 (2021) doi: 10.1101/2021.05.20.21257393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Pekosz A et al. Antigen-Based Testing but Not Real-Time Polymerase Chain Reaction Correlates With Severe Acute Respiratory Syndrome Coronavirus 2 Viral Culture. Clin Infect Dis (2021) doi: 10.1093/cid/ciaa1706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ke R, Zitzmann C, Ho DD, Ribeiro RM & Perelson AS In vivo kinetics of SARS-CoV-2 infection and its relationship with a person’s infectiousness. Proc Natl Acad Sci USA 118, e2111477118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Pawelek KA et al. Modeling Within-Host Dynamics of Influenza Virus Infection Including Immune Responses. PLoS Comput Biol 8, e1002588 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Goyal A, Cardozo-Ojeda EF & Schiffer JT Potency and timing of antiviral therapy as determinants of duration of SARS-CoV-2 shedding and intensity of inflammatory response. Sci. Adv 6, eabc7112 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Blanco-Melo D et al. Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19. Cell 181, 1036–1045.e9 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Shaw AC, Goldstein DR & Montgomery RR Age-dependent dysregulation of innate immunity. Nat Rev Immunol 13, 875–887 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Molony RD et al. Aging impairs both primary and secondary RIG-I signaling for interferon induction in human monocytes. Sci. Signal 10, eaan2392 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Angioni R et al. Age-severity matched cytokine profiling reveals specific signatures in Covid-19 patients. Cell Death Dis 11, 957 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Goyal A, Reeves DB, Cardozo-Ojeda EF, Schiffer JT & Mayer BT Viral load and contact heterogeneity predict SARS-CoV-2 transmission and super-spreading events. eLife 10, e63537 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Washington NL et al. Emergence and rapid transmission of SARS-CoV-2 B.1.1.7 in the United States. Cell S0092867421003834 (2021) doi: 10.1016/j.cell.2021.03.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Davies NG et al. Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science 372, eabg3055 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.The COVID-19 Genomics UK (COG-UK) consortium et al. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature (2021) doi: 10.1038/s41586-021-03470-x. [DOI] [PubMed] [Google Scholar]
  • 38.Calistri P et al. Infection sustained by lineage B.1.1.7 of SARS-CoV-2 is characterised by longer persistence and higher viral RNA loads in nasopharyngeal swabs. International Journal of Infectious Diseases 105, 753–755 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kissler SM et al. Viral Dynamics of SARS-CoV-2 Variants in Vaccinated and Unvaccinated Persons. N Engl J Med NEJMc2102507 (2021) doi: 10.1056/NEJMc2102507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Althaus CL et al. A tale of two variants: Spread of SARS-CoV-2 variants Alpha in Geneva, Switzerland, and Beta in South Africa 10.1101/2021.06.10.21258468 (2021) doi: 10.1101/2021.06.10.21258468. [DOI] [Google Scholar]
  • 41.Lakdawala SS & Menachery VD Catch me if you can: superspreading of COVID-19. Trends Microbiol S0966–842X(21)00124–4 (2021) doi: 10.1016/j.tim.2021.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Mossong J et al. Social Contacts and Mixing Patterns Relevant to the Spread of Infectious Diseases. PLoS Med 5, e74 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Zhang J et al. Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China. Science 368, 1481–1486 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Mistry D et al. Inferring high-resolution human mixing patterns for disease modeling. Nat Commun 12, 323 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Wallinga J, Teunis P & Kretzschmar M Using Data on Social Contacts to Estimate Age-specific Transmission Parameters for Respiratory-spread Infectious Agents. American Journal of Epidemiology 164, 936–944 (2006). [DOI] [PubMed] [Google Scholar]
  • 46.van Kampen JJA et al. Duration and key determinants of infectious virus shedding in hospitalized patients with coronavirus disease-2019 (COVID-19). Nat Commun 12, 267 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Savela ES et al. SARS-CoV-2 is detectable using sensitive RNA saliva testing days before viral load reaches detection range of low-sensitivity nasal swab tests 10.1101/2021.04.02.21254771 (2021) doi: 10.1101/2021.04.02.21254771. [DOI] [Google Scholar]
  • 48.Smith RL et al. Longitudinal assessment of diagnostic test performance over the course of acute SARS-CoV-2 infection. The Journal of Infectious Diseases jiab337 (2021) doi: 10.1093/infdis/jiab337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.NIH COVID-19 Autopsy Consortium et al. SARS-CoV-2 infection of the oral cavity and saliva. Nat Med (2021) doi: 10.1038/s41591-021-01296-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Lakdawala SS et al. The soft palate is an important site of adaptation for transmissible influenza viruses. Nature 526, 122–125 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Amato KA et al. Influenza A virus undergoes compartmentalized replication in vivo dominated by stochastic bottlenecks 10.1101/2021.09.28.462198 (2021) doi: 10.1101/2021.09.28.462198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Cai Y et al. Structural basis for enhanced infectivity and immune evasion of SARS-CoV-2 variants. Science eabi9745 (2021) doi: 10.1126/science.abi9745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Thorne LG et al. Evolution of enhanced innate immune evasion by the SARS-CoV-2 B.1.1.7 UK variant 10.1101/2021.06.06.446826 (2021) doi: 10.1101/2021.06.06.446826. [DOI] [Google Scholar]
  • 54.Brown JC et al. Increased transmission of SARS-CoV-2 lineage B.1.1.7 (VOC 2020212/01) is not accounted for by a replicative advantage in primary airway cells or antibody escape 10.1101/2021.02.24.432576 (2021) doi: 10.1101/2021.02.24.432576. [DOI] [Google Scholar]
  • 55.Ulrich L et al. Enhanced fitness of SARS-CoV-2 variant of concern B.1.1.7, but not B.1.351, in animal models 10.1101/2021.06.28.450190 (2021) doi: 10.1101/2021.06.28.450190. [DOI] [Google Scholar]
  • 56.Gniazdowski V et al. Repeat COVID-19 Molecular Testing: Correlation of SARS-CoV-2 Culture with Molecular Assays and Cycle Thresholds. Clin Infect Dis (2020) doi: 10.1093/cid/ciaa1616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Matsuyama S et al. Enhanced isolation of SARS-CoV-2 by TMPRSS2-expressing cells. Proc Natl Acad Sci U S A 117, 7001–7003 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Waggoner JJ et al. Triplex Real-Time RT-PCR for Severe Acute Respiratory Syndrome Coronavirus 2. Emerg. Infect. Dis 26, 1633–1635 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Rambaut A et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol 5, 1403–1407 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Han MS, Byun J-H, Cho Y & Rim JH RT-PCR for SARS-CoV-2: quantitative versus qualitative. The Lancet Infectious Diseases 21, 165 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Baccam P, Beauchemin C, Macken CA, Hayden FG & Perelson AS Kinetics of Influenza A Virus Infection in Humans. JVI 80, 7590–7599 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Gonçalves A et al. Timing of Antiviral Treatment Initiation is Critical to Reduce SARS-CoV-2 Viral Load. Clin. Pharmacol. Ther 9, 509–514 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Ménache MG et al. UPPER RESPIRATORY TRACT SURFACE AREAS AND VOLUMES OF LABORATORY ANIMALS AND HUMANS: CONSIDERATIONS FOR DOSIMETRY MODELS. Journal of Toxicology and Environmental Health 50, 475–506 (1997). [DOI] [PubMed] [Google Scholar]
  • 64.Collins LMC & Dawes C The Surface Area of the Adult Human Mouth and Thickness of the Salivary Film Covering the Teeth and Oral Mucosa. J Dent Res 66, 1300–1302 (1987). [DOI] [PubMed] [Google Scholar]
  • 65.Hou YJ et al. SARS-CoV-2 Reverse Genetics Reveals a Variable Infection Gradient in the Respiratory Tract. Cell 182, 429–446.e14 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Miao H, Xia X, Perelson AS & Wu H On Identifiability of Nonlinear ODE Models and Applications in Viral Dynamics. SIAM Rev 53, 3–39 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Smith AP, Moquin DJ, Bernhauerova V & Smith AM Influenza Virus Infection Model With Density Dependence Supports Biphasic Viral Decay. Front. Microbiol 9, 1554 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Martin MA & Koelle K Comment on “Genomic epidemiology of superspreading events in Austria reveals mutational dynamics and transmission properties of SARS-CoV-2”. Sci. Transl. Med 13, eabh1803 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Braun KM et al. Acute SARS-CoV-2 infections harbor limited within-host diversity and transmit via tight transmission bottlenecks. PLoS Pathog 17, e1009849 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Valesano AL et al. Temporal dynamics of SARS-CoV-2 mutation accumulation within and across infected hosts. PLoS Pathog 17, e1009499 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Michael Lavigne G, Russell H, Sherry B & Ke R Autocrine and paracrine interferon signalling as ‘ring vaccination’ and ‘contact tracing’ strategies to suppress virus infection in a host. Proc. R. Soc. B 288, 20203002 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Perelson AS & Nelson PW Mathematical Analysis of HIV-1 Dynamics in Vivo. SIAM Rev 41, 3–44 (1999). [Google Scholar]
  • 73.Perelson AS & Ke R Mechanistic Modeling of SARS-CoV-2 and Other Infectious Diseases and the Effects of Therapeutics. Clin. Pharmacol. Ther 109, 829–840 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Peer Review File
raw data
supplemental tables

Data Availability Statement

All raw data used is included as a supplemental table. Raw sequence data files can be found under BioProject ID PRJNA809434.

RESOURCES