Key Points
Question
What factors are associated with SARS-CoV-2 viral load at the time of COVID-19 diagnosis, and is viral load associated with disease severity?
Findings
In this secondary cross-protocol analysis of 1667 placebo recipients from 4 harmonized, randomized, phase 3 COVID-19 vaccine efficacy trials, no associations were found between viral load and any of the measured covariates or disease severity.
Meaning
The findings of this study suggest that caution should be exercised in the use of individual-level viral load in comparisons across trials and/or settings and as a surrogate for COVID-19 severity, especially given increasing diversity in preexisting immunity.
This secondary cross-protocol analysis investigates the viral load at COVID-19 diagnosis among previously uninfected and unvaccinated individuals by demographic and clinical characteristics, viral variant, and trial, as well as the ability of viral load to predict severe disease.
Abstract
Importance
SARS-CoV-2 viral load (VL) in the nasopharynx is difficult to quantify and standardize across settings, but it may inform transmission potential and disease severity.
Objective
To characterize VL at COVID-19 diagnosis among previously uninfected and unvaccinated individuals by evaluating the association of demographic and clinical characteristics, viral variant, and trial with VL, as well as the ability of VL to predict severe disease.
Design, Setting, and Participants
This secondary cross-protocol analysis used individual-level data from placebo recipients from 4 harmonized, phase 3 COVID-19 vaccine efficacy trials sponsored by Moderna, AstraZeneca, Janssen, and Novavax. Participants were SARS-CoV-2 negative at baseline and acquired COVID-19 during the blinded phase of the trials. The setting included the US, Brazil, South Africa, Colombia, Argentina, Peru, Chile, and Mexico; start dates were July 27, 2020, to December 27, 2020; data cutoff dates were March 26, 2021, to July 30, 2021. Statistical analysis was performed from November 2022 to June 2023.
Main Outcomes and Measures
Linear regression was used to assess the association of demographic and clinical characteristics, viral variant, and trial with polymerase chain reaction–measured log10 VL in nasal and/or nasopharyngeal swabs taken at the time of COVID-19 diagnosis.
Results
Among 1667 participants studied (886 [53.1%] male; 995 [59.7%] enrolled in the US; mean [SD] age, 46.7 [14.7] years; 204 [12.2%] aged 65 years or older; 196 [11.8%] American Indian or Alaska Native, 150 [9%] Black or African American, 1112 [66.7%] White; 762 [45.7%] Hispanic or Latino), median (IQR) log10 VL at diagnosis was 6.18 (4.66-7.12) log10 copies/mL. Participant characteristics and viral variant explained only 5.9% of the variability in VL. The independent factor with the highest observed differences was trial: Janssen participants had 0.54 log10 copies/mL lower mean VL vs Moderna participants (95% CI, 0.20 to 0.87 log10 copies/mL lower). In the Janssen study, which captured the largest number of COVID-19 events and variants and used the most intensive post-COVID surveillance, neither VL at diagnosis nor averaged over days 1 to 28 post diagnosis was associated with COVID-19 severity.
Conclusions and Relevance
In this study of placebo recipients from 4 randomized phase 3 trials, high variability was observed in SARS-CoV-2 VL at the time of COVID-19 diagnosis, and only a fraction was explained by individual participant characteristics or viral variant. These results suggest challenges for future studies of interventions seeking to influence VL and elevates the importance of standardized methods for specimen collection and viral load quantitation.
Introduction
There have been more than 750 million confirmed SARS-CoV-2 infections worldwide since the start of the COVID-19 pandemic.1 Understanding drivers of transmission is critical for addressing issues of public health, developing outbreak mitigation policies, and informing individual decision-making. Numerous lines of evidence point to viral load (VL) as a marker of transmission potential.2,3,4,5,6,7 SARS-CoV-2 VL at or near the time of hospitalization has been associated with symptom severity and mortality8,9,10,11,12 and used as an end point in COVID-19 treatment trials.13,14,15,16,17,18,19 However, accurately measuring VL is challenging: it is dynamic, typically peaking before or soon after onset of symptoms; it is highly variable across participants; it differs by specimen type and adequacy of collection; and results from different assays and laboratories may not be directly comparable.6,20
This cross-protocol analysis describes the distribution of SARS-CoV-2 VL at the time of COVID-19 diagnosis for over 1600 placebo recipients from 4 phase 3 COVID-19 vaccine trials conducted in partnership with the COVID-19 Prevention Network (CoVPN).21,22 The data span 8 countries, and waves of infection attributable to the ancestral variant and 9 others. The harmonized COVID-19 definitions and timing of specimen collection across this diverse cohort allowed us to systematically examine factors of variability in SARS-CoV-2 VL at diagnosis prior to any immunization. It also allowed for an assessment of the ability of VL to predict COVID-19 severity in a largely outpatient disease context.
Methods
Participants
This cross-protocol analysis included participant-level data from the placebo groups of 4 randomized, placebo-controlled, phase 3 COVID-19 vaccine efficacy trials.22,23,24,25,26 The trials (herein referenced by study sponsor: [1] Moderna, [2] Janssen, [3] AstraZeneca, and [4] Novavax) were conducted under a US government–funded program, with the CoVPN providing organizational leadership and infrastructure.21 The trials featured harmonized protocols, including primary end points, with start dates from July 27 to December 27, 2020, and primary analysis data cutoffs from March 26 to July 30, 2021. Trial sites were located in the US, Brazil, South Africa, Colombia, Argentina, Peru, Chile, and Mexico,22 comprising a diverse study population and varying epidemiological trends.27 Of note, the Moderna, AstraZeneca, and Novavax trials were conducted primarily in the US (exclusively US for Moderna), while the Janssen trial spanned 8 countries across 3 continents. Local or central institutional review board and/or ethics committee approvals were obtained by each site participating in the 4 trials.23,24,25,26
The study cohort consisted of participants from the placebo groups of the trials who were SARS-CoV-2 negative at enrollment (based on polymerase chain reaction (PCR) and antinucleocapsid serology, or antinucleocapsid serology alone for the AstraZeneca trial) and who were diagnosed with COVID-19 meeting the primary end point definitions (eMethods in Supplement 1) during the blinded phase of the trial. This cohort reflects the COVID-19 experience for the immunologically naive early in the pandemic.
Primary end point COVID-19 was defined as independently adjudicated COVID-19 occurring at least 14 days (≥7 days for Novavax) following the last placebo injection, as detailed in the original trial publications23,24,25,26 and the eMethods in Supplement 1. Broadly, criteria included a positive molecular test (eg, PCR) accompanied by systemic and/or respiratory symptoms; severe COVID-19 was characterized by more substantial symptoms (eMethods in Supplement 1).
The analysis cohort consisted of study cohort participants with VL measurements at COVID-19 diagnosis, including participants who were PCR-negative (0 VL) on their protocol-defined COVID-19 diagnosis date (described in next section). For the Moderna trial, the analysis cohort was further limited to participants who tested negative through day 57 by reverse transcriptase PCR (RT-PCR) and antinucleocapsid serology assay, and PCR positive on the protocol-defined date of COVID-19 diagnosis.
Viral Load Measurements
The primary outcome for this analysis was SARS-CoV-2 VL at COVID-19 diagnosis, defined by VL measured from the nasal and/or nasopharyngeal (NP) swab closest to protocol-defined COVID-19 onset. COVID-19 onset was defined as the date of first positive SARS-CoV-2 PCR test (AstraZeneca), symptom onset (Janssen), the earlier of the 2 (Novavax), or the later of the 2 (Moderna) (eMethods in Supplement 1). Therefore, for some participants it was possible for a PCR test on the date of COVID-19 diagnosis to be negative. Trial was included as a covariate in all analyses, as RT-PCR and VL quantification were performed at different laboratories and using different assays across the trials (see eMethods in Supplement 1).
Symptom-driven PCR testing varied by protocol: Moderna and AstraZeneca brought participants into clinics for confirmatory PCR testing of nasal and/or NP swabs within 1 to 3 days of symptom onset; Janssen and Novavax provided nasal swabs for home collection at symptom onset. Days since COVID-19 onset was included in all analyses to account for timing differences in sample collection. Protocol-specific central laboratories derived VL using validated RT-PCR assays with concurrently run standards for conversion to log10 copies/mL (eMethods in Supplement 1).
Variant Identification
Sequencing was attempted for all infections by protocol-specific laboratories and successful sequences were lineage-typed to identify the viral variant. Full genomes from Janssen and Novavax infections were lineage-typed with the PANGOLIN tool,28 whereas spike-only sequences from Moderna and AstraZeneca were assigned a World Health Organization (WHO) variant label using a tool we developed for this purpose (eMethods in Supplement 1). Specimens with sequences that met one of the WHO-named variant definitions were classified as such; the remaining sequences were from the A.1 and B.1 lineages and classified as ancestral.29 Specimens without sequencing data were considered to have missing variant.
As an alternative to variant classification, for samples with sequencing data, spike Hamming distances were calculated as the number of amino acid positions differing from the Wuhan-Hu-1 ancestral strain (GenBank accession number NC_045512).30
Statistical Analyses
Linear regression was used to assess the association of baseline participant characteristics, exposure risk factors, and disease characteristics with log10 VL at COVID-19 diagnosis. Covariate definitions are provided in the eMethods in Supplement 1. A multivariate model included prespecified covariates based on literature review: age at baseline, sex assigned at birth, self-identified race, ethnicity, baseline self-reported comorbidities associated with high risk of severe COVID-19, country of residence, variant, COVID-19 severity, days since protocol-defined COVID-19 onset, and trial.
Multiple imputation was used to ascribe missing variants using the population proportion of diagnosed infections attributed to each variant within 2 weeks of the date of COVID-19 onset, based on country- or state-specific genomic surveillance data from the Global Initiative on Sharing All Influenza Data (GISAID).31 Results were combined across 20 imputed data sets using Rubin rules (eMethods in Supplement 1).32 The Holm method33 was used to control the familywise error rate at 0.05 across univariate analyses and separately among the variables in the multivariate model.
Univariate, sensitivity, and exploratory analyses were also performed to explore the robustness of our conclusions (eMethods in Supplement 1). The multivariate model was fit without imputation to the subset of participants with viral sequence data, to participants infected with the ancestral variant, and using the spike Hamming distance of the sequence to the ancestral strain in lieu of variant. Additionally, a generalized additive model (GAM) extension of the multivariate model was fit. The GAM model included country-specific smoothed calendar time trends to account for local epidemic dynamics.34,35 Finally, the multivariate analysis was repeated for the subsets of participants who had nonzero VL at diagnosis, who enrolled in the US, and who enrolled in the Janssen trial. By examining more homogenous populations, we aimed to circumvent the effects of confounding variables.
For the Janssen trial, which captured the largest number of COVID-19 events and had the most intensive post–COVID-19 diagnosis specimen collection, log10 VL at diagnosis and area under the 28-day log10 VL curve (corresponding to the mean VL) were evaluated for their ability to predict severe COVID-19, with and without the full set of baseline participant characteristics and covariates. Risk of severe COVID-19 was estimated using super learning, and prediction performance was measured using the cross-validated area under the receiver operating characteristic (ROC) curve (AUC) (eMethods in Supplement 1).36,37,38 Two-sided P < .05 was deemed statistically significant. Statistical analysis was performed using R version 4.0.4 (R Project for Statistical Computing) from November 2022 to June 2023.
Results
Participant Demographics and Disease Characteristics
The analysis cohort included 1667 participants, with Moderna (n = 594 [35.6%]) and Janssen (n = 916 [54.9%]) contributing the majority (Figure 1). In total, 886 participants (53.1%) were male; 995 (59.7%) were enrolled in the US; the mean (SD) age was 46.7 (14.7) years, 204 (12.2%) were 65 years or older; 196 (11.8%) were American Indian or Alaska Native, 41 (2.5%) were Asian, 150 (9.0%) were Black or African American, 110 (6.6%) reported multiple races, 1112 (66.7%) were White, 13 (2.2%) were other race; and 762 (45.7%) were Hispanic or Latino. Differences in the number of cases between trials were affected by the sizes of the placebo groups and epidemiological trends during follow-up.27
Baseline and disease characteristics are summarized by trial in the Table and eTable 1 in Supplement 1; 577 participants (34.6%) had preexisting comorbidities, and 1226 (73.5%) were categorized as overweight or obese at baseline. Among these participants with symptomatic COVID-19, for 263 (15.8%) the disease was classified as severe, although hospitalization rates were generally low (less than 0.2% in the placebo group of each contributing trial).23,24,25,26,39 VL was typically measured within 1 day of COVID-19 onset; sequences and infecting variants were available for 1323 participants (79.4%); of those, 857 (65.5%) corresponded to the ancestral variant. Nine other variants (Alpha, Beta, Gamma, Epsilon, Zeta, Iota, Delta, Lambda, and Mu) were detected at frequencies less than 10%. The Janssen trial included the greatest number of countries and variants. Because of this and other epidemiological and trial factors, country, variant, and trial were confounded in the data set.
Table. Baseline Characteristics and Clinical Characteristics of Placebo Recipients Who Developed COVID-19 in 1 of 4 CoVPN Phase 3 COVID-19 Vaccine Efficacy Trials, With SARS-CoV-2 Viral Load Measured at COVID-19 Diagnosis.
Characteristic | Parent protocol | ||||
---|---|---|---|---|---|
Moderna (n = 594) | AstraZeneca (n = 97) | Janssen (n = 916) | Novavax (n = 60) | Total (n = 1667) | |
Sex assigned at birth, No. (%) | |||||
Female | 293 (49.3) | 33 (34.0) | 417 (45.5) | 38 (63.3) | 781 (46.9) |
Male | 301 (50.7) | 64 (66.0) | 499 (54.5) | 22 (36.7) | 886 (53.1) |
Country, No. (%) | |||||
Argentina | 0 | 0 | 95 (10.4) | 0 | 95 (5.7) |
Brazil | 0 | 0 | 169 (18.4) | 0 | 169 (10.1) |
Chile | 0 | 7 (7.2) | 9 (1.0) | 0 | 16 (1.0) |
Colombia | 0 | 0 | 192 (21.0) | 0 | 192 (11.5) |
Mexico | 0 | 0 | 8 (0.9) | 4 (6.7) | 12 (0.7) |
Peru | 0 | 20 (20.6) | 84 (9.2) | 0 | 104 (6.2) |
South Africa | 0 | 0 | 84 (9.2) | 0 | 84 (5.0) |
US | 594 (100.0) | 70 (72.2) | 275 (30.0) | 56 (93.3) | 995 (59.7) |
Self-reported race, No. (%)a | |||||
American Indian or Alaska Nativeb | 4 (0.7) | 16 (16.5) | 172 (18.8) | 4 (6.7) | 196 (11.8) |
Asian | 23 (3.9) | 0 | 14 (1.5) | 4 (6.7) | 41 (2.5) |
Black or African American | 29 (4.9) | 6 (6.2) | 110 (12.0) | 5 (8.3) | 150 (9.0) |
Multiple | 7 (1.2) | 4 (4.1) | 99 (10.8) | 0 | 110 (6.6) |
Not reported | 10 (1.7) | 1 (1.0) | 33 (3.6) | 1 (1.7) | 42 (2.7) |
White | 508 (85.5) | 70 (72.2) | 488 (53.3) | 46 (76.7) | 1112 (66.7) |
Other | 13 (2.2) | 0 | 0 | 0 | 13 (0.8) |
Ethnicity, No. (%) | |||||
Hispanic or Latino | 134 (22.6) | 36 (37.1) | 580 (63.3) | 12 (20.0) | 762 (45.7) |
Not Hispanic or Latino | 458 (77.1) | 60 (61.9) | 319 (34.8) | 48 (80.0) | 885 (53.1) |
Not reported | 2 (0.3) | 1 (1.0) | 17 (1.9) | 0 | 20 (1.2) |
Age category, No. (%), y | |||||
18-29 | 70 (11.8) | 18 (18.6) | 168 (18.3) | 18 (30.0) | 274 (16.4) |
30-39 | 108 (18.2) | 14 (14.4) | 122 (13.3) | 11 (18.3) | 255 (15.3) |
40-49 | 141 (23.7) | 25 (25.8) | 226 (24.7) | 10 (16.7) | 402 (24.1) |
50-64 | 194 (32.7) | 28 (28.9) | 292 (31.9) | 18 (30.0) | 532 (31.9) |
≥65 | 81 (13.6) | 12 (12.4) | 108 (11.8) | 3 (5.0) | 204 (12.2) |
BMI category, No. (%) | |||||
<18.5 | 3 (0.5) | 1 (1.0) | 6 (0.7) | 0 | 10 (0.6) |
18.5-<25.0 | 115 (19.4) | 24 (24.7) | 263 (28.7) | 23 (38.3) | 425 (25.5) |
≥25.0 | 472 (79.5) | 71 (73.2) | 646 (70.5) | 37 (61.7) | 1226 (73.5) |
Missing | 4 (0.7) | 1 (1.0) | 1 (0.1) | 0 | 6 (0.4) |
COVID-19 comorbidities, No. (%) | 138 (23.2) | 61 (62.9) | 346 (37.8) | 32 (53.3) | 577 (34.6) |
Severe COVID-19 symptoms, No. (%) | 81 (13.6) | 4 (4.1) | 175 (19.1) | 3 (5.0) | 263 (15.8) |
Days since COVID-19 onset, No. (%)c | |||||
−1 | 25 (4.2) | 0 | 0 | 0 | 25 (1.5) |
−2 | 10 (1.7) | 0 | 0 | 0 | 10 (0.6) |
0 | 554 (93.3) | 97 (100.0) | 263 (28.7) | 12 (20.0) | 926 (55.5) |
1 | 3 (0.5) | 0 | 312 (34.1) | 12 (20.0) | 327 (19.6) |
2 | 2 (0.3) | 0 | 149 (16.3) | 22 (36.7) | 173 (10.4) |
3 | 0 | 0 | 96 (10.5) | 12 (20.0) | 108 (6.5) |
4 | 0 | 0 | 96 (10.5) | 2 (3.3) | 98 (5.9) |
Infecting variant: No. (%) | |||||
Ancestral | 435 (73.2) | 43 (44.3) | 376 (41.0) | 13 (21.7) | 867 (52.0) |
Alpha | 0 | 9 (9.3) | 25 (2.7) | 24 (40.0) | 58 (3.5) |
Beta | 0 | 0 | 49 (5.3) | 1 (1.7) | 50 (3.0) |
Gamma | 1 (0.2) | 1 (1.0) | 105 (11.5) | 3 (5.0) | 110 (6.6) |
Epsilon | 13 (2.2) | 5 (5.2) | 16 (1.7) | 3 (5.0) | 37 (2.2) |
Zeta | 1 (0.2) | 0 | 74 (8.1) | 1 (1.7) | 76 (4.6) |
Iota | 0 | 0 | 4 (0.4) | 2 (3.3) | 6 (0.4) |
Delta | 0 | 0 | 7 (0.8) | 0 | 7 (0.4) |
Lambda | 0 | 16 (16.5) | 43 (4.7) | 0 | 59 (3.5) |
Mu | 0 | 0 | 53 (5.8) | 0 | 53 (3.2) |
No sequence | 144 (24.2) | 23 (23.7) | 164 (17.9) | 13 (21.7) | 344 (20.6) |
Abbreviations: BMI, body mass index (calculated as weight in kilograms divided by height in meters squared); CoVPN, COVID Vaccine Prevention Network.
Self-reported race is defined across all clinical sites. Participants were asked to select all applicable categories, including “Other”; Multiple indicates more than one self-reported category; Not reported indicates a missing response.
Indigenous people from South America were classified together with the American Indian or Alaska Native US and Mexico demographic according to the FDA definition (American Indian or Alaska Native: A person having origins in any of the original peoples of North and South America (including Central America), and who maintains tribal affiliation or community attachment). In this analysis, the Moderna, AstraZeneca, Janssen, and Novavax trials included 4, 1, 5, and 4 participants, respectively, who identified as American Indian or Alaskan Native from North America.
Days since COVID-19 onset is defined as the number of calendar days between protocol-defined onset of COVID-19 and the specimen collection corresponding to diagnosis. Negative days since onset in Moderna implies the positive swab was obtained before qualifying symptom onset.
Variability in Viral Load
For most participants, VL was highest near COVID-19 onset and declined over time, with considerable interindividual variation (Figure 2). All protocols measured VL with nasal and/or NP swabs at diagnosis, but collection method and frequency differed thereafter (eMethods in Supplement 1). Accordingly, the present analysis focuses on VL at diagnosis.
VL at diagnosis was highly variable, with a median (IQR) of 6.18 (4.66-7.12) log10 copies/mL. From the 3 protocols that provided undetectable (0) VL results, 68 of 1073 participants (6.3%) in the analysis cohort were in this category. Importantly, these participants at some point did meet the primary end point definition of symptomatic PCR-confirmed COVID-19, and therefore had another (positive) PCR swab associated with this infection. Distributions of log10 VL by trial, COVID-19 severity, SARS-CoV-2 variant, and days since disease onset are summarized in Figure 3, and univariate associations are summarized in eTable 2 in Supplement 1. Although viral sequence data would naturally be missing for participants with 0 or very low VL (given that amplification of viral RNA is necessary for sequencing), the 344 (20.6%) missing sequences included a wide range of VLs (Figure 3C).
Multivariate Model Associations With Viral Load
The multivariate model identified few independent factors associated with VL at diagnosis (Figure 4). Trial showed the strongest association (P = .02): participants in the Janssen trial had 0.54 log10 copies/mL lower mean VL compared to those in the Moderna trial (95% CI: 0.20 to 0.87 log10 copies/mL lower).
Although race did not show a statistically significant association overall, mean VL was 0.58 (95% CI, 0.99-0.16) log10 copies/mL lower among those who reported as Black or African American vs White, and 1.34 log10 (95% CI, 95% CI, 0.23-2.45) copies/mL higher for the 13 Moderna participants who reported other race and ethnicity vs White.
COVID-19 severity was not significantly associated with VL. There were likewise no differences in VL between countries or infecting variants. COVID-19 severity was not significantly associated with VL. Considering SARS-CoV-2 variants, the highest VL was observed for the 7 Delta infections, although the 95% CI for the mean difference was wide (0.67 log10 copies/mL higher vs ancestral virus; 95% CI, 1.40 lower to 2.74 higher log10 copies/mL). In total, the multivariate model explained just 5.9% of the variability in VL, indicating considerable unexplained variability even after accounting for all variables in the model.
Hamming Distance and Sensitivity Analyses
The multivariate model was also fit to participants with available sequence data using spike Hamming distances instead of variant; the analysis restricted attention to participants with sequence data. There was no association found between VL and Hamming distance, with a nonsignificant 0.01 log10 copies/mL lower mean VL per additional nucleotide difference from the ancestral strain (95% CI, 0.04 lower to 0.01 higher log10 copies/mL) (eMethods, eFigure 1, eTable 4 in Supplement 1).
The complete case analysis, wherein the multivariate model was fit to the subset of 1323 participants with sequence data (ie, without imputation), yielded similar results to the primary analysis (eFigure 2 in Supplement 1). Participants in the Janssen trial had 1.06 log10 copies/mL lower mean VL vs participants in the Moderna trial (95% CI, 1.35-0.76 log10 copies/mL lower; P < .001).
In an analysis restricted to participants in the analysis cohort with PCR-positive results at diagnosis, race was the only significant association (P = .006): compared with White-identifying participants, Black or African American participants had 0.59 log10 copies/mL lower mean VL (95% CI, 0.94 to 0.23 lower log10 copies/mL), and other race was associated with a 1.32 log10 copies/mL higher mean VL (95% CI, 0.39 to 2.29 higher log10 copies/mL) (eFigure 3 in Supplement 1). In this analysis, trial did not show a significant association with VL.
For the sensitivity analyses that separately accounted for local temporal trends, restricted to US participants, or restricted to ancestral SARS-CoV-2 infections, qualitative conclusions were unchanged. In each analysis, participants in the Janssen trial exhibited lower mean VL at diagnosis, but no associations were statistically significant (eFigures 4-6 in Supplement 1). Moreover, the sensitivity analysis restricted to Janssen participants did not identify any factors significantly associated with VL (eFigure 7 in Supplement 1).
Viral Load as a Predictor of Severe COVID-19
Among 916 participants in the analysis cohort from the Janssen trial, neither VL at diagnosis nor area under the VL curve (AUC-VL), interpreted as the mean VL over days 1 to 28 post–COVID-19 diagnosis, predicted COVID-19 severity (cv-AUC, 0.52 [95% CI 0.47 to 0.57]; and AUC-VL cv-AUC, 0.49 [95% CI, 0.42 to 0.57]) (eMethods, eFigure 8 in Supplement 1). Incorporating baseline participant characteristics, characteristics of COVID-19 diagnosis, and VL measurements were associated with improved predictive performance (cv-AUC, 0.71 [95% CI, 0.67 to 0.75]); however, variable importance measures suggest the dominant predictors were race and variant, and neither VL predictor was among the top 10 in the model (eFigure 9 in Supplement 1). Moreover, VL at diagnosis was not associated with improved predictions when added to a model including other baseline characteristics (eFigure 11 in Supplement 1).
Discussion
In a large and diverse cohort of immunologically naive participants with acute COVID-19, we observed considerable variability in VL at diagnosis, only a small fraction of which was explained by participant characteristics. Although not a statistically significant result, we estimated the highest mean VL among participants infected with Delta, consistent with previous literature.40,41 The strongest measured association with VL in our study was that with trial, which suggests timing of specimen collection or other factors associated with the specimen collection, storage, or VL assays across protocols may have influenced VL measurements.42,43,44,45 These results should temper expectations of future research comparing VL across trials or settings, especially given increasing diversity in preexisting immunity.
Intriguingly, we did not find an association between VL at diagnosis and severe COVID-19 in this largely outpatient setting. Neither VL at diagnosis nor averaged over days 1 to 28 post COVID-19 onset predicted severe disease. This may reflect that severe COVID-19 is typically caused by lower respiratory tract infection, which may not be detected by nasal and/or nasopharyngeal swabbing, and in the case of VL at diagnosis, may also reflect that severe disease may take days or weeks to fully manifest. This result contrasts with previous studies which have documented associations between VL at diagnosis46,47,48,49,50 or during the second week of infection40 and severe disease. Importantly, however, prior studies were conducted primarily in hospitalized populations, and our study is unique in its capture of individuals with COVID-19 symptoms in a primarily outpatient context. The observed variability in VL among these participants may undermine the use of VL as a proxy for clinical outcomes in this population. The utility of VL end points in future trials of prophylactic and therapeutic interventions should also be considered carefully given this result.
There are several possible explanations for the lower mean VL observed in the Janssen trial. Swabs from this trial were self-collected, which may have resulted in poorer sample quality. While Novavax also used self-collection, there were fewer samples from this trial, potentially obscuring statistical significance. It is also noteworthy that Janssen samples underwent RT-PCR diagnostic testing at study sites or at a local central laboratory and were then frozen and shipped to the central virology laboratory (University of Washington) to undergo confirmatory testing and VL quantification, resulting in at least 2 freeze-thaw cycles. It has been shown that multiple freeze-thaw cycles can degrade RNA specimens, potentially more substantially for low VL specimens, which would further decrease low VL measurements.51 Also, while our analysis excluded participants PCR-positive or seropositive at baseline, participants who were previously infected but seroreverted prior to enrollment may have been included; these individuals would likely have lower VL and be overrepresented in the Janssen trial (conducted later in the pandemic). Importantly, however, whether there is clinical importance to the estimated 0.54 log10 copies/mL lower mean VL in the Janssen trial is also uncertain.
Limitations
Our study has limitations. Differences across protocols in collection schedules and methods, specimen types, and timing after the onset of symptoms limited our analyses to VL measurements from a single time point. Furthermore, nasal swabs are subject to heterogeneity; less variability may have been observed if blood samples were analyzed. We were unable to evaluate any association between VL and transmission, because secondary cases were not assessed, and differences in available VL data across protocols limited our analyses to primary end point infections among immunologically naive participants, although analyses assessing the association between COVID-19 vaccination with VL will be reported separately. VL and COVID-19 severity analyses were limited to a single trial (Janssen), although it did include the largest number of COVID-19 events, countries, variants, and severe disease events among the trials included. Furthermore, our analyses were limited to the harmonized, adjudicated secondary end point definition of severe COVID-19; this included patients exhibiting prespecified signs and symptoms, most of whom were not hospitalized. It is also worth noting that our study does not characterize viral load in asymptomatic individuals; however, contact-tracing and household studies suggest that asymptomatic cases may transmit at a lower rate.52,53,54,55,56,57 Additionally, even with prompt PCR testing shortly after symptom onset, the collected specimens likely missed the peak VL and reflect the declining phase of the VL trajectory.58
Statistical comparisons were also limited by the data available. Importantly, laboratories and assays measuring VL differed across protocols, and were thus confounded with trial, preventing an analysis that adjusts for or stratifies by assay type. Because most participants in the analysis cohort who resided outside the US were enrolled in the Janssen trial, primary analyses could not fully disentangle country and study associations with VL. However, the fact that sensitivity analyses restricted to the Janssen trial detected no VL differences between countries, while US-restricted analyses still detected differences in VL between protocols, suggests that the variable with the strongest association with VL was indeed trial.
Conclusions
The large variability in VL that we observed in this secondary cross-protocol analysis has important implications. Studies evaluating mucosal COVID-19 vaccines, which are thought to potentially affect transmission as measured by VL, are especially relevant. Future studies will likely be conducted among even more diverse settings, including participants with a wide variety of infection and vaccination histories. Studies including participants with both symptomatic and asymptomatic SARS-CoV-2 infection are expected to have even greater variability in VL than what was observed here among exclusively symptomatic participants. Standardization in prompts for testing, collection, processing, storage, and assaying of specimens will be critical to minimize variability and allow the effects of interventions and other exposures to be evaluated.
References
- 1.World Health Organization . WHO Coronavirus (COVID-19) Dashboard. Accessed March 6, 2024. https://covid19.who.int/
- 2.Goyal A, Reeves DB, Cardozo-Ojeda EF, Schiffer JT, Mayer BT. Viral load and contact heterogeneity predict SARS-CoV-2 transmission and super-spreading events. Elife. Published online February 23, 2021. doi: 10.7554/eLife.63537 [DOI] [PMC free article] [PubMed]
- 3.Jones TC, Biele G, Mühlemann B, et al. Estimating infectiousness throughout SARS-CoV-2 infection course. Science. 2021;373(6551):eabi5273. doi: 10.1126/science.abi5273 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kawasuji H, Takegoshi Y, Kaneda M, et al. Transmissibility of COVID-19 depends on the viral load around onset in adult and symptomatic patients. PLoS One. 2020;15(12):e0243597. doi: 10.1371/journal.pone.0243597 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Marc A, Kerioui M, Blanquart F, et al. Quantifying the relationship between SARS-CoV-2 viral load and infectiousness. Elife. Published online September 27, 2021. doi: 10.7554/eLife.69302 [DOI] [PMC free article] [PubMed]
- 6.Puhach O, Meyer B, Eckerle I. SARS-CoV-2 viral load and shedding kinetics. Nat Rev Microbiol. 2023;21(3):147-161. doi: 10.1038/s41579-022-00822-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Singanayagam A, Hakki S, Dunning J, et al. ; ATACCC Study Investigators . Community transmission and viral load kinetics of the SARS-CoV-2 delta (B.1.617.2) variant in vaccinated and unvaccinated individuals in the UK: a prospective, longitudinal, cohort study. Lancet Infect Dis. 2022;22(2):183-195. doi: 10.1016/S1473-3099(21)00648-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.El Zein S, Chehab O, Kanj A, et al. SARS-CoV-2 infection: Initial viral load (iVL) predicts severity of illness/outcome, and declining trend of iVL in hospitalized patients corresponds with slowing of the pandemic. PLoS One. 2021;16(9):e0255981. doi: 10.1371/journal.pone.0255981 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Shenoy S. SARS-CoV-2 (COVID-19), viral load and clinical outcomes; lessons learned one year into the pandemic: a systematic review. World J Crit Care Med. 2021;10(4):132-150. doi: 10.5492/wjccm.v10.i4.132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Souverein D, van Stralen K, van Lelyveld S, et al. Initial severe acute respiratory syndrome coronavirus 2 viral load is associated with disease severity: a retrospective cohort study. Open Forum Infect Dis. 2022;9(7):ofac223. doi: 10.1093/ofid/ofac223 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Trunfio M, Venuti F, Alladio F, et al. Diagnostic SARS-CoV-2 cycle threshold value predicts disease severity, survival, and six-month sequelae in COVID-19 Symptomatic patients. Viruses. 2021;13(2):281. doi: 10.3390/v13020281 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Westblade LF, Brar G, Pinheiro LC, et al. SARS-CoV-2 viral load predicts mortality in patients with and without cancer who are hospitalized with COVID-19. Cancer Cell. 2020;38(5):661-671.e2. doi: 10.1016/j.ccell.2020.09.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Weinreich DM, Sivapalasingam S, Norton T, et al. ; Trial Investigators . REGEN-COV antibody combination and outcomes in outpatients with COVID-19. N Engl J Med. 2021;385(23):e81. doi: 10.1056/NEJMoa2108163 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chen P, Nirula A, Heller B, et al. ; BLAZE-1 Investigators . SARS-CoV-2 neutralizing antibody LY-CoV555 in outpatients with Covid-19. N Engl J Med. 2021;384(3):229-237. doi: 10.1056/NEJMoa2029849 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hammond J, Leister-Tebbe H, Gardner A, et al. ; EPIC-HR Investigators . Oral nirmatrelvir for high-risk, nonhospitalized adults with Covid-19. N Engl J Med. 2022;386(15):1397-1408. doi: 10.1056/NEJMoa2118542 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Gottlieb RL, Vaca CE, Paredes R, et al. ; GS-US-540-9012 (PINETREE) Investigators . Early remdesivir to prevent progression to severe Covid-19 in outpatients. N Engl J Med. 2022;386(4):305-315. doi: 10.1056/NEJMoa2116846 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Moser C, Li JZ, Eron JJ, et al. ; ACTIV-2/A5401 Study Team . Predictors of SARS-CoV-2 RNA from nasopharyngeal swabs and concordance with other compartments in nonhospitalized adults with mild to moderate COVID-19. Open Forum Infect Dis. 2022;9(11):ofac618. doi: 10.1093/ofid/ofac618 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Moser CB, Chew KW, Giganti MJ, et al. Statistical challenges when analyzing SARS-CoV-2 RNA measurements below the assay limit of quantification in COVID-19 clinical trials. medRxiv. Preprint posted online March 17, 2023. . doi: 10.1101/2023.03.13.23287208 [DOI] [PMC free article] [PubMed]
- 19.Pajon R, Paila YD, Girard B, et al. ; COVE Trial Consortium . Initial analysis of viral dynamics and circulating viral variants during the mRNA-1273 Phase 3 COVE trial. Nat Med. 2022;28(4):823-830. doi: 10.1038/s41591-022-01679-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lee MJ. Quantifying SARS-CoV-2 viral load: current status and future prospects. Expert Rev Mol Diagn. 2021;21(10):1017-1023. doi: 10.1080/14737159.2021.1962709 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Corey L, Mascola JR, Fauci AS, Collins FS. A strategic approach to COVID-19 vaccine R&D. Science. 2020;368(6494):948-950. doi: 10.1126/science.abc5312 [DOI] [PubMed] [Google Scholar]
- 22.Mena Lora AJ, Long JE, Huang Y, et al. ; COVID-19 Prevention Network . Rapid development of an integrated network infrastructure to conduct phase 3 COVID-19 vaccine trials. JAMA Netw Open. 2023;6(1):e2251974. doi: 10.1001/jamanetworkopen.2022.51974 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dunkle LM, Kotloff KL, Gay CL, et al. ; 2019nCoV-301 Study Group . Efficacy and safety of NVX-CoV2373 in adults in the United States and Mexico. N Engl J Med. 2022;386(6):531-543. doi: 10.1056/NEJMoa2116185 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.El Sahly HM, Baden LR, Essink B, et al. ; COVE Study Group . Efficacy of the mRNA-1273 SARS-CoV-2 vaccine at completion of blinded phase. N Engl J Med. 2021;385(19):1774-1785. doi: 10.1056/NEJMoa2113017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Falsey AR, Sobieszczyk ME, Hirsch I, et al. ; AstraZeneca AZD1222 Clinical Study Group . Phase 3 safety and efficacy of AZD1222 (ChAdOx1 nCoV-19) Covid-19 vaccine. N Engl J Med. 2021;385(25):2348-2360. doi: 10.1056/NEJMoa2105290 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Sadoff J, Gray G, Vandebosch A, et al. ; ENSEMBLE Study Group . Safety and efficacy of single-dose Ad26.COV2.S vaccine against Covid-19. N Engl J Med. 2021;384(23):2187-2201. doi: 10.1056/NEJMoa2101544 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Theodore DA, Branche AR, Zhang L, et al. ; COVID-19 Prevention Network (CoVPN) . Clinical and demographic factors associated with COVID-19, severe COVID-19, and SARS-CoV-2 infection in adults: a secondary cross-protocol analysis of 4 randomized clinical trials. JAMA Netw Open. 2023;6(7):e2323349. doi: 10.1001/jamanetworkopen.2023.23349 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.O’Toole Á, Scher E, Underwood A, et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021;7(2):veab064. doi: 10.1093/ve/veab064 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.World Health Organization . Tracking SARS-CoV-2 variants. Accessed April 18, 2024. https://www.who.int/activities/tracking-SARS-CoV-2-variants
- 30.Aluísio P, Hildete Prisco P, Pranab Kumar S. The use of hamming distance in bioinformatics. In: Ranajit C, Rao CR, Pranab Kumar S, eds. Handbook of Statistics. Elsevier; 2012:129-162. [Google Scholar]
- 31.Khare S, Gurry C, Freitas L, et al. GISAID’s Role in Pandemic Response. China CDC Wkly. 2021;3(49):1049-1051. doi: 10.46234/ccdcw2021.255 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations in R. J Stat Soft. Published online December 12, 2011. doi: 10.18637/jss.v045.i03 [DOI]
- 33.Holm S. A simple sequentially rejective multiple test procedure. Scand J Stat Theory Appl. 1979;6(2):65-70. [Google Scholar]
- 34.Lin Y, Yang B, Cobey S, et al. Incorporating temporal distribution of population-level viral load enables real-time estimation of COVID-19 transmission. Nat Commun. 2022;13(1):1155. doi: 10.1038/s41467-022-28812-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Tso CF, Garikipati A, Green-Saxena A, Mao Q, Das R. Correlation of Population SARS-CoV-2 Cycle Threshold Values to Local Disease Dynamics: Exploratory Observational Study. JMIR Public Health Surveill. 2021;7(6):e28265. doi: 10.2196/28265 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Polley EC, van der Laan MJ. Super learner in prediction. UC Berkeley Division of Biostatistics Working Paper Series. 2010. Accessed April 18, 2024. https://biostats.bepress.com/ucbbiostat/paper266
- 37.Williamson BD, Gilbert PB, Simon NR, Carone M. A General framework for inference on algorithm-agnostic variable importance. J Am Stat Assoc. 2023;118(543):1645-1658. doi: 10.1080/01621459.2021.2003200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Williamson BD, Gilbert PB, Carone M, Simon N. Nonparametric variable importance assessment using machine learning techniques. Biometrics. 2021;77(1):9-22. doi: 10.1111/biom.13392 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Marchese AM, Zhou X, Kinol J, et al. NVX-CoV2373 vaccine efficacy against hospitalization: A post hoc analysis of the PREVENT-19 phase 3, randomized, placebo-controlled trial. Vaccine. 2023;41(22):3461-3466. doi: 10.1016/j.vaccine.2023.04.054 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Puhach O, Adea K, Hulo N, et al. Infectious viral load in unvaccinated and vaccinated individuals infected with ancestral, Delta or Omicron SARS-CoV-2. Nat Med. 2022;28(7):1491-1500. doi: 10.1038/s41591-022-01816-0 [DOI] [PubMed] [Google Scholar]
- 41.Blanquart F, Abad C, Ambroise J, et al. Characterisation of vaccine breakthrough infections of SARS-CoV-2 Delta and Alpha variants and within-host viral load dynamics in the community, France, June to July 2021. Euro Surveill. 2021;26(37):2100824. doi: 10.2807/1560-7917.ES.2021.26.37.2100824 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Callahan C, Lee RA, Lee GR, Zulauf K, Kirby JE, Arnaout R. Nasal swab performance by collection timing, procedure, and method of transport for patients with SARS-CoV-2. J Clin Microbiol. 2021;59(9):e0056921. doi: 10.1128/JCM.00569-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Liu M, Li Q, Zhou J, et al. Value of swab types and collection time on SARS-COV-2 detection using RT-PCR assay. J Virol Methods. 2020;286:113974. doi: 10.1016/j.jviromet.2020.113974 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Rhoads DD, Pinsky BA. The truth about SARS-CoV-2 cycle threshold values is rarely pure and never simple. Clin Chem. 2021;68(1):16-18. doi: 10.1093/clinchem/hvab146 [DOI] [PubMed] [Google Scholar]
- 45.Viloria Winnett A, Porter MK, Romano AE, et al. Morning SARS-CoV-2 testing yields better detection of infection due to higher viral loads in saliva and nasal swabs upon waking. Microbiol Spectr. 2022;10(6):e0387322. doi: 10.1128/spectrum.03873-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Dogan L, Allahverdiyeva A, Önel M, et al. Is SARS-CoV-2 viral load a predictor of mortality in COVID-19 acute respiratory distress syndrome patients? J Int Med Res. Published online November 30, 2022. doi: 10.1177/03000605221137443 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Rao SN, Manissero D, Steele VR, Pareja J. A Systematic review of the clinical utility of cycle threshold values in the context of COVID-19. Infect Dis Ther. 2020;9(3):573-586. doi: 10.1007/s40121-020-00324-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Knudtzen FC, Jensen TG, Lindvig SO, et al. SARS-CoV-2 viral load as a predictor for disease severity in outpatients and hospitalised patients with COVID-19: a prospective cohort study. PLoS One. 2021;16(10):e0258421. doi: 10.1371/journal.pone.0258421 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Piqué B, Peña K, Riu F, et al. SARS-CoV-2 serum viral load and prognostic markers proposal for COVID-19 pneumonia in low-dose radiation therapy treated patients. J Clin Med. 2023;12(3):798. doi: 10.3390/jcm12030798 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Walsh KA, Jordan K, Clyne B, et al. SARS-CoV-2 detection, viral load and infectivity over the course of an infection. J Infect. 2020;81(3):357-371. doi: 10.1016/j.jinf.2020.06.067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Dzung A, Cheng PF, Stoffel C, et al. Prolonged unfrozen storage and repeated freeze-thawing of SARS-CoV-2 patient samples have minor effects on SARS-CoV-2 detectability by RT-PCR. J Mol Diagn. 2021;23(6):691-697. doi: 10.1016/j.jmoldx.2021.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Bender JK, Brandl M, Höhle M, Buchholz U, Zeitlmann N. Analysis of asymptomatic and presymptomatic transmission in SARS-CoV-2 outbreak, Germany, 2020. Emerg Infect Dis. 2021;27(4):1159-1163. doi: 10.3201/eid2704.204576 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Byambasuren O, Cardona M, Bell K, Clark J, McLaws ML, Glasziou P. Estimating the extent of asymptomatic COVID-19 and its potential for community transmission: systematic review and meta-analysis. J Assoc Med Microbiol Infect Dis Can. 2020;5(4):223-234. doi: 10.3138/jammi-2020-0030 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Methi F, Madslien EH. Lower transmissibility of SARS-CoV-2 among asymptomatic cases: evidence from contact tracing data in Oslo, Norway. BMC Med. 2022;20(1):427. doi: 10.1186/s12916-022-02642-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Rasmussen AL, Popescu SV. SARS-CoV-2 transmission without symptoms. Science. 2021;371(6535):1206-1207. doi: 10.1126/science.abf9569 [DOI] [PubMed] [Google Scholar]
- 56.Zhang C, Zhou C, Xu W, et al. Transmission risk of asymptomatic SARS-CoV-2 infection: a systematic review and meta-analysis. Infect Med (Beijing). 2023;2(1):11-18. doi: 10.1016/j.imj.2022.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Buitrago-Garcia D, Ipekci AM, Heron L, et al. Occurrence and transmission potential of asymptomatic and presymptomatic SARS-CoV-2 infections: update of a living systematic review and meta-analysis. PLoS Med. 2022;19(5):e1003987. doi: 10.1371/journal.pmed.1003987 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Killingley B, Mann AJ, Kalinova M, et al. Safety, tolerability and viral kinetics during SARS-CoV-2 human challenge in young adults. Nat Med. 2022;28(5):1031-1041. doi: 10.1038/s41591-022-01780-9 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.