Skip to main content
Microbiology Spectrum logoLink to Microbiology Spectrum
. 2021 Jul 14;9(1):10.1128/spectrum.00086-21. doi: 10.1128/spectrum.00086-21

Saliva Testing Is Accurate for Early-Stage and Presymptomatic COVID-19

Abigail J Johnson a, Shannon Zhou b, Susan L Hoops c, Benjamin Hillmann c, Matthew Schomaker d, Robyn Kincaid d, Jerry Daniel e, Kenneth Beckman f, Daryl M Gohl e,g, Sophia Yohe f, Dan Knights b,c,#, Andrew C Nelson f,✉,#
Editor: Heba H Mostafah
PMCID: PMC8552797  PMID: 34259552

ABSTRACT

Although nasopharyngeal samples have been considered the gold standard for COVID-19 testing, variability in viral load across different anatomical sites could cause nasopharyngeal samples to be less sensitive than saliva or nasal samples in certain cases. Self-collected samples have logistical advantages over nasopharyngeal samples, making them amenable to population-scale screening. To evaluate sampling alternatives for population screening, we collected nasopharyngeal, saliva, and nasal samples from two cohorts with varied levels and types of symptoms. In a mixed cohort of 60 symptomatic and asymptomatic participants, we found that saliva had 88% concordance with nasopharyngeal samples when tested in the same testing lab (n = 41) and 68% concordance when tested in different testing labs (n = 19). In a second cohort of 20 participants hospitalized for COVID-19, saliva had 74% concordance with nasopharyngeal samples tested in the same testing lab but detected virus in two participants that tested negative with nasopharyngeal samples on the same day. Medical record review showed that the saliva-based testing sensitivity was related to the timing of symptom onset and disease stage. We find that no sample site will be perfectly sensitive for COVID-19 testing in all situations, and the significance of negative results will always need to be determined in the context of clinical signs and symptoms. Saliva retained high clinical sensitivity for early-stage and presymptomatic COVID-19 while allowing easier collection, minimizing the exposure of health care workers, and need for personal protective equipment and making it a viable option for population-scale testing.

IMPORTANCE Methods for COVID-19 detection are necessary for public health efforts to monitor the spread of disease. Nasopharyngeal samples have been considered the best approach for COVID-19 testing. However, alternative samples like self-collected saliva offer advantages for population-scale screening. Meta-analyses of recent studies suggest that saliva is useful for detecting SARS-CoV-2; however, differences in disease prevalence, sample collection, and analysis methods still confound strong conclusions on the utility of saliva compared to nasopharyngeal samples. Here, we find that the sensitivity of saliva testing is related to both the timing of the sample collection relative to symptom onset and the disease stage. Importantly, several clinical vignettes in our cohorts highlight the challenges of medical decision making with limited knowledge of the associations between laboratory test data and the natural biology of infection.

KEYWORDS: COVID-19, SARS-CoV-2, saliva, testing, symptoms

INTRODUCTION

Throughout the COVID-19 pandemic, the spread of infection has significantly outpaced laboratory testing to identify severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Seroprevalence studies performed in March to May of 2020 in the United States suggested that the number of infections was 10-fold greater than confirmed laboratory diagnoses (1). This scenario significantly challenged public health efforts to monitor and contain the spread of disease. Nasopharyngeal samples have been considered the gold standard for COVID-19 testing. However, alternative samples, like self-collected saliva, offer advantages for population-scale screening and may perform well in specific clinical situations.

A number of studies comparing saliva, oral, and/or nasal samples with nasopharyngeal samples have reported heterogeneity in sensitivity or positive percent agreement (PPA), ranging from 66% to 98% (212); this heterogeneous performance is likely impacted by differences in patient populations and methods of sample collection and processing. For example, optimization of saliva sample processing within one institution improved the performance of this sample type across two sequential studies (3, 5). More importantly, clinical test performance is dependent on preanalytic variables, such as collection timing relative to the patient’s disease course and anatomic site of collection. A study of inpatients at an advanced disease stage showed that lower respiratory samples (bronchoalveolar lavage) were more frequently positive (93%) than pharyngeal (32%) or nasal (63%) samples (13). Further studies of samples from different anatomic sites at different points in disease course are necessary to better understand how these variables impact clinical test performance.

Here, we acquired patient-collected saliva and anterior nasal research specimens for comparison with concurrent provider-collected nasopharyngeal samples in both outpatient and inpatient settings. The clinical context of specimen collection, the timing of sample collection during disease course, and the analytical performance of different molecular tests were assessed for their impact on the test result agreement between different anatomic sites.

RESULTS

Study design and cohort definition.

Two distinct patient cohorts were included in the study (Fig. 1). Cohort 1 consisted of 354 patients with clinical nasopharyngeal results from tests collected in outpatient or emergency department settings. Cohort 1 study participants, presenting with both symptomatic and asymptomatic concerns for COVID-19, were enrolled opportunistically from a population receiving a nasopharyngeal COVID-19 test in outpatient screening or emergency department (ED) settings. Due to limited testing resources available during the study, we relied on nasopharyngeal results from routine clinical testing. Nasal and saliva samples were prospectively collected and biobanked for retrospective testing (see Materials and Methods). Analysis per design was performed on 30 positive and 30 negative samples, in accordance with the recommended clinical evaluation guidelines in the United States Food and Drug Administration (FDA) Policy for Coronavirus Disease-2019 Tests During the Public Health Emergency. After 30 positive cases were detected by clinical nasopharyngeal samples, enrollment was stopped and 30 negative cases were randomly selected for comparison.

FIG 1.

FIG 1

Cohort flow diagram. Cohort 1 was enrolled prospectively at outpatient screening and emergency department locations for sample collection and biobanking. When the prespecified 30 positive samples were acquired, retrospective testing of saliva and anterior nasal samples was performed for 30 positives and 30 randomly selected negative samples. Heterogeneity in clinical nasopharyngeal (NP) test routing during the pandemic response necessitated subcohort analysis of cohorts 1A and 1B. Cohort 2 was enrolled and tested prospectively at an inpatient ward dedicated to COVID-19 patient care. Clinical sensitivity analysis is performed on all 20 patients enrolled in Cohort 2, while 2 patients were excluded from analytic sensitivity analysis due to negative test results for all three anatomic sites.

Due to clinical test triage patterns outside the study’s purview, clinical nasopharyngeal samples were routed to different test platforms within the hospital system. We divided cohort 1 into two groups to reflect testing methods used for the clinical nasopharyngeal test. Cohort 1A (n = 41, outpatient) had clinical nasopharyngeal samples and biobanked saliva and nasal samples tested on our Clinical Laboratory Improvement Amendments-certified laboratory-developed test (CLIA-LDT). Cohort 1B (n = 19, ED) had clinical nasopharyngeal sample testing performed on one of two different commercial testing platforms (see details in Materials and Methods and the supplemental material). All saliva and nasal samples from cohort 1 were prospectively collected, biobanked, and then tested on the CLIA-LDT.

Cohort 2 consisted of 20 inpatients admitted to a dedicated COVID-19 ward for symptomatic disease management following prior positive laboratory testing for SARS-CoV-2 (range, 1 day to 4 weeks prior to admission). In this group, nasopharyngeal, saliva, and nasal samples were obtained at the same time from participants for concurrent analysis on the CLIA-LDT. Both cohorts were recruited, consented, and enrolled under protocols approved by the University of Minnesota’s Institutional Review Board (cohorts 1A and 1B, STUDY00009393; cohort 2, STUDY00009560).

Cohort characteristics.

Cohort 1A had an average age of 39.6 years and was skewed toward female gender, at 92.7%. Cohort 1B had an average age of 44.5 years and was skewed toward male gender, at 63.2%. Reliable clinical notes to establish symptoms were available for 18 of 41 patients in cohort 1A and 16 of 19 patients in 1B. Symptomatic patients comprised approximately 80% of evaluable patients in both cohorts 1A (n = 14/18) and 1B (n = 13/16), although a higher proportion of symptomatic patients in cohort 1B had positive clinical test results (92.3%) versus 1A (64.3%). Four asymptomatic patients in cohort 1A had negative clinical tests. Of 3 asymptomatic patients in cohort 1B, one had a positive clinical test and two had negative clinical tests. Cohort 2 had an average age of 62.5 years, consistent with the increased hospitalization rate of older COVID-19 patients. Gender was equally distributed in cohort 2 (Table 1).

TABLE 1.

Cohort characteristicsa

Characteristic Value for cohort:
1A 1B 2
n 41 19 20
Age (yr, ±SD) 39.6 ± 10.9 44.5 ± 20.1 62.5 ± 18.1
Sex (% female) 92.7 36.8 50.0
% Participants with symptoms 77.8 81.3 100.0
% Participants with symptoms who are positive 64.3 92.3 90.0
% Participants without symptoms who are positive 0.0 33.3 0.0
N1_NP tests with CT values (n) 16 14 16
N1_Nasal tests with CT values (n) 14 8 11
N1_Saliva tests with CT values (n) 13 8 12
N2_NP tests with CT values (n) 16 14 16
N2_Nasal tests with CT values (n) 13 7 11
N2_Saliva tests with CT values (n) 13 8 14
a

N1 and N2 refer to primer-probe sets for the SARS-CoV-2 N gene (N1 and N2 targets). NP, nasal pharyngeal; CT, cycle threshold.

Concordance of testing on saliva, nasal, and nasopharyngeal samples.

Clinical nasopharyngeal testing results in cohort 1A (n = 41) identified 16 positive and 25 negative patients. Testing of banked saliva and nasal samples showed 87.5% PPA and 95.1% overall percent agreement (OPA), respectively, compared to nasopharyngeal samples (Table 2; see also Table S1 in the supplemental material). Saliva and nasal results were 100% concordant in cohort 1A.

TABLE 2.

PPA and OPA for samplesa

Cohort % PPA and OPA for:
Nasal versus NP
Saliva versus NP
Saliva versus nasal
PPA OPA PPA OPA PPA OPA
1A (n = 41) 87.5 95.1 87.5 95.1 100.0 100.0
2 (n = 20) 68.8 75.0 75.0 70.0 81.8 65.0
1B (n = 19) 57.1 68.4 57.1 68.4 87.5 89.5
a

PPA, positive percent agreement; OPA, overall percent agreement; NP, nasal pharyngeal.

Clinical nasopharyngeal testing on two different commercial platforms in cohort 1B (n = 19) produced 14 positive and 5 negative results. Testing banked saliva and nasal samples on the CLIA-LDT provided more discordant results: the PPA for both saliva and nasal samples against NP was 57.1%. This comparison is subject to platform bias. However, separate internal quality assurance data (see the supplemental material) showed 90% and 97% positive agreement of archived nasopharyngeal samples comparing each of the two commercial assays to the CLIA-LDT (Tables S4 and S5), suggesting that platform bias is not the only reason for the level of discordance observed across anatomic sample types in cohort 1B. Further, there were two discrepancies between nasal and saliva sample results (both tested on the CLIA-LDT) producing 87.5% PPA and 89.5% OPA between these sample types (Table S2).

Simultaneous collection and immediate testing of all three sample types for cohort 2 on the CLIA-LDT demonstrated 16 positive nasopharyngeal, 14 positive saliva, and 11 positive nasal samples from 20 patients. The PPA ranged from 69% to 82% and the OPA from 65% to 75% for all pairwise comparisons (Table 2). Saliva sampling identified 2 positive patients who tested negative by nasopharyngeal sampling, and inversely nasopharyngeal testing identified 4 positive patients who tested negative by saliva (Table S3). Nasal samples performed poorly in this patient cohort.

We used a composite reference standard approach to assess diagnostic sensitivity in cohort 2 (14, 15). All patients positive for SARS-CoV-2 by at least one upper respiratory sample type (n = 18) were considered a reference positive case for the assessment of sensitivity. Using this approach, the diagnostic sensitivity for each sample type was 89% for nasopharyngeal, 78% for saliva, and 61% for nasal.

Comparison of cycle threshold results across sample type.

Cycle threshold (CT) values for each anatomic sample site were compared in cohorts 1A and 2, and all samples were analyzed on the same analytical platform. The interquartile ranges for both N1 and N2 were largely overlapping (Fig. 2A, Fig. S1A). In cohort 1A, the 2 discordant nasopharyngeal sample data points had CT values of 36.6 and 38.4, near the assay limit of detection. Cohort 2 had more discordant data points, consistent with the higher average CT value (lower relative viral load) observed in all anatomic sample types in this inpatient cohort tested at later disease stages. Discordant data points in this cohort ranged as low as 26.2, suggesting greater biologic variability in relative viral load between anatomic sites in patients at later disease stages. Similar results were observed with relative viral load (Fig. S2).

FIG 2.

FIG 2

Cycle thresholds are similar across anatomic sites, and cycle threshold agreement varies by anatomic site and cohort. (A) Box plots showing cycle threshold (CT) values for cohort 1A (n = 41) and 2 (n = 18) from the CLIA-LDT assay for nasal pharyngeal (NP), nasal, and saliva samples using N1 primers. Black circles are samples with concordance between nasal or saliva samples and nasopharyngeal samples (positive/positive or negative/negative). Gray squares show discordance between nasal or saliva samples and nasopharyngeal samples (positive/negative). Groups are compared using Kruskal-Wallis test by ranks. (B) Scatterplots of CT values from N1 primers for concordant samples from different sample types: saliva versus nasopharyngeal, nasal versus nasopharyngeal, and nasal versus saliva. Correlation was assessed using Pearson’s correlation. The correlation coefficient (r) and P value for each comparison are indicated on each plot. See Fig. S1 for analogous analysis of N2 primers and Fig. S2 for analysis of relative viral load.

Comparison of N1 (Fig. 2B) and N2 (Fig. S1B) CT values between sample types from the same patient showed that saliva samples were heterogeneous compared to both nasopharyngeal and nasal samples, generally showing low, nonsignificant correlation. The correlation of nasopharyngeal versus nasal CT values was relatively tighter, with three of four comparisons trending toward or reaching statistical significance.

Bland-Altmann analysis using samples from cohort 1A and cohort 2 demonstrated a mean bias of −3.98 for N1 target CT values and −5.3 for N2 target CT values, indicating average lower CT values for NP samples than saliva samples (Fig. 3A and B). Comparison of nasal and NP samples demonstrated a smaller mean bias of −2.07 for N1 and −3.73 for N2, again showing lower average CT values for NP samples (Fig. 3C and D). Heterogeneity around the mean bias was observed with relatively large limits of agreement (95% confidence ranges). There was no significant skewing or proportional difference at early or later CT values.

FIG 3.

FIG 3

Bland-Altman analysis reveals minimal proportional bias between saliva and nasal cycle thresholds relative to NP. Bland-Altman (mean-difference) plots show the relationship between average cycle threshold (CT) and the difference between CT for saliva versus nasopharyngeal (NP) samples for N1 (A) and N2 (B) primers; and nasal versus nasopharyngeal for N1 (C) and N2 (D) primers. The solid black line shows the mean bias for each comparison. Dashed lines represent the limits of agreement (95% confidence interval) around the mean bias (±1.96 standard deviation [SD]). Solid gray line shows the linear relationship between the mean and difference, with dashed gray lines showing the 95% confidence interval for this relationship. Pearson’s correlation (r) and P value are reported for the correlation between the mean and the difference.

Correlation of patient symptoms with test results.

Symptoms associated with COVID-19 were recorded from the medical record, scored, and used to calculate the probability of COVID [P(Covid)] according to the Menni et al. prediction model (16). Adequate records to calculate P(Covid) were available for 17 of 41 patients in cohort 1A (outpatient), 16 of 19 patients in cohort 1B (ED), and 19 of 20 patients in cohort 2 (inpatient). Two patients in cohort 1A had elevated P(Covid) scores (>0.5) but false-negative saliva and nasal samples (Fig. 4). Interestingly, both were being retested due to persistent symptoms at 2 and 4 weeks, respectively, after onset of their laboratory-confirmed COVID-19. No clear pattern of symptoms within cohort 2 was evident in relation to the concordant or discordant test results from the three anatomic sites (Fig. 4). In cohort 1B, discrepancies between elevated P(Covid) scores and negative saliva tests (Fig. S3A) occurred in three patients without objective fever or oxygen saturation abnormalities who had complete resolution of subjective symptoms in <48 h. We note that the P(Covid) score is heavily weighted toward loss of taste or smell, a subjective symptom. Overall, saliva and nasal samples demonstrated complete clinical agreement with nasopharyngeal samples in patients with low P(Covid) scores (<0.5) who were tested when initial symptoms developed, suggesting that saliva and nasal samples perform well in the population screening setting.

FIG 4.

FIG 4

Symptom heat-map for agreement between positive and negative test results by test method and cohort. Heat map columns are participants and rows are symptoms. Symptom presence is indicated as either severe or yes (purple), no (yellow), or unknown (white). Symptoms are clustered by similarity, and clustering is indicated by the dendrogram (left). Test results are indicated by color as either positive (black) or negative (light gray) for nasopharyngeal (NP), saliva, and nasal samples. Each participant’s calculated probability of COVID-19 [P(Covid)] is indicated with higher probabilities shown in purple and lower probabilities in yellow. Participant sex, age, and cohort are also indicated.

Limitations.

This study has a number of limitations due largely to technical and logistical constraints. The study was not intended to be a case-control study but rather was intended to meet the emergency use authorization guidance from the FDA for clinical assessment of test performance with these alternative sample types in comparison to what was considered at the time a gold standard sample for respiratory virus detection. At the time this study was conducted, the guidance was to obtain at least 30 positive and 30 negative samples for clinical assay validation; therefore, there was no power analysis performed prior to study initiation. The study is underpowered to fully address and detect differences in sample type performance, which was further complicated by the unplanned heterogeneity in clinical nasopharyngeal testing in cohort 1. In cohort 1B there is also technical bias between the commercial and CLIA-LDT platforms, as discussed above, that confounds the results and interpretation of that subcohort. The cohorts in this study were not selected with the intention of comprehensively assessing disease prevalence or positive or negative predictive values. Therefore, these results should be interpreted cautiously given small sample sizes and the potential for bias in study cohort selection and uncertain pretest probability or prevalence of COVID-19 in the two cohorts (17). We note that cohort 2 was intentionally enriched for cases in order to assess agreement of the methods in patients with clinically significant COVID-19 diagnosis.

There are several limitations related to the manner in which sampling was carried out. There is a possibility that the order of sampling, with nasopharyngeal performed first, could introduce bias. Nasopharyngeal samples can also detect more historic cases, possibly due to viral integration into the nasopharyngeal epithelium (18). This could confound results when testing for new-onset disease and could cause an underestimation of the sensitivity of saliva and nasal samples for active infection. Opportunistic collection in emergency departments in cohort 1B, especially during the high-pressure setting of a pandemic, may also have had an impact on the quality of research sample collection and may have impacted the skewed results observed in cohort 1B.

DISCUSSION

COVID-19 is a challenge to diagnose and treat at the individual patient level and to manage through public health measures at the community level. Alternative sample types, such as patient-acquired saliva and anterior nasal samples, are needed to facilitate high-volume testing when there are shortages of qualified health care workers and necessary supplies to support provider-acquired nasopharyngeal sampling. Our study sought to compare the performance of saliva and anterior nasal samples with standard-of-care NP samples for SARS-CoV-2 detection in different clinical settings. We observed variability in test results dependent on the clinical setting, where samples were collected and the heterogeneous pattern of clinical test routing that was an uncontrolled variable in cohort 1. Our results highlight some of the challenges facing diagnostic characterization of COVID-19 and indicate good performance of saliva and anterior nares (nasal) samples to detect SARS-CoV-2 virus when patients present for outpatient testing early in their disease course.

Variability in laboratory test results is caused by both preanalytical and analytical factors. COVID-19 is a biologically heterogeneous infection: the anatomic distribution of active viral replication and the severity of symptoms compared to viral load are variable between different patients and during each individual’s disease course. These are important preanalytical factors impacting test performance on samples from different anatomical sites. A recent systematic review and meta-analysis of saliva samples found that the sensitivity of saliva (83%) was similar to that of nasopharyngeal samples (84%), with diagnostic equivalency highest in the ambulatory setting (19). Additionally, recent preprint data have been released supporting the hypothesis that saliva viral load is a strong clinical predictor of COVID-19 severity (20). These data overall support the important role saliva testing can play in assessing COVID-19.

We observed good PPA in cohort 1A between the testing of banked saliva and nasal samples compared to the clinical nasopharyngeal result. Analytical variation was minimized in this cohort, with all samples tested using the same reverse transcription-PCR (RT-PCR) assay. Saliva and nasal samples showed complete agreement, and only two patients (of 16 total positive) had false-negative saliva and nasal sample results compared to nasopharyngeal results. These patients had previous laboratory-confirmed COVID-19 infections and were being retested due to persistent symptoms. The clinical nasopharyngeal sample for both of these patients had CT values consistent with viral loads below the 95% confidence limit of detection for the clinical assay. Therefore, the false-negative results in these cases could be due to degradation of the low viral load during the freeze-thaw cycle inherent to the preanalytical study sample handling for this cohort. Considering only patients presenting within the first 10 days of symptom onset (n = 14), saliva, nasal, and nasopharyngeal samples had 100% PPA, suggesting that all three anatomic sample sites perform well for initial diagnosis of COVID-19.

Biologic heterogeneity regarding persistence and anatomic distribution of viral replication later in COVID-19 course was apparent in cohort 2 (inpatient setting). An analytical advantage of cohort 2 was the prospective, parallel collection of all three sample types with simultaneous handling and testing on the same platform. In this setting, no single anatomic site provided a positive result for all 18 patients with at least one positive sample type. Nasopharyngeal and saliva samples performed better than nasal samples; however, the combination of an observed, patient-collected nasal and saliva sample together detected the same number of unique positive patients (n = 16) as did the provider-collected nasopharyngeal sample. Theoretically, combining patient-collected saliva and nasal samples into the same collection-stabilization buffer could improve clinical sensitivity.

The data from cohort 1B are difficult to interpret confidently. The unintended use of different molecular tests for the clinical NP test was an analytical confounder discussed above as platform bias. The reported limit of detection for the commercial platforms (250 to 500 viral copies/ml) is similar to the CLIA-LDT (560 copies/ml), but these are not calibrated on the same reference material. Independent clinical quality assurance data from our laboratory comparing replicate testing of nasopharyngeal samples between platforms showed PPA of 90 to 97% (see Tables S4 and S5 in the supplemental material), a level of agreement aligned with published data using standard-of-care nasopharyngeal samples (2123). This suggests additional preanalytical variables impacted the poor concordance observed in cohort 1B. The clinical vignettes of discrepant results across anatomical sites and testing platforms in cohort 1B highlight the challenges of medical decision-making with limited knowledge of the associations between laboratory test data and the natural biology of SARS-CoV-2 infection.

Our results highlight the importance of clinical judgment and integration of the patient’s medical presentation when interpreting individual test results. Understanding how anatomic site, timing of collection in disease course, handling and transport of the specimen, and analytical platform can influence test results is crucially important to making informed medical decisions regarding COVID-19 management (24). Continued study of these factors in diverse clinical settings is necessary for the medical field to improve the response to this global pandemic. Overall, our findings support the conclusion that self-collected saliva testing is effective for COVID-19 detection, especially in early stages of disease progression.

MATERIALS AND METHODS

Sample collection, handling, and biobanking.

Nasopharyngeal samples were collected as posterior nasopharynx swabs into viral or universal transport medium by a health care provider for all participants by following standard clinical procedures via sampling through a single nostril. Study saliva and anterior nares (nasal) samples were patient self-collected under direct observation immediately following collection of the nasopharyngeal sample. Participants were instructed to collect 1 ml of saliva before swabbing each nostril. Study participants were given saliva testing kits (item OM-505; DNA Genotek, Ottawa, ON, Canada) and anterior nares (nasal) testing kits (item OCD-100; DNA Genotek, Ottawa, ON, Canada) with written and verbal instructions from monitoring health care workers (Fig. S4). Patients were instructed to spit oral saliva into the collection device; patients were not instructed to expectorate mucous into the sample. Both of these kits deposit the patient-collected sample into a proprietary buffer that inactivates virus and stabilizes nucleic acids.

In cohort 1, liquid saliva samples mixed with buffer, and nasal swabs in buffer were transported at room temperature and stored at −20°C, according to instructions from the manufacturer. Selected research samples were thawed at room temperature immediately prior to analysis on the CLIA-LDT. In cohort 1, nasopharyngeal swab samples were collected, handled, and processed immediately according to the normal course of clinical testing through the health care system (cohort 1A, CLIA-LDT; cohort 1B, commercial assays).

In cohort 2, all samples were collected simultaneously within 48 h of admission. The three sample types were transported together at room temperature to the CLIA-LDT testing facility and processed within 24 h.

Sample extraction and molecular testing.

An RT-PCR based on primer-probe sets for the SARS-CoV-2 N gene (N1 and N2 targets) and human control RNase P (RP) published by the United States Centers for Disease Control was validated for clinical use (25) by following the regulatory requirements of CLIA and the Federal Drug Administration’s Emergency Use Authorization criteria (CLIA-LDT). A sample was reported positive for SARS-CoV-2 if either N1 or N2 viral targets were detected with a CT of <40 passing data quality control. Negative samples required the internal RP control was detected with a CT of <38. Extraction of clinical nasopharyngeal samples for testing on this CLIA-LDT assay was performed with either the Qiagen QIAamp viral RNA mini prep (number 52906; Qiagen, Germantown, MD) or the Promega Maxwell RSC viral total nucleic acid kit (number AS1330; Promega, Madison, WI) on the Maxwell RSC instrument per the manufacturer’s instructions; bridging studies showed equivalent limits of detection between these two methods (560 viral copies/ml; internal validation data). The sample input volume was 100 μl, and the elution volume was 50 μl for both extraction methods. For saliva and nasal samples obtained in sample buffer, nucleic acid extraction was performed only with the Promega Maxwell method as described above, per the collection device manufacturer’s (DNA Genotek) advice. For cohort 1B, nasopharyngeal swab sample testing was performed on the commercial Cepheid Xpert Xpress SARS-CoV-2 test (published limit of detection, 250 copies/ml) or the Diasorin Simplexa COVID-19 direct assay (published limit of detection, 500 copies/ml) by following manufacturer’s instructions. See the supplemental material for additional performance data comparing the CLIA-LDT and commercial assays.

Medical record review and symptom scoring.

Medical records from telehealth, clinic, or hospital visits were reviewed for relevant symptoms, including loss of taste or smell, shortness of breath, cough, sore throat, fatigue, diarrhea, nausea or vomiting, loss of appetite, chest pain, and myalgia or headache. Physician and nursing notes immediately prior to the initial testing date and over the potentially symptomatic period (approximately 10 to 20 days after testing positive) were reviewed. Subjective symptoms were coded as either present (yes) or absent (no) or as mild, moderate, or severe when the medical record stated severity. In cases where an individual reported at least one symptom; the symptoms that were not reported were considered absent (no). In cases where there was no report of symptoms, symptoms were coded as NA. Objective signs of interest were defined as elevated body temperature (fever) and decreased oxygen saturation based on documentation in the medical record. Data abstraction from the medical records was completed by one study author and reviewed for accuracy by a second author.

Statistical analysis.

Group mean variation between nasopharyngeal, saliva, and nasal samples were assessed with the Kruskal-Wallis rank sum test. Correlations between methods were assessed using Pearson correlation coefficient. Bland-Altman analysis was used to assess method agreement between saliva or nasal samples relative to nasopharyngeal. PPA was calculated as the proportion of comparative method positives where the test method was positive. Overall percent agreement (OPA) was calculated as the proportion of tests where the test and comparative method agreed. To generate a symptom heat map, symptoms were recoded following the COVID-19 probability score [P(Covid)] prediction equation by Menni et al. (16). Mild cough was recoded as No and only severe fatigue was coded as Yes. Heat map rows are clustered by similarity using the complete linkage method, and columns are sorted by cohort, method concordance, P(Covid), age, and sex. Average cycle threshold was calculated for nasopharyngeal, saliva, and nasal samples as the mean CT for N1 and N2. Relative viral load was calculated as [(2(RP-N1)) + (2(RP-N2))]/2. Data analysis and data visualization was completed using R version 3.4.3 (26) and the following packages: ggplot2 version 3.2.1 (27), cowplot version 0.9.4 (28), tidyverse version 1.3.0 (29), reshape 2 version 1.4.3 (30), pheatmap version 1.0.12 (31), viridis version 0.5.1 (32), RColorBrewer version 1.1-2 (33), and ggpubr 0.2.4 (34).

Supplementary Material

Reviewer comments
reviewer-comments.pdf (194.7KB, pdf)

ACKNOWLEDGMENTS

We thank Beth Jorgenson, Sandra Tekmen, Krista Goldsmith, Andrew Snyder, Stephanie McGlone, and Jill Cordes for their efforts in coordination of sample collection. We thank Tyler Bold, Peter Southern, and their laboratory staff for sample management and safety protocols. We acknowledge the significant efforts toward the COVID-19 testing effort made by faculty and staff at the University of Minnesota Genomics Center (Benjamin Auch, Patrick Grady, Darrell Johnson, Ray Watson, Lindsey Gengelbach, Dinesha Walek, Paige Marsolek, Shea Anderson and many others), the Department of Laboratory Medicine & Pathology (Leo Furcht, Patricia Ferrieri, Anthony Killeen, Bharat Thyagarajan, Amy Karger, and Sophie Arbefeville), the Department of Microbiology & Immunology (Ryan Langlois and Ashley Haase), and the Medical School Office of the Dean (Jakub Tolar, Timothy Schacker, Lisa Johnson, and staff). We thank DNA Genotek (Ottawa, Canada) for donating the nasal and saliva kits, and Rafal Iwasiow for general advice and for suggesting useful references. We thank the healthcare staff at the collection sites who took the time to enroll subjects and collect additional samples during the pandemic, and we thank the research subjects for providing additional samples. Finally, we thank the leadership and countless clinical laboratory staff from M Health Fairview, including Kylene Karnuth, Shannon Gascoigne, Michaela Leary, Jessica Gunderson, Klint Kjeldahl, Wendy Walters, and many others.

D.K. and D.M.G. serve as Senior Scientific Advisors to Diversigen, a company involved in the commercialization of microbiome analysis. A.J.J., S.Z., S.L.H., B.H., M.S., R.K., J.D., K.B., S.Y., and A.C.N. have no conflict.

This work was supported by University of Minnesota Office of the Vice President for Research COVID19 rapid response grant number 06 to D.K.

Footnotes

Supplemental material is available online only.

SUPPLEMENTAL FILE 1
Supplemental material. Download SPECTRUM00086-21_Supp_1_seq7.pdf, PDF file, 0.4 MB (455.9KB, pdf)

Contributor Information

Andrew C. Nelson, Email: nels2055@umn.edu.

Heba H. Mostafa, Johns Hopkins Hospital

REFERENCES

  • 1.Havers FP, Reed C, Lim T, Montgomery JM, Klena JD, Hall AJ, Fry AM, Cannon DL, Chiang C-F, Gibbons A, Krapiunaya I, Morales-Betoulle M, Roguski K, Rasheed MAU, Freeman B, Lester S, Mills L, Carroll DS, Owen SM, Johnson JA, Semenova V, Blackmore C, Blog D, Chai SJ, Dunn A, Hand J, Jain S, Lindquist S, Lynfield R, Pritchard S, Sokol T, Sosa L, Turabelidze G, Watkins SM, Wiesman J, Williams RW, Yendell S, Schiffer J, Thornburg NJ. 2020. Seroprevalence of antibodies to SARS-CoV-2 in 10 sites in the United States, March 23-May 12, 2020. JAMA Intern Med 180:1576. doi: 10.1001/jamainternmed.2020.4130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Patel MR, Carroll D, Ussery E, Whitham H, Elkins CA, Noble-Wang J, Rasheed JK, Lu X, Lindstrom S, Bowen V, Waller J, Armstrong G, Gerber S, Brooks JT. 2020. Performance of oropharyngeal swab testing compared to nasopharyngeal swab testing for diagnosis of COVID-19 -United States, January-February 2020. Clin Infect Dis off Publ Infect Dis Soc Am doi: 10.1093/cid/ciaa759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Wyllie AL, Fournier J, Casanovas-Massana A, Campbell M, Tokuyama M, Vijayakumar P, Warren JL, Geng B, Muenker MC, Moore AJ, Vogels CBF, Petrone ME, Ott IM, Lu P, Venkataraman A, Lu-Culligan A, Klein J, Earnest R, Simonov M, Datta R, Handoko R, Naushad N, Sewanan LR, Valdez J, White EB, Lapidus S, Kalinich CC, Jiang X, Kim DJ, Kudo E, Linehan M, Mao T, Moriyama M, Oh JE, Park A, Silva J, Song E, Takahashi T, Taura M, Weizman O-E, Wong P, Yang Y, Bermejo S, Odio CD, Omer SB, Dela Cruz CS, Farhadian S, Martinello RA, Iwasaki A, Grubaugh ND, Ko AI. 2020. Saliva or nasopharyngeal swab specimens for detection of SARS-CoV-2. N Engl J Med 383:1283–1286. doi: 10.1056/NEJMc2016359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Abasiyanik MF, Flood B, Lin J, Ozcan S, Rouhani SJ, Pyzer A, Trujillo J, Zhen C, Wu P, Jumic S, Wang A, Gajewski TF, Wang P, Hartley M, Ameti B, Niemiec R, Fernando M, Aydogan B, Bethel C, Matushek S, Beavis KG, Agrawal N, Segal J, Tay S, Izumchenko E. 2020. Sensitive detection and quantification of SARS-CoV-2 in saliva. medRxiv. 2020.12.04.20241059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Landry ML, Criscuolo J, Peaper DR. 2020. Challenges in use of saliva for detection of SARS CoV-2 RNA in symptomatic outpatients. J Clin Virol off Publ Pan Am Soc Clin Virol 130:104567. doi: 10.1016/j.jcv.2020.104567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Babady NE, McMillen T, Jani K, Viale A, Robilotti EV, Aslam A, Diver M, Sokoli D, Mason G, Shah MK, Korenstein D, Kamboj M. 2021. Performance of severe acute respiratory syndrome coronavirus 2 real-time RT-PCR tests on oral rinses and saliva samples. J Mol Diagn 23:3–9. doi: 10.1016/j.jmoldx.2020.10.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hanson KE, Barker AP, Hillyard DR, Gilmore N, Barrett JW, Orlandi RR, Shakir SM. 2020. Self-collected anterior nasal and saliva specimens versus health care worker-collected nasopharyngeal swabs for the molecular detection of SARS-CoV-2. J Clin Microbiol 58:e01824-20. doi: 10.1128/JCM.01824-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kojima N, Turner F, Slepnev V, Bacelar A, Deming L, Kodeboyina S, Klausner JD. 2020. Self-collected oral fluid and nasal swab specimens demonstrate comparable sensitivity to clinician-collected nasopharyngeal swab specimens for the detection of SARS-CoV-2. Clin Infect Dis off Publ Infect Dis Soc Am. doi: 10.1093/cid/ciaa1589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Pasomsub E, Watcharananan SP, Boonyawat K, Janchompoo P, Wongtabtim G, Suksuwan W, Sungkanuparph S, Phuphuakrat A. 2020. Saliva sample as a non-invasive specimen for the diagnosis of coronavirus disease 2019: a cross-sectional study. Clin Microbiol Infect off Publ Eur Soc Clin Microbiol Infect Dis doi: 10.1016/j.cmi.2020.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.McCormick-Baw C, Morgan K, Gaffney D, Cazares Y, Jaworski K, Byrd A, Molberg K, Cavuoti D. 2020. Saliva as an alternate specimen source for detection of SARS-CoV-2 in symptomatic patients using Cepheid Xpert Xpress SARS-CoV-2. J Clin Microbiol 58:e01109-20. doi: 10.1128/JCM.01109-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Caulley L, Corsten M, Eapen L, Whelan J, Angel JB, Antonation K, Bastien N, Poliquin G, Johnson-Obaseki S. 2021. Salivary detection of COVID-19. Ann Intern Med 174:131–133. doi: 10.7326/M20-4738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chen JH-K, Yip CC-Y, Poon RW-S, Chan K-H, Cheng VC-C, Hung IF-N, Chan JF-W, Yuen K-Y, To KK-W. 2020. Evaluating the use of posterior oropharyngeal saliva in a point-of-care assay for the detection of SARS-CoV-2. Emerg Microbes Infect 9:1356–1359. doi: 10.1080/22221751.2020.1775133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wang W, Xu Y, Gao R, Lu R, Han K, Wu G, Tan W. 2020. Detection of SARS-CoV-2 in different types of clinical specimens. JAMA 323:1843–1844. doi: 10.1001/jama.2020.3786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Tang S, Hemyari P, Canchola JA, Duncan J. 2018. Dual composite reference standards (dCRS) in molecular diagnostic research: a new approach to reduce bias in the presence of Imperfect reference. J Biopharm Stat 28:951–965. doi: 10.1080/10543406.2018.1428613. [DOI] [PubMed] [Google Scholar]
  • 15.Baughman AL, Bisgard KM, Cortese MM, Thompson WW, Sanden GN, Strebel PM. 2008. Utility of composite reference standards and latent class analysis in evaluating the clinical accuracy of diagnostic tests for pertussis. Clin Vaccine Immunol 15:106–114. doi: 10.1128/CVI.00223-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Menni C, Valdes AM, Freidin MB, Sudre CH, Nguyen LH, Drew DA, Ganesh S, Varsavsky T, Cardoso MJ, El-Sayed Moustafa JS, Visconti A, Hysi P, Bowyer RCE, Mangino M, Falchi M, Wolf J, Ourselin S, Chan AT, Steves CJ, Spector TD. 2020. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat Med 26:1037–1040. doi: 10.1038/s41591-020-0916-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kohn MA, Carpenter CR, Newman TB. 2013. Understanding the direction of bias in studies of diagnostic test accuracy. Acad Emerg Med 20:1194–1206. doi: 10.1111/acem.12255. [DOI] [PubMed] [Google Scholar]
  • 18.Zhang L, Richards A, Barrasa MI, Hughes SH, Young RA, Jaenisch R. 2021. Reverse-transcribed SARS-CoV-2 RNA can integrate into the genome of cultured human cells and can be expressed in patient-derived tissues. Proc Natl Acad Sci U S A 118:e2105968118. doi: 10.1073/pnas.2105968118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Butler-Laporte G, Lawandi A, Schiller I, Yao MC, Dendukuri N, McDonald EG, Lee TC. 2021. Comparison of saliva and nasopharyngeal swab nucleic acid amplification testing for detection of SARS-CoV-2: a systematic review and meta-analysis. JAMA Intern Med 181:353. doi: 10.1001/jamainternmed.2020.8876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Silva J, Lucas C, Sundaram M, Israelow B, Wong P, Klein J, Tokuyama M, Lu P, Venkataraman A, Liu F, Mao T, Oh JE, Park A, Casanovas-Massana A, Vogels CBF, Muenker MC, Zell J, Fournier JB, Campbell M, Chiorazzi M, Fuentes ER, Petrone ME, Kalinich CC, Ott IM, Watkins A, Moore AJ, Nakahata M, Team YI, Grubaugh ND, Farhadian S, Cruz CD, Ko AI, Schulz WL, Ring A, Ma S, Omer S, Wyllie AL, Iwasaki A. 2021. Saliva viral load is a dynamic unifying correlate of COVID-19 severity and mortality. medRxiv. 2021.01.04.21249236. [Google Scholar]
  • 21.Lieberman JA, Pepper G, Naccache SN, Huang M-L, Jerome KR, Greninger AL. 2020. Comparison of commercially available and laboratory-developed assays for in vitro detection of SARS-CoV-2 in clinical laboratories. J Clin Microbiol 58:e00821-20. doi: 10.1128/JCM.00821-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Procop GW, Brock JE, Reineks EZ, Shrestha NK, Demkowicz R, Cook E, Ababneh E, Harrington SM. 2021. A comparison of five SARS-CoV-2 molecular assays with clinical correlations. Am J Clin Pathol 155:69–78. doi: 10.1093/ajcp/aqaa181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Zhen W, Manji R, Smith E, Berry GJ. 2020. Comparison of four molecular in vitro diagnostic assays for the detection of SARS-CoV-2 in nasopharyngeal specimens. J Clin Microbiol 58:e00743-20. doi: 10.1128/JCM.00743-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Tang Y-W, Schmitz JE, Persing DH, Stratton CW. 2020. Laboratory diagnosis of COVID-19: current issues and challenges. J Clin Microbiol 58:e00512-20. doi: 10.1128/JCM.00512-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Nelson AC, Auch B, Schomaker M, Gohl DM, Grady P, Johnson D, Kincaid R, Karnuth KE, Daniel J, Fiege JK, Fay EJ, Bold T, Langlois RA, Beckman KB, Yohe S. 2020. Analytical validation of a COVID-19 qRT-PCR detection assay using a 384-well format and three extraction methods. bioRxiv. doi: 10.1101/2020.04.02.022186. [DOI]
  • 26.R Core Team. 2020. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
  • 27.Wickham H. 2016. ggplot2: elegant graphics for data analysis. Springer-Verlag, New York, NY. [Google Scholar]
  • 28.Wilke CO. 2020. cowplot: streamlined plot theme and plot annotations for ggplot2.
  • 29.Wickham H, Averick M, Bryan J, Chang W, McGowan LD, François R, Grolemund G, Hayes A, Henry L, Hester J, Kuhn M, Pedersen TL, Miller E, Bache SM, Müller K, Ooms J, Robinson D, Seidel DP, Spinu V, Takahashi K, Vaughan D, Wilke C, Woo K, Yutani H. 2019. Welcome to the Tidyverse. JOSS 4:1686. doi: 10.21105/joss.01686. [DOI] [Google Scholar]
  • 30.Wickham H. 2007. Reshaping data with the reshape package 1. J Stat Softw 21:1–20. [Google Scholar]
  • 31.Kolde R. 2019. pheatmap: pretty heatmaps.
  • 32.Garnier S. 2018. viridis: default color maps from matplotlib.
  • 33.Neuwirth E. 2014. RColorBrewer: ColorBrewer palettes.
  • 34.Kassambara A. 2020. ggpubr: ggplot2 based publication ready plots.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reviewer comments
reviewer-comments.pdf (194.7KB, pdf)
SUPPLEMENTAL FILE 1

Supplemental material. Download SPECTRUM00086-21_Supp_1_seq7.pdf, PDF file, 0.4 MB (455.9KB, pdf)


Articles from Microbiology Spectrum are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES