Skip to main content
Wiley - PMC COVID-19 Collection logoLink to Wiley - PMC COVID-19 Collection
. 2021 Feb 19;52(5):949–958. doi: 10.1002/jrs.6080

An efficient primary screening of COVID‐19 by serum Raman spectroscopy

Gang Yin 1,2, Lintao Li 1,2, Shun Lu 1,2, Yu Yin 3, Yuanzhang Su 4, Yilan Zeng 5, Mei Luo 5, Maohua Ma 5, Hongyan Zhou 1,2, Lucia Orlandini 1,2, Dezhong Yao 3,, Gang Liu 6,, Jinyi Lang 1,2,
PMCID: PMC8014023  PMID: 33821082

Abstract

The outbreak of COVID‐19 coronavirus disease around the end of 2019 has become a pandemic. The preferred method for COVID‐19 detection is the real‐time polymerase chain reaction (RT‐PCR)‐based technique; however, it also has certain limitations, such as sample‐dependent procedures with a relatively high false negative ratio. We propose a safe and efficient method for screening COVID‐19 based on Raman spectroscopy. A total of 177 serum samples are collected from 63 confirmed COVID‐19 patients, 59 suspected cases, and 55 healthy individuals as a control group. Raman spectroscopy is adopted to analyze these samples, and a machine learning support‐vector machine (SVM) method is applied to the spectrum dataset to build a diagnostic algorithm. Furthermore, 20 independent individuals, including 5 asymptomatic COVID‐19 patients and 5 symptomatic COVID‐19 patients, 5 suspected patients, and 5 healthy patients, were sampled for external validation. In these three groups—confirmed COVID‐19, suspected, and healthy individuals—the distribution of statistically significant points of difference showed highly consistency for intergroups after repeated sampling processes. The classification accuracy between the COVID‐19 cases and the suspected cases is 0.87 (95% confidence interval [CI]: 0.85–0.88), and the accuracy between the COVID‐19 and the healthy controls is 0.90 (95% CI: 0.89–0.91), while the accuracy between the suspected cases and the healthy control group is 0.68 (95% CI: 0.67–0.73). For the independent test dataset, we apply the obtained SVM model to the classification of the independent test dataset to have all the results correctly classified. Our model showed that the serum‐level classification results were all correct for independent test dataset. Our results suggest that Raman spectroscopy could be a safe and efficient technique for COVID‐19 screening.

Keywords: COVID‐19; machine learning; Raman spectroscopy; screening, support vector machine


The outbreak of COVID‐19 coronavirus disease around the end of 2019 has become a pandemic. We propose a safe and efficient method for screening COVID‐19 based on Raman spectroscopy. Raman spectroscopy is adopted to analyze these sealed serum samples, and a machine learning support‐vector machines (SVMs) method is applied to the spectrum dataset to build a diagnostic algorithm. Our results suggest that Raman spectroscopy could be safe and efficient technique for COVID‐19 screening.

graphic file with name JRS-52-949-g002.jpg

1. INTRODUCTION

The severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2), causing COVID‐19 coronavirus disease in 2019, became a pandemic.[ 1 ] Clinical manifestation of COVID‐19 is highly nonspecific, including fever, fatigue, cough, myalgia, dyspnea, and headache.[ 2 ] Unfortunately, no effective treatment was available, until now. It already outpaced SARS in 2003 and Middle East Respiratory Syndrome (MERS) in 2012 in incidence and mortality.[ 2 , 3 ] The rapid and accurate detection of viruses is of great importance in controlling SARS‐CoV‐2 spread. Currently, a positive for the SARS‐CoV‐2 viral nucleic acid by polymerase chain reaction based on the technique or deep sequencing of specimens from the respiratory tract or serum is confirmed for COVID‐19.[ 4 , 5 ] However, false negative results for patients with insufficient amounts of the virus at the site of sample collection, causing the COVID‐19 patients to spread the virus, have become a major setback in containing viral transmission.[ 6 ] Therefore, it is of practical significance to develop a safe and efficient diagnostic method.

Raman spectroscopy is a method that detects molecular vibration and molecular rotation energy levels. It detects components on the biomolecule level with a high sensitivity for distinguishing vibrations and conformations of proteins, peptides, and nucleic acid.[ 7 ] The characteristic peaks in Raman spectrum are called “fingerprints,” representing the biochemical composition of the sample, and the noncontact detection process is fast, repeatable, specific, and highly sensitive.[ 8 ] Although the spectrum of SARS‐CoV‐2 is not revealed, the activation of a humoral response by the virus may induce a series of immune reactions, releasing the cytokines and resulting in the change of spectrum in the serum.[ 9 ] Machine learning has proven efficient for the data analysis of the complicated change of Raman spectrum in samples. Because it is a simple process of sample preparation and it is free of sample loss, Raman spectroscopy is highly recommended for diagnosing a diverse range of diseases, including dengue fever, hepatitis C virus, cancer, HIV, Alzheimer's disease, and endometriosis.[ 10 , 11 , 12 , 13 , 14 , 15 , 16 ] Raman can also be used to detect RNA viruses.[ 17 ]

In this study, we screen the serum from COVID‐19 patients, suspected cases, and healthy patients (control) by Raman spectroscopy to evaluate its detection capability.

2. MATERIALS AND METHOD

2.1. Study objects

The study protocol was in accordance with principles of the Declaration of Helsinki and approved by the Ethics Committee of Sichuan Cancer Hospital & Institute (Chengdu, China). None of the authors had access to information that could identify individual participants during or after data collection. In this study, all blood samples were taken from February 10 to May 10, 2020.

The COVID‐19 group contained 63 patients who were recruited at the Chengdu Public Health Clinical Medical Center, including 58 symptomatic patients and 5 asymptomatic COVID‐19 patients. All of the COVID‐19 patients were positive using viral nucleic acid by real‐time polymerase chain reaction (RT‐PCR) detection with specimens from the respiratory tract. For the 58 symptomatic patients, the median (± standard deviation [SD]) interval time between symptom onset and sampling is 3.0 (±4.2) days. The suspected group contained 59 patients with flu symptoms similar to COVID‐19. All of the investigated “suspected” patients were also isolated in the COVID‐19 designated hospital until they were confirmed negative for COVID‐19. All of them showed negative viral nucleic acid tests at least twice by RT‐PCR detection, and the interval time between the two detections is less than 1 week. This group of patients was followed up until no COVID‐19 patients were found in this group. The healthy control group consisted of 55 healthy individuals recruited at the Sichuan Cancer Hospital and Chengdu Public Health Clinical Medical Center. General information for all of the investigated individuals, such as gender, age, clinical characteristics, and detailed information, is shown in Table 1.

TABLE 1.

Clinical characteristics of the investigated individuals

Characteristic Characteristic COVID‐19 Suspected Healthy control
Symptomatic Asymptomatic
Total 58 5 59 55
Age (media, range) 47.6 (20–78) 45.8 (21–74) 45.5 (24–65)
Gender
Male 26 2 36 24
Female 32 3 23 31
Distribution of temperature (blood sampling)
<37.5°C 27 5 13 5
37.5–38.0°C 19 17
38.1–39.0°C 7 21
>39.0°C 5 8
Symptoms
Cough 36 24
Fatigue 21 16
Myalgia or arthralgia 9 6
Headache 11 20
Shortness of breath 6 3
Disease severity
Nonsevere 51 58
Severe 7 1
Abnormalities on chest CT
Ground‐glass opacity 29 19
Local patchy shadowing 21 34
Bilateral patchy shadowing 28 17
Interstitial abnormalities 9 17

Abbreviation: CT, computed tomography.

All of the enrolled individuals gave informed consent to the research purpose and have signed the informed consent. The project was approved by the ethics committee of the participating institutes.

2.2. Sample preparation

Blood samples were taken from the COVID‐19 patients and the suspected cases upon admission from February 10 to May 10, 2020. One‐hour repose of blood sampling, the serum was isolated from blood samples by centrifuging at 3000 rpm for 10 min. All the serum samples were stored at 4°C and measured within 36 h after the collection. For the measurement, approximately 0.5 ml of the serum sample was prepared in cryopreservation tubes (specification: 2 ml; material: polypropylene) and strictly sealed for the Raman scan. Additional spectra data were also collected from cryopreservation tubes with saline solution inside.

2.3. Experimental setup

The system consists of a volume‐phase holographic (VPH) spectrograph, deep‐cooled CCD camera, Raman probe, and laser, designed by the Sichuan Institute for Brain Science and Brain‐Inspired Intelligence, and specific hardware composition and parameters; see Supporting Information S1. A single‐mode diode laser (real‐light) with wavelength 785‐nm and 100‐mW power was used for Raman excitation. The laser power on the sample was detected around 70 mW. The spectra were recorded in the range of 600–1800 cm−1. The Raman spectra were collected 15 times per sample, with 3‐s accumulation taken at each sample. Collecting multiple spectra per sample ensures an accurate representation of the heterogeneous composition of a sample.

First, before each measurement, the ethanol spectrum was measured using an exposure time of 3 s for the wavenumber calibration. Second, the cryopreservation tubes with a 5% normal saline spectrum were acquired using an exposure time of 3 s with five successive scans for every beginning and completion of the experiment. The average spectrum of the cryopreservation tubes was used to investigate the influence of tube wall materials. Next, the Raman spectrum of the serum samples sealed within the cryopreservation tube was measured using the same integration parameters as the cryopreservation tube measurements. Three experimenters took the Raman scan for each sample tube and repeated it five times. The cryopreservation tube was placed in the special card slot of the Raman spectrometer, ensuring the laser passes through the tube wall at a certain angle. After cosmic ray removal from the spectral data, we had 15 scans conducted by each experimenter of each serum sample.

2.4. Data processing steps

A total of 2355 spectra from 157 individuals were subjected to preprocessing steps, including smoothing by automatic‐weighted least squares, baseline correction based on polynomial fitting, and normalization by total area.[ 18 ] These spectra data were used for feature selection and to build the classification model. Analysis of variance (ANOVA) statistical analysis was used to select relevant features included in the training of support‐vector machine (SVM) models.

The model predicts unknown samples and classifies them accordingly. The performance of the proposed model has been evaluated using a cross‐validation method, dividing the whole dataset into 70% for training and the remaining 30% for testing. To test the robustness of the model, we conducted a “blind” validation. Here, 70% of the random sampling of the total sample was used to establish the model, and the remaining 30% of the samples as hold‐out set for SVM model test. To ensure the independence of the data, the random sampling process guaranteed that the spectra data were used to establish the model and for model test from completely different samples.[ 19 ] The aforementioned process was repeated 50 times.

Furthermore, 20 independent individuals, including 5 asymptomatic and 5 symptomatic COVID‐19 patients, 5 suspected patients, and 5 healthy controls, were sampled for model test. These 20 serum samples (corresponding to 300 spectra) were set aside to create an independent external hold‐out dataset. After the prediction model was built, the 20 serum samples were preprocessed in the same way and used to validate the SVM model. The detailed procedure of data analysis is provided in Supporting Information S2.

3. RESULTS

3.1. Clinical characteristics

Among all of the COVID‐19 patients, 55.6% are female, and the median age is 47.6 years (range: 20–78). In the suspected group, 38.9% are female, and the median age is 45.8 years (range: 21–74). Moreover, 31 (56.4%) participants in the healthy control group are female, and the median age is 45.5 years (range: 24–65). No statistically significant difference was observed between these three groups in terms of gender and age.

The most common symptom is a cough (62.1%) in symptomatic COVID‐19 patients upon admission, followed by a fever (53.4%). A fever is present in 78.0% of all of the suspected cases upon admission, and the second most prevalent symptom is a cough (40.1%). The computed tomography (CT) scans were performed upon admission for all COVID‐19 patients and the suspected cases. In the COVID‐19 group, 76.5% showed abnormal results. The most common patterns on the chest CT were ground‐glass opacity (50.0%) and bilateral patchy shadowing (48.3%). In the suspected group, 65.4% of the patients showed abnormal results, the most common patterns on chest CT were local patchy shadowing (57.7%) and ground‐glass opacity (32.2%). According to our follow‐up data, these 59 suspected cases include 13 cases of flu, 6 cases of bacterial infections, and 40 cases of unknown causes.

3.2. Raman spectra and statistical analysis

Figure 1a shows the average of each group's preprocessed spectra. The differences among the average spectrum of the COVID‐19 group, the healthy control group, and the suspected group are depicted in Figure 1b. The difference in the mean spectrum is shown within ±2 SDs, suggesting that the mean difference between the groups is statistically insignificant. To build the diagnostic algorithms, multivariate analysis is required to uncover useful variability within the individual spectrum.

FIGURE 1.

FIGURE 1

The total average serum Raman of the three groups and the difference between the groups. (a) The total average of the three types of Raman, the color band represents the standard deviation. (b) The Raman difference signal between the groups (black) and the Raman signal of the groups between ±2 standard deviations (red and blue) [Colour figure can be viewed at wileyonlinelibrary.com]

An ANOVA test was applied for both the intergroup and intragroup after random sampling, and it was repeated 100 times. In these three groups, the distribution of statistically significant points of difference shows a high consistency for intergroups after repeating the sampling processes. Nevertheless, the distribution consistency of statistically significant points is not found for the intragroup.

After the ANOVA test, the difference between the COVID‐19 group, the suspected group, and the healthy control group was in the spectra range of 600–1800 cm−1 and is observed in Figure 2a. The differences between the healthy control group and the suspected group shown in the spectra range are significant less than those between the COVID‐19 group and the suspected or healthy control group. However, for the intergroup, the ANOVA test result had no apparent consistency, similar to random noise (Figure 2b).

FIGURE 2.

FIGURE 2

The result of the ANOVA test. The spectra range without a significant difference in the ANOVA test (p < 0.05) was indicated in blue, while others were indicated in yellow. (a) The Raman shift spectrum of the difference in p value after a 70% random sampling and repeated training 100 times for the intergroups. (b) The Raman shift spectrum of the difference in p value after a 70% random sampling and repeated training 100 times intragroups [Colour figure can be viewed at wileyonlinelibrary.com]

3.3. Training and validation of model

In the training phase, the SVM learns the relationship between independent and dependent variables. Specifically, it learns to make associations between intensity values at specific wavelengths of the input spectrum data with a sample class membership, which are the designated outputs of the SVM. Wave points with significant differences in ANOVA test results are used as the input of the SVM. Here, the performance parameters for each class, determined by the SVM, are shown. The sensitivity (true positive rate), specificity (true negative rate), and accuracy (percentage of correctly predicted spectra among total cases) for each group's prediction are reported together with 95% CIs. All three groups are successfully classified with average sensitivity, specificity, and accuracy (Table 2). Figure 3 shows the performance of the classifiers through a receiver operating characteristic (ROC) curve. All ROC analyses are based on nonparametric techniques and are conducted for the SVM analyses. For each of the three classification tasks, the area under curve (AUC) value is calculated and shown in Figure 3.

TABLE 2.

Performance parameters of the SVM

Class Performance parameter Value ± SD 95% CI
COVID‐19 versus suspected Sensitivity 0.89 ± 0.08 (0.90 ± 0.08) 0.87–0.91 (0.87–0.92)
Specificity 0.86 ± 0.09 (0.88 ± 0.09) 0.83–0.88 (0.85–0.90)
Accuracy 0.87 ± 0.05 (0.89 ± 0.06) 0.86–0.89 (0.88–0.90)
COVID‐19 versus healthy control Sensitivity 0.89 ± 0.07 (0.89 ± 0.079) 0.90–0.92 (0.87–0.91)
Specificity 0.93 ± 0.06 (0.94 ± 0.06) 0.91–0.94 (0.93–0.96)
Accuracy 0.91 ± 0.04 (0.91 ± 0.04) 0.90–0.92 (0.90–0.93)
Suspected versus healthy control Sensitivity 0.70 ± 0.09 (0.72 ± 0.11) 0.68–0.73 (0.69–0.75)
Specificity 0.66 ± 0.09 (0.71 ± 0.11) 0.64–0.69 (0.68–0.74)
Accuracy 0.69 ± 0.05 (0.71 ± 0.07) 0.68–0.70) (0.70–0.73)

Note: Brackets: serum‐level classification results for each serum samples.

Abbreviations: CI, confidence interval; SVM, support‐vector machine.

FIGURE 3.

FIGURE 3

The ROC curve of the SVM diagnostic algorithm for the COVID‐19 group versus the suspected group, the COVID‐19 group versus healthy control group, and the suspected group versus the healthy control group [Colour figure can be viewed at wileyonlinelibrary.com]

Table 2 shows the results of specificity, accuracy, and sensitivity of the SVM classification; the non‐brackets indicate the specificity, sensitivity, and accuracy of individual spectra, and the brackets indicate the specificity, sensitivity, and accuracy of each serum sample.

For independent test datasets, all of the unlabeled spectra are assigned to the class with the highest probability by SVM model. The classification of each individual spectrum from each of the serum samples is depicted in Table 3 (left‐hand side), demonstrating an overall average accuracy of 90% for all classes.

TABLE 3.

Results of 20 samples for the independent verification

Individual spectra predictions
Sample # Predicted class External validation results
COVID‐19 Suspected Healthy control Sample # Predicted class True class
1 2771 / 529 1 COVID‐19 COVID‐19
2 2570 / 730 2 COVID‐19 COVID‐19
3 2642 / 658 3 COVID‐19 COVID‐19
4 2424 / 876 4 COVID‐19 COVID‐19
5 2631 / 669 5 COVID‐19 COVID‐19
6 a 3300 0 / 6 a COVID‐19 COVID‐19
7 a 3271 29 / 7 a COVID‐19 COVID‐19
8 a 3300 0 / 8 a COVID‐19 COVID‐19
9 a 2811 489 / 9 a COVID‐19 COVID‐19
10 a 3300 0 / 10 a COVID‐19 COVID‐19
11 a 76 3224 / 11 a Suspected Suspected
12 a 196 3104 / 12 a Suspected Suspected
13 a 891 2409 / 13 a Suspected Suspected
14 a 93 3207 / 14 a Suspected Suspected
15 a 1229 2071 / 15 a Suspected Suspected
16 / 5 3295 16 Healthy controls Healthy controls
17 / 115 3185 17 Healthy controls Healthy controls
18 / 0 3300 18 Healthy controls Healthy controls
19 / 0 3300 19 Healthy controls Healthy controls
20 / 16 3284 20 Healthy controls Healthy controls
a

Symptomatic.

The overall serum‐level classification of each of the 20 serum samples, based on spectral‐level predictions, is shown on the right‐hand side of Table 3. A serum sample was assigned to the class (confirmed COVID‐19, suspected COVID‐19, and healthy control groups) that received the majority of spectra assigned to it. The lowest maximum percentage was observed for sample 15 who was assigned to the suspected class with 62.7% of spectra correctly assigned. The true classification of the samples was only revealed after the model had made its predictions. For the independent test dataset, our model showed that the serum‐level classification results were all correct.

4. DISCUSSION

Early detection plays a vital role in infectious diseases and controlling an outbreak. COVID‐19 is difficult to diagnose from the flu through clinical symptoms, proven by clinical characteristics analysis.[ 2 ] In this study, we analyze the Raman spectrum data of serum from 177 individuals, which consisted of confirmed COVID‐19 patients, suspected COVID‐19 patients, and healthy controls group.

The current standard method for detecting COVID‐19 is the RT‐PCR.[ 4 , 20 ] However, the false negative rate is high related to the time between detection and disease onset, because of the difficulty of acquiring SARS‐CoV‐2 specimens from the respiratory tract in the early phase.[ 21 ] Currently, antibodies, including IgM and IgG, reverse transcription multiple cross displacement amplification assay, and other techniques are under development for diagnosing COVID‐19.[ 22 , 23 , 24 , 25 , 26 ]

By using a laser to focus directly on the serum sample, previous studies have demonstrated that Raman spectroscopy may differentiate virus infections from healthy individuals by detecting the characteristic peaks of Raman spectrum, which respond to specific carbohydrates of the viral glycoprotein.[ 10 , 12 , 27 , 28 ]

Because SARS‐CoV‐2 is a virus causing severe infectious disease, for protection, our method seals the serum sample in a polypropylene biological tube for preservation. The laser light goes through the transparent tube wall before approaching the serum sample, and it greatly reduces the signal‐to‐noise ratio of the Raman signal for the serum sample. Multivariate analysis is required to uncover useful variability within individual spectrum and to build diagnostic algorithms. Hence, ANOVA and SVM are used to analyze the spectrum data to obtain qualified results. In the meantime, the nonexposed samples are maintained during the entire procedure, which is a crucial step to ensure safety during COVID‐19 detection. The difference in the mean spectrum is shown as within ±2 SDs in Figure 1b, suggesting that the mean difference between the groups is statistically insignificant. High intergroup consistency is shown after the ANOVA analysis, while the differences in the intragroups are random. Additionally, the data of the spectrum include the signal detected from the tubes and are also in accordance with previous studies.[ 29 ]

The feature extraction is one significant procedure of machine learning. State‐of‐art feature extraction methods, such as filer, wrapper, and embedding, train the models and determine the performance of features. In this study, we use ANOVAL analysis to select the features. Furthermore, a low p value does not necessarily mean that a strong feature and conversely a high p value do not mean a weak feature. Furthermore, the feature extraction methods may increase classification accuracy. Also, we can use deep learning to avoid feature extraction problems when we accumulate more data.

To prevent the virus from spreading, it is recommended that suspected cases should be isolated and diagnosed. This group consists of patients with flu symptoms or individuals in tight contact with COVID‐19 patients. Therefore, we recruit three groups, namely, the COVID‐19 group, the suspected cases group, and the healthy control group. In this study, compared to the healthy control group and the suspected group, the COVID‐19 group shows a significant difference in the same spectrum range of 600–1800 cm−1. The suspected cases were experiencing flu‐like symptoms during the blood draw, a similar clinical manifestation to COVID‐19. However, they showed a negative result for the SARS‐CoV‐2 viral nucleic acid through RT‐PCR detection. The spectrum range of 600–1800 cm−1 is previously reported to have a variety of proteins in the serum.[ 30 , 31 ]

Interestingly, the spectrum results are consistent with a recent study conducted to determine potential biomarker panels for diagnosing COVID‐19.[ 32 ] Additionally, the difference between the healthy control individuals and the suspected cases shows a lower significance, other than the difference between the COVID‐19 and the suspected or healthy control groups. The following are possible reasons for the difference detected by Raman spectroscopy in the serum of COVID‐19 patients.

To begin with, SARS CoV‐2 may express a specific protein that differs from the healthy samples or other diseases.[ 33 ] Meanwhile, during the pathological process of COVID‐19, activation of the humoral response leads to the generation of certain antibodies,[ 9 , 32 ] suggesting the biochemical composition changed in the serum of COVID‐19 patients and is specific and detectable by Raman spectroscopy.

Second, most of the patients in the COVID‐19 group are non‐severe (57 in 63) when drawing blood, indicating that Raman spectroscopy has a high specificity for detecting COVID‐19. Moreover, asymptomatic infections currently account for approximately 20% of all COVID‐19 cases, which is important for prevention measures.[ 34 ] In this study, the results of the independent test dataset showed that this model has a high sensitivity for asymptomatic infection detection. Recommending serum Raman spectroscopy may contribute to pandemic prevention because of its high sensitivity for both asymptomatic and symptomatic patients.

Because of the complicated biochemical components within the serum, the single spectrum collected from each sample is not identical, and it contains information for local concentrations of biomarkers and biochemical components. This increases the possibility for individual spectrum to be misclassified. Bo et al. collected the data from 28 healthy subjects, 25 patients with negative COVID‐19 PCR test result but have similar clinic features with COVID‐19, and 25 COVID‐19 patients. They found the severity of the diseases is highly related with 204 serum metabolites from COVID‐19 patients, and there are 105 protein difference expressions in serum of COVID‐19 patients and subjects without COVID‐19 (Shen et al.[ 9 ]). Liu et al. demonstrated that COVID‐19 patients have increasing inflammatory cytokines level in their serum. The cytokine storm is known as one important reason on death of heavy and critical COVID‐19 patients.[ 35 ] Hence, the difference on serum composition of COVID‐19 patients comparison to no‐COVID‐19 is exists and can be detected. Raman spectrum has high accuracy and high sensitivity. Hence, Raman spectrum is one powerful tool for COVID‐19 testing. Additionally, the relatively small sample size only represents a tiny fraction of the Chinese population. The detected data for a larger, independent population of different races would help build a better model and standard, which may profoundly improve the accuracy of Raman spectroscopy for screening.

False negative results may occur due to the influence of COVID‐19 nucleic acid detection extraction method, kit sensitivity, variation in detection rate from different manufacturers, insufficient viral material in the specimen, low patient viral load, and other factors.[ 36 , 37 ] Researches of Lauren et al. show that on the day of symptom onset, the median false negative rate for initial RT‐PCR was 38% (CI, 18% to 65%), And 3 days after symptom onset, the median false negative rate decreased to 20% (CI, 12% to 30%), and then increased again, from 21% (CI, 13% to 31%) on day 4 days after symptom onset to 66% (CI, 54% to 77%) on 16 days after symptom onset.[ 21 ] Another research find that 21.4% patients experienced a “turn positive” of nucleic acid detection by RT‐PCR test for SARS‐CoV‐2 after two consecutive negative result. Although the false negative rate of our model is lower than RT‐PCR, our model is based on the gold standard of RT‐PCR and the detection performance cannot be higher than the gold standard. This is because the confirmed cases and the suspected cases are all determined with many times RT‐PCR tests in our study, while the false negative rate of RT‐PCR test from literature reports is usually based on the first‐time RT‐PCR test.

The large‐scale global epidemic of COVID‐19 has brought tremendous pressure to medical institutions. As the gold standard, RT‐PCR requires a lot of manpower and material resources. The serum detection based on Raman spectroscopy proposed in this paper is low‐cost, fast, and low manpower requirements. Doctors are provided with more testing methods. Serum testing, as a routine testing item in medical institutions, can provide low‐cost and rapid screening of patients in hospitals. In applications, medical institutions perform Raman testing in routine serum testing items. Once high‐risk patients are found, they are immediately quarantined and then further tested with RT‐PCR, thereby reducing the risk of infection in medical institutions.

With the spread of SARS‐CoV‐2, an increasing number of covert patients could seed new outbreaks.[ 8 ] To contain the virus, tests for more individuals without strict limitations may generate a massive demand for screening. Sealing the serum sample in a cryopreservation tube, portable detection devices, and a simple procedure for testing indicates that Raman spectroscopy is a safe and convenient detection method. Moreover, the accuracy will be higher with additional screening data for input into the SVM, making this a promising method for application in the future.

5. CONCLUSION

In conclusion, our results suggest that Raman spectroscopy may be powerful, effective, and convenient for COVID‐19 screening. Additional studies with a larger independent population are needed to verify our findings and to investigate the particular spectrum bands of the serum spectrum that corresponds to the suspected biomarkers of COVID‐19.

CONFLICT OF INTEREST

The authors declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported.

AUTHOR CONTRIBUTIONS

Lintao Li, Hongyan Zhou, and Yilan Zeng carried out the collection of sample. Gang Yin and Yu Yin carried out data analysis. Yuanzhang Su offered the technique support. Bo Sun participated in data analysis. Mei Luo and Maohua Ma participated in the serum samples collection. Gang Yin, Lintao Li, Shun Lu, and Lucia Orlandini had contributions to literatures review and draft writing. Dezhong Yao, Gang Liu, and Jinyi Lang had contributions to the conception and design of the study. All authors have read and approved the final manuscript.

Supporting information

Data S1 Supporting Information

ACKNOWLEDGMENTS

This study was funded by the grant agency of the Chengdu Science and Technology Bureau (No. 2020‐YF05‐0014‐SN) and Department of Science and Technology of Sichuan Province, Science and Technology Support Project (ID: 2019YFG0185).

Yin G, Li L, Lu S, et al. An efficient primary screening of COVID‐19 by serum Raman spectroscopy. J Raman Spectrosc. 2021;52:949–958. 10.1002/jrs.6080

Gang Yin, Lintao Li, and Shun Lu contributed equally to this manuscript.

Funding information Department of Science and Technology of Sichuan Province, Grant/Award Number: 2019YFG0185; Chengdu Science and Technology Bureau, Grant/Award Number: 2020‐YF05‐0014‐SN

Contributor Information

Dezhong Yao, Email: dyao@uestc.edu.cn.

Gang Liu, Email: lee_scch@yeah.net.

Jinyi Lang, Email: langjy610@163.com.

REFERENCES

  • 1. https://www.who.int/emergencies/diseases/novel‐coronavirus‐2019/situation‐reports
  • 2. Guan W. J., Ni Z. Y., Hu Y., Liang W. H., Ou C. Q., He J. X., Liu L., Shan H., Lei C. L., Hui D. S. C., Du B., Li L. J., Zeng G., Yuen K. Y., Chen R. C., Tang C. L., Wang T., Chen P. Y., Xiang J., Li S. Y., Wang J. L., Liang Z. J., Peng Y. X., Wei L., Liu Y., Hu Y. H., Peng P., Wang J. M., Liu J. Y., Chen Z., Li G., Zheng Z. J., Qiu S. Q., Luo J., Ye C. J., Zhu S. Y., Zhong N. S., N. Engl. J. Med. 2020, 382, 1708. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. https://www.who.int/csr/sars/postoutbreak/en/;
  • 4. CDC website.
  • 5. Zhou P., Yang X. L., Wang X. G., Hu B., Zhang L., Zhang W., Si H. R., Zhu Y., Li B., Huang C. L., Chen H. D., Chen J., Luo Y., Guo H., Jiang R. D., Liu M. Q., Chen Y., Shen X. R., Wang X., Zheng X. S., Zhao K., Chen Q. J., Deng F., Liu L. L., Yan B., Zhan F. X., Wang Y. Y., Xiao G. F., Shi Z. L., Nature 2020, 579, 270.. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Rothe C., Schunk M., Sothmann P., Bretzel G., Froeschl G., Wallrauch C., Zimmer T., Thiel V., Janke C., Guggemos W., Seilmaier M., Drosten C., Vollmar P., Zwirglmaier K., Zange S., Wolfel R., Hoelscher M., N. Engl. J. Med. 2020, 382, 970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Duguid J. G., Bloomfield V. A., Benevides J. M., Thomas G. J. Jr., Biophys. J. 1995, 69, 2623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Qiu J., Nature 2020. [Google Scholar]
  • 9. Shen B., Yi X., Sun Y., Bi X., Du J., Zhang C., Quan S., Zhang F., Sun R., Qian L., Ge W., Liu W., Liang S., Chen H., Zhang Y., Li J., Xu J., He Z., Chen B., Wang J., Yan H., Zheng Y., Wang D., Zhu J., Kong Z., Kang Z., Liang X., Ding X., Ruan G., Xiang N., Cai X., Gao H., Li L., Li S., Xiao Q., Lu T., Zhu Y., Liu H., Chen H., Guo T., Cell 2020, 182, 59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Naseer K., Amin A., Saleem M., Qazi J., Spectrochim. Acta A Mol. Biomol. Spectrosc. 2019, 206, 197. [DOI] [PubMed] [Google Scholar]
  • 11. Khan S., Ullah R., Khan A., Ashraf R., Ali H., Bilal M., Saleem M., Photodiagnosis Photodyn. Ther. 2018, 23, 89. [DOI] [PubMed] [Google Scholar]
  • 12. Lee J. H., Kim B. C., Oh B. K., Choi J. W., J. Biomed. Nanotechnol. 2015, 11, 2223. [DOI] [PubMed] [Google Scholar]
  • 13. Ralbovsky N. M., Halamkova L., Wall K., Anderson‐Hanley C., Lednev I. K., J. Alzheimers Dis. 2019, 71, 1351. [DOI] [PubMed] [Google Scholar]
  • 14. Parlatan U., Inanc M. T., Ozgor B. Y., Oral E., Bastu E., Unlu M. B., Basar G., Sci. Rep. 2019, 9, 19795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Feng S., Chen R., Lin J., Pan J., Chen G., Li Y., Cheng M., Huang Z., Chen J., Zeng H., Biosens. Bioelectron. 2010, 25, 2414. [DOI] [PubMed] [Google Scholar]
  • 16. Lin D., Feng S., Pan J., Chen Y., Lin J., Chen G., Xie S., Zeng H., Chen R., Opt. Express 2011, 19, 13565. [DOI] [PubMed] [Google Scholar]
  • 17. Desai S., Mishra S. V., Joshi A., Sarkar D., Hole A., Mishra R., Dutt S., Chilakapati M. K., Gupta S., Dutt A., J. Biophotonics 2020, 13, e202000189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Mehta K., Atak A., Sahu A., Srivastava S., Murali K. C., Analyst 2018, 143, 1916. [DOI] [PubMed] [Google Scholar]
  • 19. Guo S. B. T., Neugebauer U., Popp J., Anal. Methods 2017. [Google Scholar]
  • 20. Corman V. M., Landt O., Kaiser M., Molenkamp R., Meijer A., Chu D. K., Bleicker T., Brunink S., Schneider J., Schmidt M. L., Mulders D. G., Haagmans B. L., van der Veer B., van den Brink S., Wijsman L., Goderski G., Romette J. L., Ellis J., Zambon M., Peiris M., Goossens H., Reusken C., Koopmans M. P., Drosten C., Euro. Surveill. 2020, 25. [Google Scholar]
  • 21. Kucirka L. M., Lauer S. A., Laeyendecker O., Boon D., Lessler J., Ann. Intern. Med. 2020, 173, 262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Li Z., Yi Y., Luo X., Xiong N., Liu Y., Li S., Sun R., Wang Y., Hu B., Chen W., Zhang Y., Wang J., Huang B., Lin Y., Yang J., Cai W., Wang X., Cheng J., Chen Z., Sun K., Pan W., Zhan Z., Chen L., Ye F. J., Med. Virol. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Nguyen T., Duong Bang D., Wolff A., Micromachines (Basel) 2020, 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Xie X., Zhong Z., Zhao W., Zheng C., Wang F., Liu J., Radiology 2020, 200343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Li S., Jiang W., Huang J., Liu Y., Ren L., Zhuang L., Zheng Q., Wang M., Yang R., Zeng Y., Wang Y., Eur. Respir. J. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Lim K. L., Johari N. A., Wong S. T., Khaw L. T., Tan B. K., Chan K. K., Wong S. F., Chan W. L. E., Ramzi N. H., Lim P. K. C., Hakim S. L., Voon K., PLoS One 2020, 15, e0238417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Tong D., Chen C., Zhang J., Lv G., Zheng X., Zhang Z., Lv X., Photodiagnosis Photodyn. Ther. 2019, 28, 248. [DOI] [PubMed] [Google Scholar]
  • 28. Shanmukh S., Jones L., Zhao Y. P., Driskell J. D., Tripp R. A., Dluhy R. A., Anal. Bioanal. Chem. 2008, 390, 1551. [DOI] [PubMed] [Google Scholar]
  • 29. Hsu S. L., Hahn T., Suen W., Kang S., Stidham H. D., Siedle A. R., Macromolecules 2001, 34, 3376. [Google Scholar]
  • 30. Furuta M., Fujisawa T., Urago H., Eguchi T., Shingae T., Takahashi S., Blanch E. W., Unno M., Phys. Chem. Chem. Phys. 2017, 19, 2078. [DOI] [PubMed] [Google Scholar]
  • 31. Depciuch J., Sowa‐Kucma M., Nowak G., Dudek D., Siwek M., Styczen K., Parlinska‐Wojtan M., J. Pharm. Biomed. Anal. 2016, 131, 287. [DOI] [PubMed] [Google Scholar]
  • 32. Guo L., Ren L., Yang S., Xiao M., Chang D., Yang F., Dela Cruz C. S., Wang Y., Wu C., Xiao Y., Zhang L., Han L., Dang S., Xu Y., Yang Q. W., Xu S. Y., Zhu H. D., Xu Y. C., Jin Q., Sharma L., Wang L., Wang J., Clin. Infect. Dis. 2020, 71, 778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Walls A. C., Park Y. J., Tortorici M. A., Wall A., McGuire A. T., Veesler D., Cell 2020, 181, 281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Buitrago‐Garcia D., Egli‐Gany D., Counotte M. J., Hossmann S., Imeri H., Ipekci A. M., Salanti G., Low N., PLoS Med. 2020, 17, e1003346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Liu J., Li S., Liu J., Liang B., Wang X., Wang H., Li W., Tong Q., Yi J., Zhao L., Xiong L., Guo C., Tian J., Luo J., Yao J., Pang R., Shen H., Peng C., Liu T., Zhang Q., Wu J., Xu L., Lu S., Wang B., Weng Z., Han C., Zhu H., Zhou R., Zhou H., Chen X., Ye P., Zhu B., Wang L., Zhou W., He S., He Y., Jie S., Wei P., Zhang J., Lu Y., Wang W., Zhang L., Li L., Zhou F., Wang J., Dittmer U., Lu M., Hu Y., Yang D., Zheng X., EBioMedicine 2020, 55, 102763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Xie X., Zhong Z., Zhao W., Zheng C., Wang F., Liu J., Radiology 2020, 296, E41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Fang Y., Zhang H., Xie J., Lin M., Ying L., Pang P., Ji W., Radiology 2020, 296, E115. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data S1 Supporting Information


Articles from Journal of Raman Spectroscopy are provided here courtesy of Wiley

RESOURCES