To the Editor: In December 2019, coronavirus disease 2019 outbreak caused by the 2019 novel coronavirus (2019-nCoV) happened in Wuhan, China. Now, it has posed a worldwide public health threat. Real-time quantitative polymerase chain reaction (RT-qPCR) was recommended as an effective pathogen detection method and has played an important role in prevention and control of the current outbreak. Many research institutions have released their primer sets for RT-qPCR. If the variant sites were located in the primer regions, the efficiency of RT-qPCR would be reduced, thus possibly causing false negative results, and leading to unpredictable impact on the diagnosis of patients and the control of this outbreak. Therefore, a comprehensive investigation on 2019-nCoV genome variation is necessary to evaluate the effectiveness of current released RT-qPCR methods.
Here, we analyzed 77 public full-length genome sequences of 2019-nCoV from the GISAID website [Supplementary Table 1]. All the sequences were aligned by using MAFFT v7.450.[1] A total of 85 variant sites were found, all of which are single nucleotide variants. Among the 85 variant sites, seven were shared by two 2019-nCoV sequences and nine were found in three or more sequences [Supplementary Table 2].
We investigated the published 2019-nCoV RT-qPCR detection assays and found a total number of 13 RT-qPCR primer sets designed by eight institutions [Table 1, Supplementary Table 3]. These primers were designed to amplify genes of ORF1ab, Spike (S), Envelope (E), and Nucleocapsid (N). The reverse primers of primer sets 3 and 6 had one mismatch against all of the released 2019-nCoV sequences. Among all the observed variation sites, three (positions on the reference genome IVDC-HB-01: 28291, 28688, and 29200) are separately located on the region of forward primers of the primer sets 7 and 9, and the probe of the primer set 8. It is noted that the above three variations was found in the sequences BetaCoV/Shenzhen/SZTH-003/2020, BetaCoV/Shandong/IVDC-SD-001/2020|EPI_ISL_408482, and BetaCoV/Chongqing/YC01/2020|EPI_ISL_408478, respectively. These variants may affect the RT-qPCR detection efficiency. In particular, variations on the probe region of the primer set 8 may have largely negative effects on detection efficiency according to the previous research.[2]
Table 1.
In conclusion, using any of the five RT-qPCR primer sets mentioned above to detect 2019-nCoV may potentially cause false-negative results. Among the five, two have mismatches and three contain some 2019-nCoV genome variants which occurred during the outbreak. It is worth noting that the three primer sets containing variants are all located on the N gene. Therefore, it is suggested that conservative regions, such as nsp12 (RdRp) gene, would be preferable primer targets. Although the multiple-targets designation of RT-qPCR protocol would reduce the false-negative results caused by genome variation, more careful performance evaluation of the currently used primers is needed. Moreover, it is necessary to keep continuous surveillance on the genome variants and their effects on the RT-qPCR assays during the whole outbreak.
Acknowledgements
We gratefully acknowledge the Authors, the Originating and Submitting Laboratories for their sequences and metadata shared through GISAID, on which this research is based. All submitters of data may be contacted directly via https://www.gisaid.org.
Funding
This work was supported by the National Key Research & Development Program of China (No. 2020YFC0840900) and the Beijing Municipal Science and Technology Project (No. Z201100001020004).
Conflicts of interest
None.
Supplementary Material
Supplementary Material
Supplementary Material
Footnotes
How to cite this article: Fan H, Zhang XXL, Zhang YW, Huang Y, Teng Y, Guo Y, Mi ZQ, Yang RF, Song YJ, Cui YJ. In silico assessment of the impact of 2019 novel coronavirus genomic variation on the efficiency of published real-time quantitative polymerase chain reaction detection assays. Chin Med J 2020;133:1612–1613. doi: 10.1097/CM9.0000000000000817
References
- 1.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 2013; 30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Süss B, Flekna G, Wagner M, Hein I. Studying the effect of single mismatches in primer and probe binding regions on amplification curves and quantification in real-time PCR. J Microbiol Methods 2009; 76:316–319. doi: 10.1016/j.mimet.2008.12.003. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.