Skip to main content
Wiley - PMC COVID-19 Collection logoLink to Wiley - PMC COVID-19 Collection
. 2021 Oct 26;43(12):2100194. doi: 10.1002/bies.202100194

There is still no evidence of SARS‐CoV‐2 laboratory origin: Response to Segreto and Deigin (10.1002/bies.202100137)

Alexander Tyshkovskiy 1,2, Alexander Y Panchin 3,
PMCID: PMC8646886  PMID: 34697827

Abstract

The causative agent of COVID‐19 SARS‐CoV‐2 has led to over 4 million deaths worldwide. Understanding the origin of this coronavirus is important for the prevention of future outbreaks. The dominant point of view that the virus transferred to humans either directly from bats or through an intermediate mammalian host has been challenged by Segreto and Deigin, who claim that the genome of SARS‐CoV‐2 has certain features suggestive of its artificial creation. Following their response to our commentary, here we continue the discussion of the proposed arguments for this hypothesis. We show that neither the existence of a furin cleavage site in SARS‐CoV‐2, nor the presence of specific sequences within the nucleotide insertion encoding that site are evidence for intelligent design. We also explain why existing genetic data, viral diversity and past human history suggest that a natural origin of the virus is the most likely scenario. Genetic evidence suggesting otherwise is yet to be presented.

Keywords: comparative genomics, coronavirus, COVID‐19, evolution, furin cleavage site, SARS‐CoV‐2 laboratory origin, SARS‐CoV‐2


Recently discovered bat CoVs share higher genetic similarity with SARS‐CoV‐2, than those previously described. SARS‐CoV‐2 furin site is non‐canonical. The prior probabilities of natural virus origins are higher given past human history and present viral diversity. Other arguments are discussed. The SARS‐CoV‐2 artificial creation hypothesis is not supported by evidence.

graphic file with name BIES-43-2100194-g001.jpg


Abbreviations

CoV

coronavirus

RBD

receptor binding domain

FCS

furin cleavage site

WIV

Wuhan Institute of Virology

nt

nucleotide

INTRODUCTION

More than a year after the start of the COVID‐19 pandemic, the origin of SARS‐CoV‐2 remains a widely discussed topic. Some proponents of the lab origin hypothesis including Segreto and Deigin have claimed that the genome of the coronavirus (CoV) has certain features “which could be consistent with a lab origin”, including (i) the similarity of the SARS‐CoV‐2 backbone and receptor binding domain (RBD) with the backbone of bat CoV RaTG13 and RBD of pangolin CoV MP789 respectively, and (ii) the presence of a 12 nucleotide insertion that resulted in a formation of S1/S2 furin cleavage site (FCS).[ 1 , 2 ] Even though the authors themselves admit that these observations are also consistent with the scenario of natural emergence, that is, “the genetic structure of SARS‐CoV‐2 is consistent with both natural or laboratory origin”, they use it as an argument for the hypothesis of SARS‐CoV‐2 artificial creation.

This line of argumentation appears to be scientifically invalid since any genetic structure of the virus would be consistent with some scenario of laboratory engineering. Even if one finds a natural reservoir with a close relative of the original Wuhan SARS‐CoV‐2 strain demonstrating 99.8% genome similarity, one could still claim that the remaining ∼60 mutations in the genome were introduced in the Wuhan Institute of Virology (WIV). Therefore, given that the discovery of a complete clone of one of the earliest SARS‐CoV‐2 representatives is highly improbable, the general hypothesis that the virus has been modified in the lab doesn't appear to be falsifiable and therefore doesn't meet the necessary criteria for scientific hypotheses sensu Popper, unless a very specific scenario is presented.[ 3 ] In contrast, the natural evolution scenario can be falsified by both genetic sequence analysis and formal investigations.

In addition, the hypothesis of natural origin has a higher prior probability given past human history as well as present viral abundance and diversity. Indeed, among more than 80 viral emerging infectious diseases since 1940 none have been caused by a genetically modified virus,[ 4 ], and only one, the 1977 Russian flu pandemic, is speculated to have originated in the laboratory through a live‐vaccine trial,[ 5 ] although no complete evidence for this scenario has been established.[ 6 ] To date there are over 200 known circulating viruses that infect humans, none of which are a result of genetic engineering and only one—the Marburg virus—has leaked from a lab prior to being described.[ 7 ] Remarkably, even in the latter case the infectious agent has originated in nature without laboratory manipulations. Finally, it is estimated that currently wildlife harbors hundreds of thousands of viruses with zoonotic potential, orders of magnitude more than any laboratory.[ 8 ]

In this regard, to be considered plausible, the hypothesis of lab origin of SARS‐CoV‐2 should be accompanied with a specific scenario and strong arguments that would significantly favor it over the more probable and parsimonious hypothesis of natural origin. As we demonstrate below, no such arguments have been provided by Segreto and Deigin in their initial manuscript and response to our commentary.[ 9 ] Meanwhile, the evidence in favor of SARS‐CoV‐2 natural origin has been growing and is reviewed elsewhere.[ 10 ]

POINTS OF DISAGREEMENT WITH THE SEGRETO / DEIGIN HYPOTHESIS

  1. In our criticism of Segreto's and Deigin's lab leak hypothesis we provided a bioinformatic analysis, showing that SARS‐CoV‐2 could not have been made from coronaviruses RaTG13 and MP789. The authors responded that: “we never claim that RaTG13 itself is a ‘proposed ancestor’ or the backbone used for a possible construction of SARS‐CoV‐2″ and emphasized that their hypothesis was about a “RaTG13‐like backbone and an RBD from a MP789‐like pangolin CoV”.[ 2 ]

Although the authors didn't claim that RaTG13 itself was an ancestor of SARS‐CoV‐2 in their article in Bioessays, they explicitly and repeatedly stated that in other resources, including their online blogs:

“And CoV2 is an obvious chimera (though not nesessarily a lab‐made one), which is based on the ancestral bat strain RaTG13, in which the receptor binding motif (RBM) in its spike protein is replaced by the RBM from a pangolin strain, and in addition, a small but very special stretch of four amino acids is inserted, which creates a furin cleavage site that, as virologists have previously established, significantly expands the “repertoire” of the virus in terms of whose cells it can penetrate”.[ 11 ]

Remarkably, the blog post hasn't been corrected after our commentary as we are writing this response.

In contrast with this detailed hypothesis, Segreto's and Deigin's article in BioEssays contains no definition of RaTG‐13‐like or MP789‐like viruses, allowing for multiple interpretations.[ 1 ] After all, SARS‐CoV‐2 itself has an over 96% nucleotide identity to RaTG13 and therefore could be considered “RaTG13‐like”. In this case the hypothesis becomes trivial: SARS‐CoV‐2 was created from SARS‐CoV‐2.

For this reason, in our response we divided this hypothesis into two more specific scenarios:

  1. SARS‐CoV‐2 was made from viruses that are nearly identical to RaTG13 and MP789.

  2. SARS‐CoV‐2 was made from two other distinct and still unpublished viruses.

We then provided arguments against both scenarios.

The first scenario is not consistent with the bioinformatic analysis provided in our original commentary, which still holds true for all RaTG13‐like and MP789‐like viruses discovered since 2013 with at least ∼98% nucleotide identity. Moreover, since the start of the pandemic, new naturally occurring coronaviruses such as RacCS203,[ 12 ] RpYN06,[ 13 ] PrC31,[ 14 ] and RmYN02[ 15 ] have been discovered that are more similar to SARS‐CoV‐2 in their polyprotein 1ab genomic sequences than RaTG13. Remarkably, several coronaviruses recently found in horseshoe bats in Laos share RBDs, which are more similar to that of SARS‐CoV‐2 based on both nucleotide (93.6% similarity) and amino acid (97.4% similarity) sequences compared to RBDs of RaTG13 (85.5% and 89.2%, respectively) and MP789 (86.6% and 96.9%, respectively).[ 16 ] Finally, one of these bat coronaviruses, BANAL‐52, demonstrates even higher overall genome similarity with SARS‐CoV‐2 (96.8%) than RaTG13 (96.2%). This provides the most conclusive evidence that known viruses from WIV cannot be viewed as SARS‐CoV‐2′s templates. It is understandable why this first sub‐hypothesis is especially appealing to lab origin proponents: the fact that RaTG13 has been identified by researchers from the WIV provides an additional link between the lab and the pandemic. But this scenario appears to be implausible.

The second scenario assumes that scientists at the WIV combined two undescribed coronaviruses, which are relatively distinct to RaTG13 and MP789 (> 2% nucleotide difference) but share higher similarity with the backbone and RBD encoding fragment of SARS‐CoV‐2, respectively. Although this assumption could be true, it is less probable than the hypothesis of natural recombination between the unknown coronaviruses, given the much higher prevalence of coronaviruses and recombination events in bat populations compared to laboratories.[ 8 , 17 ] Moreover, since naturally occurring coronaviruses with higher similarity to SARS‐CoV‐2 within both backbone and RBD at the same time have been recently discovered in Laos, the hypothesis of artificial “recombination” between two unknown viruses becomes even less plausible. In the end, this scenario of SARS‐CoV‐2 creation appears to be less parsimonious than the hypothesis of a laboratory leak of a naturally evolved virus or a natural spillover.

Thus, both scenarios of SARS‐CoV‐2 construction from the RaTG13‐like and MP789‐like CoVs are not supported by available genomic data for relative coronaviruses and are further weakened by new discoveries.[ 16 ] Moreover, Boni et al. have argued that under the most parsimonious scenario of S protein evolution the RBD of SARS‐CoV‐2, it is the RBD of RaTG13 that has emerged as a result of recombination.[ 18 ]

  • 2

    On the other hand, the genetic similarity of SARS‐CoV‐2 with other coronaviruses appears to be completely consistent with the hypothesis of its natural origin. Specifically, we observed similar substitution patterns distinguishing nucleotide sequences of SARS‐CoV‐2 and SARS‐CoV from their closest described bat relatives RaTG13 and Rs4231.[ 19 ] While this analysis does not exclude all of the numerous proposed scenarios of SARS‐CoV‐2 intelligent design, it suggests against the use of artificially accelerated mutagenesis during SARS‐CoV‐2 evolution prior to the pandemic. For example, the guanosine analog ribavirin mutagen specifically increases the rate of C > U and G > A transitions.[ 20 ]

The RBD of SARS‐CoV‐2 was also shown to bind ACE2 receptors of bats, which makes the scenario of its emergence in these animals plausible. Commenting the hypothesis of bat origin of SARS‐CoV‐2, Segreto and Deigin state that the RBD of the coronavirus is “peculiar because it is characterized by a very high binding affinity to the human ACE2 receptor, but it binds poorly to the bat ACE2 receptor”.[ 2 ] However, the reference used by the authors to support this claim is not an experimental study but a computational analysis of SARS‐CoV‐2 binding to various vertebrate ACE2 receptors.[ 21 ] Moreover, several crucial experimental studies conducted afterwards appear to challenge this claim. Yan et al. used an infectious assay to show that the ACE2 of 25 out of tested 46 bat species supports SARS‐CoV‐2 entry and that for several bat species the infection rate was comparable to human ACE2.[ 22 ] Schlottau et al. also showed transient infection and virus transmission after SARS‐CoV‐2 exposure in a species of fruit bats.[ 23 ]

In addition, the adaptation of bat coronavirus RBD to the human ACE2 receptor doesn't seem to be an unlikely event. For example, it was shown that a single T403R mutation in RaTG13 spike protein allows this bat coronavirus to utilize the human ACE2 receptor for infection of human cells and intestinal organoids.[ 24 ] Thus, naturally occurring sarbecoviruses can easily change their preference over ACE2 receptors of different hosts. This is further confirmed by the discovery of naturally occurring bat coronaviruses in Laos, which share RBDs highly similar to that of SARS‐CoV‐2. Remarkably, their spikes have been shown to bind the human ACE2 receptor and infect human cells in a pseudovirus entry assay with the efficiency comparable to SARS‐CoV‐2 Wuhan strain.[ 16 ] Therefore, the higher affinity of SARS‐CoV‐2 to the human cells compared to some bat species appears to be consistent with the hypothesis of bat origin of the coronavirus with the subsequent tuning of its RBD to the new host.

  • 3

    In our initial analysis of Segreto's and Deigin's hypothesis we have pointed out that their finding of a FauI site in the 12‐nt FCS insertion encoding sequence is indistinguishable from a random coincidence because there are many possible commercially available restriction sites and there is a high probability (∼99.5%) of finding at least one position cut by some enzyme in a nucleotide fragment of this length.[ 9 ]

In their response Segreto and Deigin provided several alternative analyses of this probability based on a number of assumptions that are different from ours such as limiting the search to restriction enzymes that recognize 5 bp or larger sequences. In the end, the authors concluded that we have “underestimated the probability of a 12‐nt insertion not containing a 5+ restriction enzyme cut site by 2 orders of magnitude: rather than the 0.5% implied by their [our] calculation, the actual probability is around 50%”.[ 2 ]

We could discuss which statistical models and assumptions are more reasonable in the given circumstances. However, this is not necessary, since the smallest probability achieved by Segreto and Deigin in their own calculation (p = 0.468) is still far higher, by an order of magnitude, than the relaxed threshold (p = 0.05) used to reject the null hypothesis in statistical analysis. In this case, the null hypothesis is that the FauI restriction site has emerged in the 12‐nt insertion by coincidence, and Segreto's and Deigin's statistical analysis provides no significant deviation from this scenario.

Thus, the authors themselves have confirmed our thesis that the finding of some restriction site in a 12‐nt fragment is consistent with random coincidence. Therefore, it shouldn't be used as an argument for the scenario of artificial construction of SARS‐CoV‐2, in the same way as two “heads” or “tails” in a row do not suggest that a coin is biased.

Segreto and Deigin also speculate that “the FauI site is notable for its unique property which enables it to be used to screen precisely whether the two arginines (R) in the newly created RRAR polybasic cleavage site are still present, as FauI’s recognition sequence is created by the CGGCGG codons coding for RR”.

This argument seems to be incorrect, since the FauI recognition sequence in the SARS‐CoV‐2 FCS (CGG‐CGG‐GCA‐CGU) consists of five nucleotides starting from the 3rd letter of the first CGG codon (G‐CGG‐G). As a result, numerous point mutations in the first CGG codon leading to the substitution of arginine to other amino acids would leave FauI recognition site unchanged, making it useless for the screening of the RRAR polybasic FCS (e.g., GGG‐CGG‐GCA‐GGU encoding for GRAG sequence would still be recognized by the FauI restriction enzyme).

  • 4

    Following the discussion of the FauI site in the 12‐nt FCS insertion sequence, Segreto and Deigin note that “CGG codons are the rarest codons to code for arginine in SARS‐CoV‐2 and its related CoVs, unlike in humans where they are the most frequent”. However, this observation doesn't appear to support the hypothesis of the FCS design in the lab either, since the authors have not provided any examples of the use of CGGCGG codons within artificial furin cleavage sites.

Moreover, known examples of FCS designed in the labs were preferentially introduced by substitutions rather than insertions and were based on the canonical R‐X‐[R/K]‐R furin motif considered to be consensus.[ 25 , 26 , 27 ] This does not match the RRAR site found in SARS‐CoV‐2. The SARS‐CoV‐2 FCS is also different from those present in other well studied coronaviruses, such as HKU1[ 28 ] and MERS,[ 29 ] the use of which would be reasonable from an experimental perspective, unlike the use of the previously undescribed RRAR.

At the same time, various FCS have emerged independently multiple times in the natural history of coronaviruses, indicating that such events are not rare. Natural generation of novel multibasic cleavage sites via insertions have also been reported for other viruses, such as H5 and H7 influenza viruses.[ 30 , 31 , 32 ]

Although the CGG codon is rarely used by the sarbecoviruses, the naturally occurring Rabbit coronavirus HKU14 (NCBI Reference Sequence: NC_017083.1) also has a CGGCGG sequence encoding two arginines, demonstrating that such coding sequences can emerge in beta coronaviruses in nature. Moreover, if the 12‐nt sequence was acquired as an insertion via template switch, the source CGGCGG nucleotides were not required to code for arginines, because they were not necessarily in frame. This scenario seems plausible, since it has been shown that insertions are not infrequent in SARS‐CoV‐2 and that “although this insert has a high GC‐content compared to the genomic average of SARS‐CoV‐2, it falls within the GC‐content range of the long inserts'' and is located within 20 nucleotides of a template switch hotspot at position 22 582.[ 33 ]

Furthermore, Holmes et al. reported that “both CGG codons are more than 99.8% conserved among the > 1,800,000 near complete SARS‐CoV‐2 genomes sequenced to date,”[ 10 ] suggesting that these codons are preserved by stabilizing selection and, therefore, may be associated with high fitness. In this regard, the FCS insertion structure appears to be well consistent with the hypothesis of natural origin and subsequent evolution‐driven codon optimization.

In the end of their response to our commentary, Segreto and Deigin state that “the out‐of‐frame insertion of the furin cleavage site is not proven to be natural”.[ 2 ] It is important to reiterate that there is no way to prove something is natural based on genetic sequence alone, since one can reproduce any naturally occurring mutation in a lab setting. As we have pointed out in our initial response, it is not unusual for insertions with the length of a multiple of three nucleotides to emerge in viral protein‐coding sequences even if they occur in the middle of existing codons, because they do not cause a frameshift in the original viral protein coding sequence.[ 9 , 34 ] Such cases have been observed even for some strains of SARS‐CoV‐2. Since the start of the pandemics, more than 20 long insertions in the coronavirus genome have been identified, some of which are even bigger than the 12‐nt FCS insertion and have also resulted in a codon split.[ 33 ]

CONCLUSIONS

The genetic features of SARS‐CoV‐2 described by Segreto and Deigin, including nucleotide sequence similarity to several other coronaviruses and the presence of a 12‐nt FCS insertion, do not favor artificial creation over natural evolution. The majority of arguments presented by the authors appear to be severely flawed. Some observations, such as the use of a non‐canonical furin cleavage site, poorly fit the lab origin scenario, thus providing additional support for SARS‐CoV‐2 natural emergence. Past human history and present viral diversity in wildlife also suggest a higher prior probability of the virus's natural origin compared to intelligent design. Taken together, this allows us to conclude that natural emergence remains the most plausible scenario of SARS‐CoV‐2 origin and evidence to the contrary is yet to be presented.

CONFLICT OF INTEREST

The author declares no conflict of interests.

ACKNOWLEDGMENTS

Alexander Panchin and Alexander Tyshkovskiy do not have any conflicts of interest. A.T. is a member of Interdisciplinary Scientific and Educational School of Moscow University “Molecular Technologies of the Living Systems and Synthetic Biology.”

Tyshkovskiy, A. , & Panchin, A. Y. (2021). There is still no evidence of SARS‐CoV‐2 laboratory origin: Response to Segreto and Deigin (10.1002/bies.202100137). BioEssays, 43, e2100194. 10.1002/bies.202100194

DATA AVAILABILITY STATEMENT

Data are available by request from the corresponding authors.

REFERENCES

  • 1. Segreto, R. , & Deigin, Y. (2020). The genetic structure of SARS‐CoV‐2 does not rule out a laboratory origin: SARS‐COV‐2 chimeric structure and furin cleavage site might be the result of genetic manipulation. BioEssays, 43, e2000240. 10.1002/bies.202000240 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Deigin, Y. , & Segreto, R. (2021). The genetic structure of SARS‐CoV‐2 is consistent with both natural or laboratory origin: Response to Tyshkovskiy and Panchin (10.1002/bies.202000325). BioEssays, 43, e2100137. July, 2100137. 10.1002/bies.202100137 [DOI] [PubMed] [Google Scholar]
  • 3. Dallavilla, T. , Bertelli, M. , Morresi, A. , Bushati, V. , Stuppia, L. , Beccari, T. , Chiurazzi, P. , & Marceddu, G. (2020). Bioinformatic analysis indicates that SARS‐CoV‐2 is unrelated to known artificial coronaviruses. European Review for Medical and Pharmacological Sciences, 24(8), 4558–4564. 10.26355/eurrev_202004_21041 [DOI] [PubMed] [Google Scholar]
  • 4. Jones, K. E. , Patel, N. G. , Levy, M. A. , Storeygard, A. , Balk, D. , Gittleman, J. L. , & Daszak, P. (2008). Global trends in emerging infectious diseases. Nature, 451(7181), 990–993. 10.1038/nature06536 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Rozo, M. , & Gronvall, G. K. (2015). The reemergent 1977 H1N1 strain and the gain‐of‐function debate. MBio, 6(4), 1–6. 10.1128/mBio.01013-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Kung, H. C. , Jen, K. F. , Yuan, W. C. , Tien, S. F. , & Chu, C. M. (1978). Influenza in China in 1977: Recurrence of influenzavirus A subtype H1N1. Bulletin of the World Health Organization, 56(6), 913–918. [PMC free article] [PubMed] [Google Scholar]
  • 7. Brauburger, K. , Hume, A. J. , Mühlberger, E. , & Olejnik, J. (2012). Forty‐five years of marburg virus research. Viruses, 4(10), 1878–1927. 10.3390/v4101878 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Carroll, D. , Daszak, P. , Wolfe, N. D. , Gao, G. F. , Morel, C. M. , Morzaria, S. , Pablos‐Méndez, A. , Tomori, O. , & Mazet, J. A. K. (2018). The global virome project. Science, 359(6378), 872–874. 10.1126/science.aap7463 [DOI] [PubMed] [Google Scholar]
  • 9. Tyshkovskiy, A. , & Panchin, A. Y. (2021). There is no evidence of SARS‐CoV‐2 laboratory origin: Response to Segreto and Deigin ( 10.1002/bies.202000240). BioEssays, 43, e2000325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Holmes, E. C. , Goldstein, S. A. , Rasmussen, A. L. , Robertson, D. L. , Crits‐Christoph, A. , Wertheim, J. O. , Anthony, S. J. , Barclay, W. S. , Boni, M. F. , Doherty, P. C. , Farrar, J. , Geoghegan, J. L. , Jiang, X. , Leibowitz, J. L. , Neil, S. J. D. , Skern, T. , Weiss, S. R. , Worobey, M. , Andersen, K. G. , … Rambaut, A. (2021). The origins of SARS‐CoV‐2: A critical review. Cell, 184, 4848‐4856. 10.5281/zenodo.5112546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Deigin, Y. (2020). Lab‐Made? SARS‐CoV‐2 Genealogy Through the Lens of Gain‐of‐Function Research . https://yurideigin.medium.com/lab‐made‐cov2‐genealogy‐through‐the‐lens‐of‐gain‐of‐function‐research‐f96dd7413748
  • 12. Wacharapluesadee, S. , Tan, C. W. , Maneeorn, P. , Duengkae, P. , Zhu, F. , Joyjinda, Y. , Kaewpom, T. , Chia, W. N. , Ampoot, W. , Lim, B. L. , Worachotsueptrakun, K. , Chen, V. C. W. , Sirichan, N. , Ruchisrisarod, C. , Rodpan, A. , Noradechanon, K. , Phaichana, T. , Jantarat, N. , Thongnumchaima, B. , … Wang, L. F. (2021). Evidence for SARS‐CoV‐2 related coronaviruses circulating in bats and pangolins in Southeast Asia. Nature Communications, 12, 972. 10.1038/s41467-021-21240-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Zhou, H. , Ji, J. , Chen, X. , Bi, Y. , Li, J. , Wang, Q. , Hu, T. , Song, H. , Zhao, R. , Chen, Y. , Cui, M. , Zhang, Y. , Hughes, A. C. , Holmes, E. C. , & Shi, W. (2021). Identification of novel bat coronaviruses sheds light on the evolutionary origins of SARS‐CoV‐2 and related viruses. Cell, 184, 4380‐4391. 10.1016/j.cell.2021.06.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Li, L. , Wang, J. , Ma, X. , Sun, X. , Li, J. , Yang, X. , Shi, W. , & Duan, Z. (2021). A novel SARS‐CoV‐2 related coronavirus with complex recombination isolated from bats in Yunnan province, China. Emerging Microbes & Infections, 10, 1683‐1690. 10.1080/22221751.2021.1964925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Zhou, H. , Chen, X. , Hu, T. , Li, J. , Song, H. , Liu, Y. , Wang, P. , Liu, D. , Yang, J. , Holmes, E. C. , Hughes, A. C. , Bi, Y. , & Shi, W. (2020). A novel bat coronavirus closely related to SARS‐CoV‐2 contains natural insertions at the S1/S2 cleavage site of the spike protein. Current Biology, 30, 2196‐2203.e3. 10.1016/j.cub.2020.05.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Temmam, S. , Salazar, E. B. , Munier, S. , & Bonomi, M. (2021). Coronaviruses with a SARS‐CoV‐2‐like receptor‐binding domain allowing ACE2‐mediated entry into human cells isolated from bats of Indochinese peninsula. Research Square. [Google Scholar]
  • 17. Hu, B. , Zeng, L. P. , Yang, X. Lou , Ge, X. Y. , Zhang, W. , Li, B. , Xie, J. Z. , Shen, X. R. , Zhang, Y. Z. , Wang, N. , Luo, D. S. , Zheng, X. S. , Wang, M. N. , Daszak, P. , Wang, L. F. , Cui, J. , & Shi, Z. L. (2017). Discovery of a rich gene pool of bat SARS‐related coronaviruses provides new insights into the origin of SARS coronavirus. PLoS Pathogens, 13(11), 1–27. 10.1371/journal.ppat.1006698 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Boni, M. F. , Lemey, P. , Jiang, X. , Lam, T. T. , Perry, B. W. , Castoe, T. A. , Rambaut, A. , & Robertson, D. L. (2020). Evolutionary origins of the SARS‐CoV‐2 sarbecovirus lineage responsible for the COVID‐19 pandemic. Nature Microbiology, 5, 1408–1417. 10.1038/s41564-020-0771-4 [DOI] [PubMed] [Google Scholar]
  • 19. Panchin, A. Y. , & Panchin, Y. V. (2020). Excessive G‐U transversions in novel allele variants in SARS‐CoV‐2 genomes. PeerJ, 8, e9648. 10.7717/peerj.9648 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Galli, A. , Mens, H. , Gottwein, J. M. , Gerstoft, J. , & Bukh, J. (2018). Antiviral effect of ribavirin against HCV associated with increased frequency of G‐TO‐A and C‐TO‐U transitions in infectious cell culture model. Scientific Reports, 8(1), 1–13. 10.1038/s41598-018-22620-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Damas, J. , Hughes, G. M. , Keough, K. C. , Painter, C. A. , Persky, N. S. , Corbo, M. , Hiller, M. , Koepfli, K. P. , Pfenning, A. R. , Zhao, H. , Genereux, D. P. , Swofford, R. , Pollard, K. S. , Ryder, O. A. , Nweeia, M. T. , Lindblad‐Toh, K. , Teeling, E. C. , Karlsson, E. K. , & Lewin, H. A. (2020). Broad host range of SARS‐CoV‐2 predicted by comparative and structural analysis of ACE2 in vertebrates. Proceedings of the National Academy of Sciences of the United States of America, 117(36), 22311–22322. 10.1073/pnas.2010146117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Yan, H. , Jiao, H. , Liu, Q. , Zhang, Z. , Xiong, Q. , Wang, B. J. , Wang, X. , Guo, M. , Wang, L. F. , Lan, K. , Chen, Y. , & Zhao, H. (2021). ACE2 receptor usage reveals variation in susceptibility to SARS‐CoV and SARS‐CoV‐2 infection among bat species. Nature Ecology and Evolution, 5(5), 600–608. 10.1038/s41559-021-01407-1 [DOI] [PubMed] [Google Scholar]
  • 23. Schlottau, K. , Rissmann, M. , Graaf, A. , Schön, J. , Sehl, J. , Wylezich, C. , Höper, D. , Mettenleiter, T. C. , Balkema‐Buschmann, A. , Harder, T. , Grund, C. , Hoffmann, D. , Breithaupt, A. , & Beer, M. (2020). SARS‐CoV‐2 in fruit bats, ferrets, pigs, and chickens: an experimental transmission study. The Lancet Microbe, 1(5), e218–e225. 10.1016/S2666-5247(20)30089-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Zech, F. , Schniertshauer, D. , Jung, C. , Herrmann, A. , Nchioua, R. , Prelli Bozzo, C. , Volcic, M. , Koepke, L. , Heller, S. , Kleger, A. , Jacob, T. , Conzelmann, K.‐K. , Ensser, A. , Sparrer, K. M. , & Kirchhoff, F. (2021). Spike mutation T403R allows bat coronavirus RaTG13 to use human ACE2. BioRxiv, 2021.05.31.446386. 10.1101/2021.05.31.446386 [DOI] [Google Scholar]
  • 25. Klenk, H. D. , & Garten, W. (1994). Host cell proteases controlling virus pathogenicity. Trends in Microbiology, 2(2), 39–43. 10.1016/0966-842X(94)90123-6 [DOI] [PubMed] [Google Scholar]
  • 26. Zimmer, G. , Conzelmann, K.‐K. , & Herrler, G. (2002). Cleavage at the furin consensus sequence RAR/KR 109 and presence of the intervening peptide of the respiratory syncytial virus fusion protein are dispensable for virus replication in cell culture. Journal of Virology, 76(18), 9218–9224. 10.1128/jvi.76.18.9218-9224.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Yamada, Y. , & Liu, D. X. (2009). Proteolytic activation of the spike protein at a novel RRRR/S motif is implicated in furin‐dependent entry, syncytium formation, and infectivity of coronavirus infectious bronchitis virus in cultured cells. Journal of Virology, 83(17), 8744–8758. 10.1128/jvi.00613-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Wu, Y. , & Zhao, S. (2021). Furin cleavage sites naturally occur in coronaviruses. Stem Cell Research, 50, 102115. 10.1016/j.scr.2020.102115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Örd, M. , Faustova, I. , & Loog, M. (2020). The sequence at Spike S1/S2 site enables cleavage by furin and phospho‐regulation in SARS‐CoV2 but not in SARS‐CoV1 or MERS‐CoV. Scientific Reports, 10(1), 1–10. 10.1038/s41598-020-74101-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Decha, P. , Rungrotmongkol, T. , Intharathep, P. , Malaisree, M. , Aruksakunwong, O. , Laohpongspaisan, C. , Parasuk, V. , Sompornpisut, P. , Pianwanit, S. , Kokpol, S. , & Hannongbua, S. (2008). Source of high pathogenicity of an avian influenza virus H5N1: Why H5 is better cleaved by furin. Biophysical Journal, 95(1), 128–134. 10.1529/biophysj.107.127456 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Taubenberger, J. K. , Reid, A. H. , Lourens, R. M. , Wang, R. , Jin, G. , & Fanning, T. G. (2005). Characterization of the 1918 influenza virus polymerase genes. Nature, 437(7060), 889–893. 10.1038/nature04230 [DOI] [PubMed] [Google Scholar]
  • 32. Dietze, K. , Graaf, A. , Homeier‐Bachmann, T. , Grund, C. , Forth, L. , Pohlmann, A. , Jeske, C. , Wintermann, M. , Beer, M. , Conraths, F. J. , & Harder, T. (2018). From low to high pathogenicity—Characterization of H7N7 avian influenza viruses in two epidemiologically linked outbreaks. Transboundary and Emerging Diseases, 65(6), 1576–1587. 10.1111/tbed.12906 [DOI] [PubMed] [Google Scholar]
  • 33. Garushyants, S. K. , Rogozin, I. B. , & Koonin, E. V. (2021). Insertions in SARS‐CoV‐2 genome caused by template switch and duplications give rise to new variants of potential concern. BioRxiv. 10.1101/2021.04.23.441209 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Perdue, M. L. , García, M. , Senne, D. , & Fraire, M. (1997). Virulence‐associated sequence duplication at the hemagglutinin cleavage site of avian influenza viruses. Virus Research, 49, 173–186. 10.1016/S0168-1702(97)01468-8 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data are available by request from the corresponding authors.


Articles from Bioessays are provided here courtesy of Wiley

RESOURCES