Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2022 Feb 17;17(2):e0263563. doi: 10.1371/journal.pone.0263563

Consecutive deletions in a unique Uruguayan SARS-CoV-2 lineage evidence the genetic variability potential of accessory genes

Yanina Panzera 1,#, Lucía Calleros 1,#, Natalia Goñi 2, Ana Marandino 1, Claudia Techera 1, Sofía Grecco 1, Natalia Ramos 3, Sandra Frabasile 3, Gonzalo Tomás 1, Emma Condon 1, María Noel Cortinas 2, Viviana Ramas 2, Leticia Coppola 2, Cecilia Sorhouet 4, Cristina Mogdasy 2, Héctor Chiparelli 2, Juan Arbiza 3, Adriana Delfraro 3,*, Ruben Pérez 1,*
Editor: Herman Tse5
PMCID: PMC8853529  PMID: 35176063

Abstract

Deletions frequently occur in the six accessory genes of SARS-CoV-2, but most genomes with deletions are sporadic and have limited spreading capability. Here, we analyze deletions in the ORF7a of the N.7 lineage, a unique Uruguayan clade from the Brazilian B.1.1.33 lineage. Thirteen samples collected during the early SARS-CoV-2 wave in Uruguay had deletions in the ORF7a. Complete genomes were obtained by Illumina next-generation sequencing, and deletions were confirmed by Sanger sequencing and capillary electrophoresis. The N.7 lineage includes several individuals with a 12-nucleotide deletion that removes four amino acids of the ORF7a. Notably, four individuals underwent an additional 68-nucleotide novel deletion that locates 44 nucleotides downstream in the terminal region of the same ORF7a. The simultaneous occurrence of the 12 and 68-nucleotide deletions fuses the ORF7a and ORF7b, two contiguous accessory genes that encode transmembrane proteins with immune-modulation activity. The fused ORF retains the signal peptide and the complete Ig-like fold of the 7a protein and the transmembrane domain of the 7b protein, suggesting that the fused protein plays similar functions to original proteins in a single format. Our findings evidence the remarkable dynamics of SARS-CoV-2 and the possibility that single and consecutive deletions occur in accessory genes and promote changes in the genomic organization that help the virus explore genetic variations and select for new, higher fit changes.

Introduction

Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) is a novel Betacoronavirus of the subgenus Sarbecovirus and the causative agent of the pandemic coronavirus disease 2019 (COVID-19) [1].

The SARS-CoV-2 genome is a ~ 30 kb long, single-stranded, positive RNA molecule, with the typical gene organization of coronaviruses. There are 12 open reading frames (ORFs) that encode 26 proteins, including 16 non-structural proteins (nsp1 to nsp16), four structural proteins (M, N, S, and E), and six accessory proteins (3a, 6, 7a, 7b, 8, 10). Accessory proteins are dispensable for replication in cell culture but may have regulatory roles during the viral cycle and thus contribute to the virus fitness by increasing the ability to evade the host’s innate immune response [2,3]. Coronavirus groups usually differ in those accessory proteins, and more infective species have specific pathogenic sets [4].

SARS-CoV-2 has accumulated many variations since its emergence in late 2019, including some mutations in the viral spike (S) protein that underwent rapid growth and spread. These S changes have generated concern and interest, particularly the D614G, N501Y, E484K, and K417N amino acid replacements that have occurred in the receptor-binding domain [5].

Nucleotide substitutions that produce amino acid replacements constitute the primary raw material for genetic variation; however, deletions and insertions (indels) are critical elements in the coronavirus macro and microevolution [6,7]. There is also an apparent link between both processes, as some punctual mutations occur near indels [8].

Indels in coronaviruses remain uncorrected by the proofreading activity of nsp14-exoribonuclease and enhanced by the discontinuous RNA synthesis of the polymerase machinery [5]. Although most indels likely negatively affect viral fitness [9], a small number emerge and spread in viral populations, suggesting a positive effect on viral fitness and adaptive evolution. Thus, the analysis of these indels may reveal evolutionary trends and provide new insight into the surprising variability and rapidly spreading capability that SARS-CoV-2 has shown since its emergence.

Recent evidence establishes the existence of recurrent deletion regions that map to defined antibody epitopes [10]. Furthermore, these deletions appear to emerge independently in a convergent pattern of viral antigenic evolution that may confer resistance to neutralizing antibodies. An excellent example of these recurrent deletions is those acquired in the N-terminal domain of the S glycoprotein during long-term infections of immunocompromised patients, supporting a role in the evolution by altering defined antibody epitopes [10,11].

Deletions also occur frequently in accessory ORFs with interesting outcomes and potential effects on virus evolution [2,12]. These ORFs modulate the host’s immune response and are becoming the focus of active research to decipher their role in viral pathogenicity. The ORF7a and ORF7b are two contiguous genes, with a short overlap of four nucleotides, which undergo deletions that could strongly affect protein structure [13]. Studies performed in the homologous ORF7a and 7b of SARS-CoV-1 have provided insights into their structure and functions. Both ORFs encode transmembrane proteins localized in the endoplasmic reticulum and Golgi network [14].

The 121-residue protein 7a (p7a) of SARS-CoV-2 consists of a 15-residue N-terminal signal peptide, an 81-residue luminal domain, a 20-residue transmembrane domain (TMD), and a 5-residue cytosolic tail [15]. The luminal domain has a 7-stranded β-sandwich fold typical of the immunoglobulin (Ig) superfamily [16].

The 43-residue protein 7b (p7b) of SARS-CoV-2 has 85.4% identity with its SARS-CoV-1 homologous [15] and consists of an 8-residue N-terminus (external), a 22-residue TMD, and a 13-residue C-terminal domain [14].

Previously, we detected a 12-nucleotide deletion (Δ12) in ORF7a that emerged in a single outbreak of a particular B.1.1.33 clade that was later renamed N.7 lineage [17]. Here, we show that the Δ12 variant is widely extended in the Uruguayan unique N.7 lineage and identified a new variant with two deletions, the already reported Δ12 and a subsequent downstream novel 68-nucleotide deletion in the same ORF7a that produce a fused ORF7ab. Our findings evidence the remarkable variation dynamics of SARS-CoV-2 and the possibility that consecutive deletions occur in accessory genes to promote changes in genomic organization and structure.

Materials and methods

Samples and SARS-CoV-2 diagnosis

Combined nasopharyngeal and oropharyngeal swab samples from Uruguayan patients were collected from July to October 2020 by the Reference Center for Influenza and other Respiratory Viruses, National Institute of Health Laboratories (DLSP-MSP). The collection and analysis of samples were performed according to the Declaration of Helsinki; no specific authorization was required because the activities were conducted as part of routine virological surveillance (anonymously, without identification of patients) by the Uruguayan official Institution (DLSP-MSP). The SARS-CoV-2 diagnosis was performed by RNA extraction with the Qiamp Viral RNA Minikit (Qiagen USA) followed by real-time reverse transcription-polymerase chain reaction (RT-qPCR) using the protocol recommended by the Panamerican Health Organization (PAHO-WHO) [18].

PCR, Sanger sequencing, and capillary electrophoresis

An aliquot of RNA (12μL) was used for cDNA synthesis with Superscript II ® reverse transcriptase (Thermo Fisher, USA) and random primers. cDNA was analyzed by PCR with a single pair of primers to screen for the Δ12 variant by amplifying an ORF7a fragment of approximately 406 bp [17]. Amplicons were analyzed by capillary electrophoresis on a Fragment Analyzer 5200 system using the High Sensitivity NGS Analysis Kit (Agilent Technologies, USA). Amplicons were also subjected to Sanger sequencing in Macrogen (Korea).

Full-length genome amplification using ARTIC protocol

The SARS-CoV-2 genome of thirteen samples was enriched using ARTIC primer scheme version 3; information on the primer sequences and protocols are available at the ARTIC network repository (https://artic.network/ncov-2019).

The Nextera DNA Flex Library Preparation kit (Illumina, USA) with dual indexing was used for library preparation. The libraries were purified with AMPure XP beads (Beckman Coulter, USA). All quantifications were carried out using a Qubit DNA High-Sensitivity kit (Invitrogen, USA). The library’s control quality and length were assessed on a Fragment Analyzer 5200 system (Agilent Technologies, USA) using the Standard Sensitivity NGS Analysis Kit (Agilent Technologies, USA).

Whole-genome sequencing was performed on an Illumina MiniSeq (Illumina, USA) platform using MiniSeqTM Mid Output Reagent Cartridge (300-cycles, paired-end reads). Adapter/quality trimming and filtering raw data were performed with BBDuk, and clean reads were mapped to the consensus genome using Geneious Prime 2020.1.2 (https://www.geneious.com).

Deletion characterization and comparison

The online web application CoV-GLUE (http://cov-glue.cvr.gla.ac.uk/#/deletion) was used to assess genetic changes (single nucleotide polymorphisms and indels) in SARS-CoV-2 genomes [19].

SARS-CoV-2 lineage assignment and phylogenetic analysis

The lineage of the strains was assigned according to the nomenclature system proposed by Rambaut et al. [20] using the Pangolin tool [21].

For phylogenetic analysis, all Uruguayan full-length sequences in the GISAD EpiCoV database [22] were retrieved with associated metadata to generate a comprehensive genome dataset of around eight hundred sequences. The dataset was filtered and reduced to over five hundred for complete sequences with good coverage and quality for further comparative analysis.

DNA alignments were performed with MAFFT [23]. With 1000-replicates bootstrap to support internal nodes, maximum-likelihood trees were inferred in Geneious using FastTree [24] and visualized with the ggTree package in R.

Protein structure

The structure of the p7a was obtained from the protein database (PDB ID: 6W37) and I-TASSER server (https://zhanglab.ccmb.med.umich.edu/COVID-19/).

Transmembrane helices were predicted using a Transmembrane Hidden Markov model (TMHMM) available as a Geneious plugin.

Results

Detection of deletions and genome sequencing

We screened 51 SARS-CoV-2 samples for the presence of the ORF7a deletion (Δ12) in Uruguay. The screening was expected to produce two bioanalyzer electropherogram patterns corresponding to a 406-nt amplicon (wild type variant) and a 394-nt amplicon (Δ12 variant). Nine samples had the Δ12 deletion patterns (394-nt amplicon), but four samples showed a pattern corresponding to an amplicon of approximately 326 nucleotides, indicative of putative larger deletion/s (Fig 1A and 1B). The results of Sanger sequencing evidenced that these smaller amplicons had two deletions, the previous Δ12 and a novel 68-nucleotide deletion (Δ68), located 44 nucleotides downstream in the ORF7a. All four individuals had the same two deletions (12 and 68 nucleotides) confirmed by Sanger sequencing (Fig 1C).

Fig 1.

Fig 1

A: ORF7a amplicon for the wildtype and Δ12+68 variants. Artic primers annotations are visualized by blue arrows. B: Chromatogram peaks after capillary electrophoresis, wildtype, and Δ12+68 variants. C: Zoomed diagram details nucleotide sequence of the double deletion obtained by Sanger sequencing.

We obtained and analyzed the thirteen genomes with single and double deletions. The complete genomes had good average coverage (>1000×) and quality (none or very few unassigned bases). Comparison with the Wuhan-Hu-1 reference showed conserved amino acid substitutions in the following positions: L37F (ORF1ab, nsp6), P323L (ORF1ab, nsp12), D614G (S), I33T (ORF6), and four changes in N (P168S, R203K, G204R, I292T). Previous sequences with the 12-nt deletion obtained from Uruguay (Genbank Accession numbers: MW298637-MW298643) have the same residue differences and almost identical sequences.

All genomes belonged to the N.7 lineage of SARS-CoV-2 (also called B.1.1.33.7). Genomes came from patients that tested positive for SARS- CoV-2 and had different clinical manifestations, including asymptomatic or mild infections with fever, headache, asthenia, cough, ageusia, and odynophagia (Table 1). Most patients recovered without treatment, but two with preexisting conditions (cerebrovascular accident or cancer) died after intensive care treatment (Table 1).

Table 1. Epidemiological data of the samples analyzed in the study.

Sample Accession Date Origin Sex Age Clinical data Epidemiological data
299 OK416091 7/16/2020 Canelones M 67 preexisting condition (cerebrovascular accident), died Shared ICU with 300
300 OK416092 7/16/2020 Canelones F 74 preexisting condition (cancer), died Shared ICU with 299
649 OK416093 8/19/2020 Montevideo M 69 symptomatic mild infection/recovered NA
731 OK416094 9/1/2020 Florida M 73 asymptomatic NA
799 OK416095 9/9/2020 Canelones F 63 symptomatic mild infection/recovered NA
822 OK416096 9/14/2020 Montevideo F 7 asymptomatic NA
839* MZ555813.1 9/16/2020 Montevideo M 64 symptomatic mild infection/recovered NA
879* MZ555814.1 9/17/2020 Montevideo M 46 asymptomatic/recovered NA
840 OK416097 9/16/2020 Maldonado M 65 asymptomatic/recovered NA
899* MZ555811.1 9/22/2020 Canelones F 49 asymptomatic/recovered household contact with 917
917* MZ555812.1 9/22/2020 Canelones M 52 asymptomatic/recovered/reinfected household contact with 899
1021 OK416098 10/5/2020 Soriano F 34 symptomatic/recovered NA
1102 OK416099 10/16/2020 Montevideo M NA symptomatic/recovered NA

All samples have the 12-nt deletion; samples with asterisks also have the 68-nt deletion. NA: Not available data.

Samples came from different Uruguayan departments (Montevideo, Soriano, Canelones, Florida, and Maldonado) and were collected from July to October 2020. Some patients were epidemiological related, particularly two with the double deletion variant that livein the same house (Table 1).

Phylogenetic analysis

The newly sequenced genomes from the N.7 lineage were added to the Uruguayan genomes retrieved from the GISAID (n = 497). This dataset included all the available good-quality N.7 sequences in the databases (n = 40). No additional N.7 sequences were available from other countries, including bordering Argentine and Brazil.

The N.7 lineage samples (n = 40) appeared in a single branch in the phylogenetic tree, agreeing with its lineage classification. Sequences of the N.7 lineage with deletions (Δ12 and Δ12+Δ68) fell in a separated branch from samples without deletions collected in the Uruguayan department of Treinta y Tres during June 2020 (Fig 2). This branch of N.7 genomes carrying deletions (n = 20) showed amino acid replacements in nsp6 (L37F) and N (R36Q).

Fig 2. Maximum-likelihood tree based on 497 SARS-CoV-2 sequences.

Fig 2

Details of the N.7 lineage composition are zoomed on the right.

The four genomes with the double deletion (Δ12+Δ68) were identical in the coding and non-coding sequences and had a characteristic amino acid replacement in nsp3 (T976I) lacking in other N.7 sequences.

Deletion effects: The emergence of a fused ORF

The known Δ12 (position: 27629‒27640) and the novel frameshifting Δ68 (position: 27683‒27750) occur at the end of the ORF7a (27394‒27759) (Fig 3, top). The Δ12 results in a 4-aa deletion (RSVS) in position 80‒83 of the 121-aa protein 7a. The Δ68 produces a 23-aa deletion after position 96 and a frameshifting that alters the stop codon and creates an in-frame fusion with the complete ORF7b (44 codons). The simultaneous occurrence of both deletions (Δ12+Δ68) joins the deleted ORF7a and ORF7b, leading to a new ORF7ab with 138 codons, including the ORF7b’s TAA stop codon (Fig 3, top). The first 94 residues of the putative protein ORF7ab (p7ab) belong to the p7a amino end and comprise the signal peptide and all the seven β-strands of the ectodomain Ig-fold; deletions remove most of the 20-residue TMD and the entire 5-residue cytoplasmic tail.

Fig 3.

Fig 3

Top: Nucleotide and amino acid sequences of the ORF 7a/7b in wildtype and Δ12+68 variants. Green bars show the deletion regions. Bottom: Primary structure of the 137-aa 7ab putative protein is compared with the 121-aa 7a protein of the wildtype variant. The different domains are annotated. Yellow arrows denote ß-strands.

The primary sequence of the p7b, consisting of both terminal regions and the central TMD is wholly embedded in the fused p7ab. Therefore, the fused putative protein has only one TMD from the p7b. Transmembrane domains were annotated based on reported data and predicted de novo for the ORF7a, ORF7b, and fused ORF7ab (Fig 3, bottom). The predicted TMD for both proteins has the same size (21 residues) and co-localizes with the reported TMD of 20 (p7a) and 22 residues (p7b).

Discussion

Deletions have played a significant role in the evolution of the Coronaviridae family. The large coronavirus RNA genome is prone to deletions and less affected by the loss of genetic material than other viruses. Among the possible reasons for this feature is the existence of accessory genes that exhibit high plasticity and resilience to undergo indel changes [6]. Remarkably, certain deletions could be so extensive that they eliminate entire accessory ORFs. For example, in Europe and Asia, two phylogenetically unrelated deletions completely removed the 7a, 7b, and 8 consecutive accessory ORFs [25,26]. These extreme deletions pose an interesting question about the role of accessory ORFs in virus viability and reinforce the relevance of deletions in increasing SARS-CoV-2 variability during expansion in the human population.

Some deletions in accessory genes may be related to coronavirus adaptation to the host cell environment. In SARS-CoV-1, a 45-nucleotide deletion in the ORF7b emerged following passage in Vero E6 cells, suggesting attenuation [27]. The same deletion occurs in other cultured strains [28,29]. A 382-nt deletion in ORF8 is associated with milder SARS-CoV-2 infection [30]. Three different deletions in ORF7a truncate the C-terminal half of the protein; two of them result in growth defects and failures to suppress the immune response, providing potential advantages toward the SARS-CoV-2 adaptation to humans [6,3133].

Here we detected single (Δ12) and double deletion (Δ12+68) variants of SARS-CoV-2 within the N.7 lineage (Fig 2). This lineage derived from the Brazilian B.1.1.33 lineage predominant in Brazil during the initial part of the pandemic [31]. In Uruguay, the N. 7 lineage emerged in June 2020 and was replaced during November and December 2020 by Gamma (P.1) and Zeta (P.2) strains derived from the B.1.1.28 lineage [32].

The phylogenetic relatedness (Fig 2), the timing of collection, and the local geographic distribution support that the N.7 lineage emerged in Uruguay during the early pandemic and likely emerged from the B.1.1.33 lineage that lacks the Δ12 variant. Furthermore, the ancestral N.7 lineage also lacked the Δ12 change, as supported by undeleted strains collected during June 2020 in the Treinta y Tres department bordering Brazil.

The topology of the phylogenetic tree implies that the Δ12 variant emerged early in the N.7 lineage, during its initial differentiation in Uruguay (the first case was reported in July 2020), and that the Δ12+68 variant emerged subsequently by the acquisition of a second deletion (Δ68) from a Δ12 individual. To our knowledge, this is the first case where the timing of occurrence of two consecutive deletions in an accessory gene is identified in SARS-CoV-2. It is relatively uncommon to reconstruct the evolutionary pathway of deletions. Indels in other regions, for instance, near the furin cleavage site in the S gene, frequently occur during coronavirus evolution [34]. However, the evolutionary path of these deletions is complex and difficult to determine by analyses of the underlying nucleotide sequence [8].

The Δ12+68 variant has the following characteristics making it unique for SARS-CoV-2: i) both deletions are restricted to a small region of the accessory ORF7a, ii) the successive occurrence of deletions creates a new ORF that retains key functional motifs of 7a and 7b proteins, iii) strains with both deletions are transmitted between individuals, and iv) individual deletions (Δ12 and Δ68) occur consecutively with a known order (the Δ12 occurred first while the Δ68 has a more recent origin).

Addetia et al. described two large and phylogenetically unrelated deletions (392 and 227 nucleotides long) that fuse accessory ORF7a with downstream ORFs [13]. The most extensive 392-nucleotide deletion creates a new ORF by connecting the N-terminus of ORF7a with ORF8. The 227-nucleotide deletion results in a new ORF by fusing the N-terminus of ORF7a with the entire ORF7b. The retained N-terminus of ORF7a in both deletions comprises only the signal peptide and two or three β-strands of the seven that form the β-sheets of the Ig-fold of ORF7a and lost the entire protein’s transmembrane and cytoplasmic domains. The Ig-fold in the luminal domain of the p7a is the putative functional interface, suggesting that variability in this region affects binding capability and is largely deleterious for the virus.

The Δ12+68 variant here described is the first having a fused ORF that keeps the N-terminus signal and Ig-like domain of p7a and the TMD of p7b (Fig 3, bottom). In vitro studies showed that the p7a ectodomain attaches to human monocytes, decreasing antigen-presenting ability and inducing overexpression of pro-inflammatory cytokines [35]. The TMD and cytoplasmic tail mediate the intracellular retention within the Golgi network, suggesting a potential role in viral assembly or budding events in the endoplasmic reticulum to Golgi intermediate compartment (ERGIC) [16,31]. The entire p7b, included in the fused p7ab, provides a TMD that may localize the protein in the Golgi compartment [14,36]. This TMD is supposed to assemble into stable multimers through a leucine zipper and interfere with cellular processes that involve the leucine-zipper association [37]. Functional studies would be needed to establish if the fused protein functions similarly to p7a and p7b in a single format.

A limitation of the study is the low number of samples due to Uruguay’s low prevalence that underpowers statistical analysis (i.e., frequency of cases and clinical outcome). Our analysis, therefore, represents a snapshot in time before the emergence of variants of concern that replaced the N.7 lineage. Sampling was also affected by the relatively reduced number of sampling sites (Table 1); individuals submitted for the report may not fully reflect the prevalence of infections in the overall population. Despite the above limitations, the deletion frequency in the sample (~25%) is significant in the context of the scenario in Uruguay during 2020, characterized by very few outbreaks and cases with epidemiological connections [38].

Although deletions frequently occur in accessory ORFs, the occurrence of two consecutive deletions suggests a functional synergy in viral infections and constitutes evidence that consecutive deletions also act in SARS-CoV-2 accessory genes. Fitness analysis of both deleted variants would be needed to establish whether the second deletion was neutral or had a selective advantage (i.e., increased transmission or milder symptoms) during certain stages of the SARS-CoV-2 infection. We examined the relationship between deletions and the patient’s symptomatology. Patients with deletions exhibit many clinical signs, and two patients with the single deletion (Δ12) died after treatment. All patients with double deletions recovered from the infection, three were asymptomatic, and one had only mild symptoms (Table 1). Although this association suggests a lesser clinical effect of deleted viruses, the results are inconclusive and should be corroborated in global studies with deletions occurring in different SARS-CoV-2 lineages.

Based on previous studies, some degree of indels variability was expected in coronavirus infecting human populations. Still, the current global scenario shows that the deletion potential in SARS-CoV-2 was initially sub-estimated and underdiagnosed before it gained full attention by the scientific community [7,13,25,26,39,40]. In this sense, our study constitutes an example of the impact of deletion in SARS-CoV-2 evolution and underscores the importance of indel variation in providing new insights into the genome variability and dynamics of accessory ORFs within SARS-CoV-2 lineages.

Supporting information

S1 Table. Acknowledgments table for sequences obtained from the GISAID’s EpiCoV™ database.

(PDF)

Acknowledgments

We want to thank all the authors who have kindly deposited and shared genome data on GISAID (S1 Table).

Data Availability

Yes - all data are fully available without restriction Viral sequences were deposited in the GenBank with the accession numbers: MZ555811 to MZ555814 and OK416091 to OK416099.

Funding Statement

This work was supported by the Facultad de Ciencias and Comisión Sectorial de Investigación Científica (CSIC) (Grant CSIC Equipamiento) Plataforma Genómica Facultad de Ciencias. Responsable YP. Fundación Manuel Pérez, UdelaR (Grant Fondo Manuel Pérez). Responsable RP, JA and CM. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Zhou P, Yang X-L, Wang X-G, Hu B, Zhang L, Zhang W, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020; 579: 270–3. doi: 10.1038/s41586-020-2012-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Narayanan K, Huang C, Makino S. SARS coronavirus accessory proteins. Virus Res. 2008; 133: 113–21. doi: 10.1016/j.virusres.2007.10.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li JY, Liao CH, Wang Q, Tan YJ, Luo R, Qiu Y, et al. The ORF6, ORF8 and nucleocapsid proteins of SARS-CoV-2 inhibit type I interferon signaling pathway. Virus Res. 2020; 286: 198074. doi: 10.1016/j.virusres.2020.198074 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stadler K, Masignani V, Eickmann M, Becker S, Abrignani S, Klenk H-D, et al. SARS-beginning to understand a new virus. Nat Rev Microbiol. 2003; 1: 209–18. doi: 10.1038/nrmicro775 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Wu A, Wang L, Zhou H-Y, Ji C-Y, Xia SZ, Cao Y, et al. One year of SARS-CoV-2 evolution. Cell Host Microbe. 2021; 145: 1–5. doi: 10.1016/j.chom.2021.02.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zinzula L. Lost in deletion: The enigmatic ORF8 protein of SARS-CoV-2. Biochem Biophys Res Commun. 2020; 54: 251–5. doi: 10.1016/j.bbrc.2020.10.045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Peacock TP, Penrice-Randal R, Hiscox JA, Barclay WS. SARS-CoV-2 one year on: evidence for ongoing viral adaptation. J Gen Virol. 2021; 102. doi: 10.1099/jgv.0.001584 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Garry RF, Andersen KG, Gallaher WR, Lam TT, Gangaparapu K, Latif AA, et al. Spike protein mutations in novel SARS-CoV-2 ‘ variants of concern ‘ commonly occur in or near indels. 2021; 789: 1–9. Available from: https://virological.org/t/spike-protein-mutations-in-novel-sars-cov-2-variants-of-concern-commonly-occur-in-or-near-indels/605. [Google Scholar]
  • 9.Grubaugh ND, Petrone ME, Holmes EC. We shouldn’t worry when a virus mutates during disease outbreaks. Nat Microbiol. 2020; 7–8. doi: 10.1038/s41564-020-0690-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.McCarthy KR, Rennick LJ, Nambulli S, Robinson-McCarthy LR, Bain WG, Haidar G, et al. Recurrent deletions in the SARS-CoV-2 spike glycoprotein drive antibody escape. Science. 2021; 371: 1139–42. doi: 10.1126/science.abf6950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Rennick LJ, Robinson-McCarthy LR, Nambulli S, Duprex WP, McCarthy KR. Deletion disrupts a conserved antibody epitope in a SARS-CoV-2 variant of concern. BioRxiv. 2021; 03.05.4341682021. 10.1101/2021.03.05.434168. [DOI] [Google Scholar]
  • 12.Michel CJ, Mayer C, Poch O, Thompson JD. Characterization of accessory genes in coronavirus genomes. bioRxiv. 2020; 1–13. doi: 10.1186/s12985-020-01402-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Addetia A, Xie H, Roychoudhury P, Shrestha L, Loprieno M, Huang M-L, et al. Identification of multiple large deletions in ORF7a resulting in in-frame gene fusions in clinical SARS-CoV-2 isolates. J Clin Virol. 2020; 129: 104523. doi: 10.1016/j.jcv.2020.104523 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Liu DX, Fung TS, Chong KK, Shukla A, Hilgenfeld R. Accessory proteins of SARS-CoV and other coronaviruses. Antiviral Res. 2014; 109: 97–109. doi: 10.1016/j.antiviral.2014.06.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Mohammed MEA. The percentages of SARS-CoV-2 protein similarity and identity with SARS-CoV and BatCoV RaTG13 proteins can be used as indicators of virus origin. J Proteins Proteom. 2021; 1–11. doi: 10.1007/s42485-021-00060-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Nelson CA, Pekosz A, Lee CA, Diamond MS, Fremont DH. Structure and Intracellular Targeting of the SARS-Coronavirus Orf7a Accessory Protein. Structure. 2005; 13: 75–85. doi: 10.1016/j.str.2004.10.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Panzera Y, Ramos N, Frabasile S, Calleros L, Marandino A, Tomás G, et al. A deletion in SARS-CoV-2 ORF7 identified in COVID-19 outbreak in Uruguay. Transbound Emerg Dis. 2021; 68: 3075–82. doi: 10.1111/tbed.14002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Corman VM, Landt O, Kaiser M, Molenkamp R, Meijer A, Chu DKW, et al. Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR. Eurosurveillance. 2020; 25(3): 2000045. doi: 10.2807/1560-7917.ES.2020.25.3.2000045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Singer J, Gifford R, Cotten M, Robertson D. CoV-GLUE: A Web Application for Tracking SARS-CoV-2 Genomic Variation. Preprints. 2020; 2020060225. doi: 10.20944/preprints202006.0225.v1 [DOI] [Google Scholar]
  • 20.Rambaut A, Holmes EC, O’Toole Á, Hill V, McCrone JT, Ruis C, et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol. 2020; 5: 1403–07. doi: 10.1038/s41564-020-0770-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.O’Toole Á, Scher E, Underwood A, Jackson B, Hill V, McCrone JT, et al. Assignment of Epidemiological Lineages in an Emerging Pandemic Using the Pangolin Tool. Virus Evol. 2021; 7(2): 1–9. doi: 10.1093/ve/veab064 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Shu Y, McCauley J. GISAID: Global initiative on sharing all influenza data–from vision to reality. Eurosurveillance. 2017; 22: 30494. doi: 10.2807/1560-7917.ES.2017.22.13.30494 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013; 30: 772–80. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Price MN, Dehal PS, Arkin AP, Liu DX, Fung TS, Chong KK, et al. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol. 2009; 26: 1641–50. doi: 10.1093/molbev/msp077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Tse H, Lung DC, Wong SC-Y, Ip K-F, Wu T-C, To KK-W, et al. Emergence of a Severe Acute Respiratory Syndrome Coronavirus 2 virus variant with novel genomic architecture in Hong Kong. Clin Infect Dis. 2021; 73: 1696–9. doi: 10.1093/cid/ciab198 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Mazur-Panasiuk N, Rabalski L, Gromowski T, Nowicki G, Kowalski M, Wydmanski W, et al. Expansion of a SARS-CoV-2 Delta variant with an 872 nt deletion encompassing ORF7a, ORF7b and ORF8, Poland, July to August 2021. Euro Surveill. 2021; 26: 1–6. doi: 10.2807/1560-7917.ES.2021.26.39.2100902 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Thiel V, Ivanov KA, Putics Á, Hertzig T, Schelle B, Bayer S, et al. Mechanisms and enzymes involved in SARS coronavirus genome expression. J Gen Virol. 2003; 84: 2305–15. doi: 10.1099/vir.0.19424-0 [DOI] [PubMed] [Google Scholar]
  • 28.Yan H, Xiao G, Zhang J, Hu Y, Yuan F, Cole DK, et al. SARS coronavirus induces apoptosis in Vero E6 cells. J Med Virol. 2004; 73: 323–31. doi: 10.1002/jmv.20094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Yeh S-H, Wang H-Y, Tsai C-Y, Kao C-L, Yang J-Y, Liu H-W, et al. Characterization of severe acute respiratory syndrome coronavirus genomes in Taiwan: molecular epidemiology and genome evolution. Proc Natl Acad Sci U S A. 2004; 101: 2542–7. doi: 10.1073/pnas.0307904100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Young BE, Fong S-W, Chan Y-H, Mak T-M, Ang LW, Anderson DE, et al. Effects of a major deletion in the SARS-CoV-2 genome on the severity of infection and the inflammatory response: an observational cohort study. Lancet. 2020; 396: 603–11. doi: 10.1016/S0140-6736(20)31757-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Nemudryi A, Nemudraia A, Wiegand T, Nichols J, Snyder DT, Hedges JF, et al. SARS-CoV-2 genomic surveillance identifies naturally occurring truncation of ORF7a that limits immune suppression. Cell Rep. 2021; 35: 109197. doi: 10.1016/j.celrep.2021.109197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Pyke AT, Nair N, van den Hurk AF, Burtonclay P, Nguyen S, Barcelon J, et al. Replication kinetics of B.1.351 and B.1.1.7 SARS-CoV-2 variants of concern including assessment of a B.1.1.7 mutant carrying a defective ORF7a gene. Viruses. 2021; 13: 1087. doi: 10.3390/v13061087 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Joonlasak K, Batty EM, Kochakarn T, Panthan B, Kümpornsin K, Jiaranai P, et al. Genomic surveillance of SARS-CoV-2 in Thailand reveals mixed imported populations, a local lineage expansion and a virus with truncated ORF7a. Virus Res. 2021; 292: 198233. doi: 10.1016/j.virusres.2020.198233 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Garry RF, Gallaher WR. Naturally occurring indels in multiple coronavirus spikes. 2020. Available from: https://virological.org/t/naturally-occurring-indels-in-multiple-coronavirus-spikes/560. [Google Scholar]
  • 35.Zhou Z, Huang C, Zhou Z, Huang Z, Su L, Kang S, et al. Structural insight reveals SARS-CoV-2 ORF7a as an immunomodulating factor for human CD14+ monocytes. iScience. 2021; 24: 102187. doi: 10.1016/j.isci.2021.102187 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Schaecher SR, Diamond MS, Pekosz A. The transmembrane domain of the severe acute respiratory syndrome coronavirus ORF7b protein is necessary and sufficient for its retention in the Golgi complex. J Virol. 2008; 82: 9477–91. doi: 10.1128/JVI.00784-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Fogeron M, Montserret R, Zehnder J, Nguyen M, Dujardin M, Cole L, et al. SARS-CoV-2 ORF7b: is a bat virus protein homologue a major cause of COVID-19 symptoms? bioRxiv. 2021; 02.05.428650. 10.1101/2021.02.05.428650. [DOI] [Google Scholar]
  • 38.Panzera Y, Ramos N, Calleros L, Marandino A, Tomás G, Techera C, et al. Transmission cluster of COVID-19 cases from Uruguay: emergence and spreading of a novel SARS-CoV-2 ORF6 deletion. Mem Inst Oswaldo Cruz. 2021; 116. doi: 10.1590/0074-02760210275 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Islam MR, Hoque MN, Rahman MS, Alam ASMRU, Akther M, Puspo JA, et al. Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity. Sci Rep. 2020; 10: 14004. doi: 10.1038/s41598-020-70812-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Su YCF, Anderson DE, Young BE, Linster M, Zhu F, Jayakumar J, et al. Discovery and genomic characterization of a 382-nucleotide deletion in ORF7b and ORF8 during the early evolution of SARS-CoV-2. MBio. 2020; 11(4): e01610–20. doi: 10.1128/mBio.01610-20 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Herman Tse

7 Jan 2022

PONE-D-21-32425Consecutive deletions in a unique Uruguayan SARS-CoV-2 lineage evidence the genetic variability potential of accessory genesPLOS ONE

Dear Dr. Perez,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

 As pointed out by the reviewer, while the technical analysis and writing are done very well, there is an overabundance of speculative discussion in the manuscript. It would take much more data and analysis to untangle the combined effect of viral fitness, population genetics, and public health measures on the spread and prevalence of particular SARS-CoV-2 lineages in this pandemic. We appreciate that these hypotheses were presented sensibly, but politely request that the discussions from line 280 onwards be scaled back to strike a more balanced tone. Please also address the other points raised by the reviewer below.

Please submit your revised manuscript by Feb 21 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Herman Tse

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In this manuscript, the authors present evidence of circulation of strains with 2 deletions within the ORF7a gene in 2020. They speculate that the functional consequence of these deletions is the expression of a putative ORF7ab gene product. ORF7 deletions in SARS-CoV-2 (even to the extent of complete absence) have been reported from various parts of the world. The significance of such deletions on SARS-CoV-2 transmissibility or clinical manifestations is not known. The methodology of this study is clear and the results are convincing. The epidemiological and clinical significance of these ORF7-del variants is uncertain. But, I think there is considerable speculation in parts of the manuscript that needs to be revised.

Specific comments below:

- Please cite other papers describing ORF7b deletions: Tse H et al, Emergence of a Severe Acute Respiratory Syndrome Coronavirus 2 virus variant with novel genomic architecture in Hong Kong, Clin Infect Dis; Mazur-Panasiuk N et al, Expansion of a SARS-CoV-2 Delta variant with an 872 nt deletion encompassing ORF7a, ORF7b and ORF8, Poland, July to August 2021, Euro Surveillance.

- How many samples were analyzed in total to obtain the few with 12-nt deletion in table 1? Unable to gauge how common this variant is.

- Any epidemiological link between patients listed in table 1?

- Were any of the patients listed in table 1 immunocompromised?

- "This potential functionality could explain why the sizeable Δ12+68 change could be transmitted at least between four individuals." this seems to imply that these four patients transmitted this variant between each other, which I don't think is the case? Please clarify.

- The delta-68 deletion could also have occurred within 4 hosts independently (intra-host evolution) without being transmitted? So, if there is no epidemiological link between these 4 cases, the authors should not speculate on the transmissibility of this variant.

- "The topology of the phylogenetic tree implies that the Δ12 variant emerged early in the N.7 lineage during its initial differentiation in Uruguay (the first case was reported in July 2020); the Δ12+68 variant emerged subsequently by the acquisition of a second deletion (Δ68) from a Δ12 individual." I agree this is probably most likely, but any possibility of under-sequenced communities or provinces where an earlier emergence of the Δ68 might have been missed?

- I find the discussion on structure and function of the putative ORF7ab gene product to be very speculative (line 280 onwards). Should be edited and condensed heavily to avoid speculations (such as "the Δ12 mutation in ORF7a could be slightly deleterious and temporarily tolerated in a low-competitive population; after acquiring the additional deletion (Δ68)"). Instead, just state the type of experiments that could be performed to determine effects of deletions observed.

- Should add a section on limitations of the study.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Feb 17;17(2):e0263563. doi: 10.1371/journal.pone.0263563.r002

Author response to Decision Letter 0


20 Jan 2022

The following comments were uploaded as "Response to Reviewers" file.

Reviewer #1: In this manuscript, the authors present evidence of circulation of strains with 2 deletions within the ORF7a gene in 2020. They speculate that the functional consequence of these deletions is the expression of a putative ORF7ab gene product. ORF7 deletions in SARS-CoV-2 (even to the extent of complete absence) have been reported from various parts of the world. The significance of such deletions on SARS-CoV-2 transmissibility or clinical manifestations is not known. The methodology of this study is clear and the results are convincing. The epidemiological and clinical significance of these ORF7-del variants is uncertain. But, I think there is considerable speculation in parts of the manuscript that needs to be revised.

Specific comments below:

Please cite other papers describing ORF7b deletions: Tse H et al, Emergence of a Severe Acute Respiratory Syndrome Coronavirus 2 virus variant with novel genomic architecture in Hong Kong, Clin Infect Dis; Mazur-Panasiuk N et al, Expansion of a SARS-CoV-2 Delta variant with an 872 nt deletion encompassing ORF7a, ORF7b and ORF8, Poland, July to August 2021, Euro Surveillance.

Answer: We thank the reviewer for pointing out these two important papers; both references were added to the manuscript discussion.

- How many samples were analyzed in total to obtain the few with 12-nt deletion in table 1? Unable to gauge how common this variant is.

Answer: We screened 51 samples that were available in our Uruguayan biobank of positive samples (this information was added to the manuscript). Therefore, the frequency of the double deletion is 13/51 (approximately 25%). However, we recognize that this number could be biased because we do not have access to all the samples from Uruguay. Therefore, we added a sentence about frequency in the manuscript and clarified this possible bias in the "limitation of the study" statement.

- Any epidemiological link between patients listed in table 1?

Answer: In the study period, the epidemic in Uruguay was characterized by different outbreaks, with an average of about 20 new COVID-19 cases per day. Because of the reduced number, sixty-seven percent of patients had a contact record with a confirmed case in the national territory, and thirteen percent had known contact with a traveler (https://www.gub.uy/ministerio-salud-publica/comunicacion/noticias/informe-epidemiologico-covid-19-del-30-noviembre-2020). The genetic homogeneity of the samples also supports a few transmission clusters.

We obtained and added new data to Table 1, evidencing the epidemiological link among some patients. Regretfully, this epidemiological data were not available for all patients. Therefore, we added a sentence clarifying this point and included the bias in the limitation of the study section.

- Were any of the patients listed in table 1 immunocompromised?

Answer: We could obtain only detailed clinical information of the two patients that required admission to the intensive care unit. Both have preexisting conditions, one had a cerebrovascular stroke, and the other had cancer. Unfortunately, these patients died in the health center. These data were included in Table 1 and the manuscript's text. The data of the other patients did not indicate evidence of immunosuppression.

- "This potential functionality could explain why the sizeable Δ12+68 change could be transmitted at least between four individuals." this seems to imply that these four patients transmitted this variant between each other, which I don't think is the case? Please clarify.

Answer: Two of the four Δ12+68 individuals are from the same family (new data added to Table 1). The four individuals were likely part of one transmission cluster, but the referee is right that we cannot establish the exact connection among them. Therefore, we removed this sentence from the discussion.

- The delta-68 deletion could also have occurred within 4 hosts independently (intra-host evolution) without being transmitted? So, if there is no epidemiological link between these 4 cases, the authors should not speculate on the transmissibility of this variant.

Answer: We obtained the information that two cases with the Δ12+68 variant were epidemiologically closely related (new data in Table 1). These patients were part of one of the few transmission clusters in Uruguay during that period; we added this information in the results.

- "The topology of the phylogenetic tree implies that the Δ12 variant emerged early in the N.7 lineage during its initial differentiation in Uruguay (the first case was reported in July 2020); the Δ12+68 variant emerged subsequently by the acquisition of a second deletion (Δ68) from a Δ12 individual." I agree this is probably most likely, but any possibility of under-sequenced communities or provinces where an earlier emergence of the Δ68 might have been missed?

Answer: The emergence of Δ12 before Δ68 is supported by: i) the few cases occurring in the period or analysis, ii) the fact that Δ12 appeared two months earlier, and iii) the double-deleted samples were genetically homogenous and epidemiologically related (at least two of the four cases). Furthermore, considering that all N7 we analyzed have the Δ12 deletion, the later emergence of Δ68 seems more parsimonious. We agree with the reviewer that there is a chance of a different scenario, but our data does not support it.

- I find the discussion on structure and function of the putative ORF7ab gene product to be very speculative (line 280 onwards). Should be edited and condensed heavily to avoid speculations (such as "the Δ12 mutation in ORF7a could be slightly deleterious and temporarily tolerated in a low-competitive population; after acquiring the additional deletion (Δ68)"). Instead, just state the type of experiments that could be performed to determine effects of deletions observed.

Answer: We reduced and moderated the tone of the discussion from line 280 onwards. We highlighted in yellow the paragraphs that were modified in the discussion section. Thanks for the suggestion; we believe the discussion is now clearer and concise.

- Should add a section on limitations of the study.

Answer: We added a statement in the discussion commenting on the limited number of samples due to Uruguay's low prevalence that underpowers statistical analysis (i.e., frequency of cases). Our analysis, therefore, represents a snapshot in time before the emergence of variants of concern that replace the N.7 lineage. Another limitation was the method of sampling. Individuals submitted for the report may not fully reflect the prevalence of infections in the overall population. Nevertheless, the data obtained were from different geographic origins, and the sample number can be considered relatively representative. Finally, another potential limitation of the study may be the lack of functional studies to conclude the virus fitness. In Uruguay, we limited our analysis to genetic variability because we do not have facilities to culture the virus but recognize the need for these studies. Despite the above limitations, we believe that our research results contribute to understanding indels origin and prevalence by providing new insights into the genome variability and dynamics of accessory ORFs within SARS-CoV-2 lineages.

Editor comments

As pointed out by the reviewer, while the technical analysis and writing are done very well, there is an overabundance of speculative discussion in the manuscript. It would take much more data and analysis to untangle the combined effect of viral fitness, population genetics, and public health measures on the spread and prevalence of particular SARS-CoV-2 lineages in this pandemic. We appreciate that these hypotheses were presented sensibly, but politely request that the discussions from line 280 onwards be scaled back to strike a more balanced tone. Please also address the other points raised by the reviewer below.

Answer: We thank the editor for the positive comments. Following the suggestions, we reduced the most speculative part of the discussion and moderated some sentences' tone (highlighted in yellow).

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article's retracted status in the References list and also include a citation and full reference for the retraction notice.

Answer: References were checked and modified accordingly.

Additional changes

Typos and spelling mistakes were corrected throughout the manuscript. References suggested by the reviewers were included, and figures and tables were checked

Decision Letter 1

Herman Tse

24 Jan 2022

Consecutive deletions in a unique Uruguayan SARS-CoV-2 lineage evidence the genetic variability potential of accessory genes

PONE-D-21-32425R1

Dear Dr. Perez,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Herman Tse

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Herman Tse

8 Feb 2022

PONE-D-21-32425R1

Consecutive deletions in a unique Uruguayan SARS-CoV-2 lineage evidence the genetic variability potential of accessory genes

Dear Dr. Pérez:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Herman Tse

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Acknowledgments table for sequences obtained from the GISAID’s EpiCoV™ database.

    (PDF)

    Data Availability Statement

    Yes - all data are fully available without restriction Viral sequences were deposited in the GenBank with the accession numbers: MZ555811 to MZ555814 and OK416091 to OK416099.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES