Skip to main content
PLOS One logoLink to PLOS One
. 2024 May 10;19(5):e0303176. doi: 10.1371/journal.pone.0303176

SARS-CoV-2 clade dynamics and their associations with hospitalisations during the first two years of the COVID-19 pandemic

Taavi Päll 1,#, Aare Abroi 2,#, Radko Avi 1, Heiki Niglas 3, Arina Shablinskaja 1, Merit Pauskar 1, Ene-Ly Jõgeda 1, Hiie Soeorg 1, Eveli Kallas 1, Andrio Lahesaare 4, Kai Truusalu 1, Dagmar Hoidmets 1, Olga Sadikova 3, Kaspar Ratnik 4, Hanna Sepp 3, Liidia Dotsenko 3, Jevgenia Epštein 3, Heleene Suija 3, Katrin Kaarna 5,6, Steven Smit 7, Lili Milani 7, Mait Metspalu 7, Ott Eric Oopkaup 8, Ivar Koppel 8, Erik Jaaniso 9, Ivan Kuzmin 8, Heleri Inno 8, Uku Raudvere 8, Mari-Anne Härma 3, Paul Naaber 1,4, Tuuli Reisberg 7, Hedi Peterson 9, Ulvi Gerst Talas 8, Irja Lutsar 1, Kristi Huik 1,*
Editor: Nihad A M Al-Rashedi10
PMCID: PMC11086870  PMID: 38728305

Abstract

Background

The COVID-19 pandemic was characterised by rapid waves of disease, carried by the emergence of new and more infectious SARS-CoV-2 virus variants. How the pandemic unfolded in various locations during its first two years has yet to be sufficiently covered. To this end, here we are looking at the circulating SARS-CoV-2 variants, their diversity, and hospitalisation rates in Estonia in the period from March 2000 to March 2022.

Methods

We sequenced a total of 27,550 SARS-CoV-2 samples in Estonia between March 2020 and March 2022. High-quality sequences were genotyped and assigned to Nextstrain clades and Pango lineages. We used regression analysis to determine the dynamics of lineage diversity and the probability of clade-specific hospitalisation stratified by age and sex.

Results

We successfully sequenced a total of 25,375 SARS-CoV-2 genomes (or 92%), identifying 19 Nextstrain clades and 199 Pango lineages. In 2020 the most prevalent clades were 20B and 20A. The various subsequent waves of infection were driven by 20I (Alpha), 21J (Delta) and Omicron clades 21K and 21L. Lineage diversity via the Shannon index was at its highest during the Delta wave. About 3% of sequenced SARS-CoV-2 samples came from hospitalised individuals. Hospitalisation increased markedly with age in the over-forties, and was negligible in the under-forties. Vaccination decreased the odds of hospitalisation in over-forties. The effect of vaccination on hospitalisation rates was strongly dependent upon age but was clade-independent. People who were infected with Omicron clades had a lower hospitalisation likelihood in age groups of forty and over than was the case with pre-Omicron clades regardless of vaccination status.

Conclusions

COVID-19 disease waves in Estonia were driven by the Alpha, Delta, and Omicron clades. Omicron clades were associated with a substantially lower hospitalisation probability than pre-Omicron clades. The protective effect of vaccination in reducing hospitalisation likelihood was independent of the involved clade.

Introduction

As was largely the case in other countries, Estonia experienced multiple COVID-19 waves following the emergence of the pandemic. The first wave to be caused by the SARS-CoV-2 virus occurred between February and June 2020, with a peak hospitalisation of around twelve COVID-19 patients per 100,000 people per day in April 2020, and this was accompanied by a national lockdown which lasted for nine weeks. The next waves were driven by the emergence of Alpha, Delta, and Omicron variants of concern (VOC) [15], with three peaks of hospitalisation of between fifty and sixty COVID-19 patients per 100,000 people per day in March 2021, November 2021, and February 2022 respectively (https://www.terviseamet.ee/en/coronavirus-dataset). A partial lockdown was established between March and May 2021 in response to overwhelming hospitalisation levels. The compulsory use of face masks was implemented between August 2021 and April 2022, until all restrictions were lifted in June 2022. Vaccination against SARS-CoV-2 has been available for essential healthcare and social services workers and risk groups since 27 December 2020, followed by those over the age of forty in May 2021, and the 12–15 year-old group in June 2021. VOCs have been associated with COVID-19 disease severity and transmissibility. The latter increased between Alpha, Delta, and Omicron [6], but Omicron was shown to cause less severe disease [7]. There were two major peaks of hospitalisation, in Estonia in October-November 2021 and February-March 2022 respectively. Intensive care unit admissions peaked in Estonia in October-November 2021 (https://www.terviseamet.ee/en/coronavirus-dataset), suggesting that SARS-CoV-2 virus variants which were circulating in Estonia during those waves could be associated with more severe disease outcomes. We can report the results here from the SARS-CoV-2 whole genome sequencing study which was carried out in Estonia between March 2020 and March 2022. We undertook this study in order to be able to understand how distinct SARS-CoV-2 clades influenced the severity of COVID-19 and hospitalisation, and which other factors—including immunisation against SARS-CoV-2—served to contribute to this process.

Methods

Study design

The study used SARS-CoV-2 positive PCR test sample leftovers for the purpose of genotyping. The samples were collected and stored by the Estonian Health Board, Synlab Eesti OÜ, and hospitals, at dates between 13 March 2020 and 31 March 2022 as part of the national SARS-CoV-2 PCR testing process. To be able to select samples for genotyping purposes, various sampling strategies were applied according to public health needs in Estonia. During the first wave in 2020, samples were sequenced from the first cases and local SARS-CoV-2 outbreaks. After the first wave, sequencing was carried out by using randomly-selected samples (median 336 samples per week). In addition, targeted samples were occasionally sequenced from hospitalised individuals, regardless of the reason for hospitalisation, and from those who had a history of international travel in the 14 days prior to their producing a positive PCR test for SARS-CoV-2 (Fig 1). Sample genotyping data was last accessed on 11 January 2023.

Fig 1. The workflow for the study population.

Fig 1

Epidemiological data availability means that, at least, the date of sampling and age or sex are present. The use of ‘ECDC’ refers to the European Centre for Disease Prevention and Control in Sweden.

SARS-CoV-2 sequencing

During the study period two different protocols were used for sequencing. Between March 2020 and January 2021, RNA was reverse-transcribed and amplified in 2.5kb products [8]. Pooled amplicons of each sample were used to prepare NGS libraries with Nextera XT and were clustered with MiSeq Reagent Kit v2 (500-cycles) on 2 x 150-cycle paired-end runs (Illumina Inc, San Diego, CA, USA). Since February 2021, an Illumina COVIDSeq Test Kit Artic v3 or v4/v4.1 (Illumina Inc, San Diego, CA, USA) was used for reverse transcribing, followed by amplification and sequencing by the Illumina NextSeq500 using paired-end 75bp or 150bp reads. A proportion of samples were sequenced by Eurofins Genomics Germany GmbH, brokered by the ECDC. Further details of the sequencing are described in the ‘Methods’ section in the S1 Appendix. Raw sequence data for analysed samples were submitted to the European Nucleotide Archive (ENA) using the ENA upload tool for Galaxy (https://github.com/usegalaxy-eu/ena-upload-cli). Raw sequences are deposited under ENA Project accessions as shown in Table 2 in the S2 Appendix.

Data collection

Epidemiological and clinical data were retrieved from the Estonian Health Board (https://www.terviseamet.ee/en) and the Estonian Health and Welfare Information Systems Centre (https://www.tehik.ee/en). The following data were collected: sampling date, district of origin, sex, age, possible source of infection, history of visited countries within the 14 days prior to diagnosis, a history of SARS-CoV-2 vaccination, and a history of hospitalisation due to COVID-19 (duration, stay in an intensive care unit, the need for mechanical ventilation, and any reason for discharge). Epidemiological and clinical data was last accessed on 11 January 2023. For this study, the authors had no access to any information which could be used to identify individual participants during or after data collection.

Statistical analysis

All data analyses were conducted using R v4.2.1 (2022-06-23). The study population baseline characteristics table was generated by using the gtsummary R package v1.6.3 [9]. Bayesian modelling was carried out using the R libraries, rstan v2.21.7 [10] and brms v2.18.0 [11]. Weakly informative priors were used to fit the models, and a minimum of 2,000 iterations, including warm-up iterations which amounted to half of all iterations, and three to four chains were used to fit the models. The work of extracting, summarising, and visualising draws from brms models was carried out with the tidybayes v3.0.2 [12] and emmeans v1.8.1–1 [13] R packages. The Shannon index (H) was calculated by using the vegan R package v2.6–2 [14] as H = -∑[(pi) * log(pi)], where pi is the proportion of the ith species in an entire community. In terms of diversity analyses, the species was defined as a distinct Pango lineage. The evenness index (J) was calculated as J = H / log(S), where ‘H’ is the Shannon index and ‘S’ is the observed number of Pango lineages. Raw data manipulations, including importation, transformation, and summaries, were generated with the tidyverse R package v1.3.1 [15]. Plots were prepared using the ggplot2 R package v3.3.6 [16].

Ethics statement

The study was approved by the Research Ethics Committee at the University of Tartu [approvals 304/T-1, 324/T-1]. Informed consent was waived according to § 6 of the national ‘Personal Data Protection Act’.

Results

We obtained a total of 25,375 (or 92%) complete or near-complete SARS-CoV-2 genomes, representing approximately 7% of the PCR-confirmed SARS-CoV-2 infections in Estonia during the study period (Fig 1). Our dataset covers more than 5% of SARS-CoV-2 PCR-confirmed tests for 72% of the weeks during the study period (Fig 2).

Fig 2. Testing and sequencing SARS-CoV-2 in Estonia.

Fig 2

The bars show the number of tests which were carried out (left axis) and the line denotes the percentage of positive samples sequenced (right axis). Vertical lines denote the duration of waves, defined by the prevalence of Nextstrain clade(s). The wave start and end were defined as a week in which the lower or upper bound respectively of the 95% confidence interval of clade prevalence, as obtained through the Clopper-Pearson method, crossed the 50% threshold. SARS-CoV-2 test results were downloaded from the Estonian COVID-19 open-data portal (https://opendata.digilugu.ee; last accessed 5 March 2024).

Study population

Table 1 presents the characteristics of the study population as was stratified by the VOC waves. Overall, our study population’s sex ratio and age group distribution match up well with the population of SARS-CoV-2 RT-PCR-positive people in Estonia. Binomial proportion analysis showed that the study population consisted of 1.6–2.8 percentage points more females than males while, in the target population involving SARS-CoV-2 RT-PCR-positive people in Estonia, females were even more prevalent than males (4.1–4.3 percentage points). The median age of individuals (37 years of age) was similar both in the study and in the target population. Age groups between twenty and sixty years were over-represented relative to their prevalence in the entire Estonian population (S1A Fig). Most people who presented with COVID-19 symptoms at the time of testing (96%) and about two-thirds of the people overall (66%) were infected in Estonia. About 3% of SARS-CoV-2 RT-PCR-positive people were hospitalised. During the study period the prevalence of people in the below-twenty year-old age group increased, while the prevalence of people aged between 60–79 decreased (both trends have a posterior probability of >0.999). No substantial changes were observed in the prevalence of other age groups (S1B Fig). The sex ratio showed a substantial decrease in the proportion of males in the under-twenty group and the 20–39 age range (with a posterior probability of 0.995 and 0.999 respectively), with no changes in age groups above those ages (see S2 Fig).

Table 1. The characteristics of the study population.

Total SARS-CoV-2 positive samples SARS-CoV-2 positive samples sequenced
Variable N = 363,039 N Overall, N = 25,375 Alpha, N = 5,842 Delta, N = 10,793 Omicron, N = 6,851 Other, N = 1,806 Other VOC, N = 83
Sex, n (%)   24,344            
 Female 194,068 (54)   12,712 (52) 2,942 (51) 5,581 (52) 3,384 (54) 771 (51) 34 (41)
 Male 168,660 (46)   11,632 (48) 2,860 (49) 5,133 (48) 2,849 (46) 742 (49) 48 (59)
 Unknown 311   1,031 40 79 618 293 1
Age, median (IQR) 37 [20, 52] 24,567 37 (24, 52) 40 (26, 55) 36 (23, 50) 37 (23, 50) 42 (27, 57) 37 (28, 49)
 Unknown 2,240   808 23 64 432 289 0
Age, n (%)   24,567            
 0–19 88,940 (25)   4,615 (19) 930 (16) 2,205 (21) 1,249 (19) 221 (15) 10 (12)
 20–39 112,361 (31)   8,709 (35) 1,966 (34) 3,874 (36) 2,349 (37) 487 (32) 33 (40)
 40–59 102,343 (28)   7,526 (31) 1,887 (32) 3,201 (30) 1,922 (30) 481 (32) 35 (42)
 60–79 46,138 (13)   3,090 (13) 882 (15) 1,202 (11) 725 (11) 276 (18) 5 (6)
 >80 11,017 (3)   627 (3) 154 (3) 247 (2) 174 (3) 52 (3) 0 (0)
 Unknown 2,240   808 23 64 432 289 0
Possible place of contracting the infection, n (%)   14,341            
 Abroad/related to travelling     4,912 (34) 715 (21) 2,491 (32) 1,446 (60) 227 (31) 33 (55)
 Domestic     9,429 (66) 2,692 (79) 5,231 (68) 974 (40) 505 (69) 27 (45)
 Unknown     11,034 2,435 3,071 4,431 1,074 23
Presenting with symptoms, n (%)   14,542            
 No     632 (4) 160 (6) 339 (4) 82 (4) 50 (8) 1 (6)
 Yes     13,910 (96) 2,340 (94) 9,026 (96) 1,980 (96) 548 (92) 16 (94)
 Unknown     10,833 3,342 1,428 4,789 1,208 66
Covid Cq, mean (SD)   4,135 16 (4) NA (NA) 16 (5) 16 (4) NA (NA) NA (NA)
 Unknown     21,240 5,842 10,774 2,735 1,806 83
Average Cq, mean (SD)   17,843 17 (5) 18 (6) 17 (5) 18 (5) 20 (6) 19 (6)
 Unknown     7,532 895 219 5,431 978 9
Vaccination, n (%)   24,233            
 Boosted     1,065 (4) 0 (0) 70 (1) 995 (16) 0 (0) 0 (0)
 Fully vaccinated     6,489 (27) 222 (4) 3,684 (34) 2,563 (41) 20 (1) 0 (0)
 Unvaccinated     16,679 (69) 5,568 (96) 6,959 (65) 2,692 (43) 1,379 (99) 81 (100)
 Unknown     1,142 52 80 601 407 2
Hospitalisation, n (%)   24,233            
 No     23,498 (97) 5,481 (95) 10,413 (97) 6,214 (99) 1,309 (94) 81 (100)
 Yes     735 (3) 309 (5) 300 (3) 36 (1) 90 (6) 0 (0)
 Unknown     1,142 52 80 601 407 2
Days in COVID unit, median (IQR)   723 7 (5, 12) 8 (5, 12) 7 (5, 11) 4 (2, 10) 10 (6, 12) NA (NA, NA)
 Unknown     24,652 5,542 10,496 6,815 1,716 83
Days in intensive care, median (IQR)   67 9 (5, 16) 8 (3, 14) 9 (7, 15) 1 (1, 1) 12 (6, 21) NA (NA, NA)
 Unknown     25,308 5,816 10,761 6,850 1,798 83
Days on ventilation, median (IQR)   47 9 (5, 15) 11 (5, 20) 9 (5, 14) NA (NA, NA) 8 (6, 12) NA (NA, NA)
 Unknown     25,328 5,827 10,768 6,851 1,799 83
Reason of discharge from hospital, n (%)   735            
 Death     78 (11) 44 (14) 23 (8) 2 (6) 9 (10)  
 Recovery     470 (64) 199 (64) 196 (65) 28 (78) 47 (52)  
 Transfer to non-COVID unit     184 (25) 64 (21) 80 (27) 6 (17) 34 (38)  
 Unknown     3 (0) 2 (1) 1 (0) 0 (0) 0 (0)  
Places of domestic infections, n (%)   9,429            
 Educational or children’s facility     721 (8) 112 (4) 467 (9) 120 (12) 21 (4) 1 (4)
 Home/Family     5,342 (57) 1,568 (58) 2,905 (56) 614 (63) 236 (47) 19 (70)
 Other     1,329 (14) 304 (11) 815 (16) 98 (10) 111 (22) 1 (4)
 Visiting acquaintances     650 (7) 152 (6) 418 (8) 74 (8) 6 (1) 0 (0)
 Workplace     1,387 (15) 556 (21) 626 (12) 68 (7) 131 (26) 6 (22)

Displayed in the table are the epidemiological and clinical characteristics of populations which have been infected with SARS-CoV-2 VOCs in Estonia between March 2020 and March 2022. Other VOCs consist of Beta and Gamma VOCs. See S1 Table for lineages which belong to strata. IQR, interquartile range; NA, not applicable; SD, standard deviation. The dimension of values given in parentheses is shown in the ‘Variable’ column.

The prevalence of clades and lineages during the VOC waves

Overall, 19 Nextstrain clades and 199 Pango lineages were identified during the study period. Those variants which belonged to the Delta (43%), Omicron (27%), and Alpha (23%) VOCs were the most highly-abundant of all sequenced SARS-CoV-2 genomes (Table 1 and S1 Table). During the pre-VOC period in 2020, the two most prevalent clades were 20B (51%), and 20A (42%) (Fig 3A). The Alpha VOC (20I) was first identified in Estonia on 31 December 2020 in a sample which had been collected from an individual with travel history. Additional cases of the Alpha VOC were identified in January 2021, followed by the local spread of the variant which eventually led to the Alpha wave, from W9 to W22 in 2021. Clades which belonged to the Delta VOC emerged thereafter and out-competed the Alpha VOC by W26 in 2021 (Fig 3A and 3B). During the Delta wave we detected three clades (21A, 21I, and 21J), with 21J emerging as dominant. In contrast to the Alpha and Delta waves, the Omicron VOC resulted in two consecutive waves, first with 21K (BA.1 and its sub-lineages) from W52 in 2021 to W5 in 2022, and then with 21L (BA.2 and its sub-lineages), starting in W7 in 2022 (Fig 3A and 3B). We detected a total of twenty-seven different lineages in 2020, with the three most highly-prevalent lineages being B.1.1 (28%), B.1.1.10 (18%), and B.1.258 (13%). The Alpha wave in 2021 was characterised by the dominance of the B.1.1.7 lineage (99%) whereas, during the Delta wave, we detected a total of eighty-two different Delta VOC lineages and sub-lineages, with AY.122 being predominant (43%), followed by AY.100 (10%) and AY.43 (7.3%). During the two Omicron waves up to 31 March 2022, we detected twenty-eight BA.1 and 16 BA.2 sub-lineages (see S1 Table).

Fig 3. Waves of SARS-CoV-2 in Estonia.

Fig 3

(A) the prevalence of Nextstrain clades; (B) the prevalence of VOCs. Full-length sequencing was not applied to samples collected between W47 to W52 in 2020.

The association between travel-related cases and VOC

We found that the proportion of travel-related cases was about 20% in populations which had become infected with the Alpha-, Delta VOC, or other variants, and about 40–50% in those who had become infected with Omicron or Beta VOC (S3 Fig). These cases were associated with 88 different countries. The top five countries into which VOCs had mainly been imported included the neighbouring countries around Estonia (Finland, Sweden, and Russia), and popular travel destinations (Egypt and Great Britain). The Beta VOC import was related to a very limited set of countries and did not result in a wave in Estonia. Compared to the Beta, the Omicron VOC displayed a wider geography, one which was similar to that of the Alpha and Delta VOCs (see S3C Fig).

Higher diversity amongst travel-related cases

We identified a total of 165 different Pango lineages from the period for which we had sample origin info (starting from 7 Feb 2021). A total of 57 (or 35%) of those were unique to individuals who reported that they had travelled compared to 23 (13%) lineages which were uniquely assignable to domestic infections, while 86 (52%) lineages were present in both groups. Out of those 86 lineages which were found in both groups, a total of 51 (59%) were first identified in individuals who had reported that they had travelled. The Shannon diversity index of Pango lineages decreased steeply during the Alpha and Omicron waves, something which was in contrast with the Delta wave in which diversity rebounded quickly after the Alpha wave and remained at a relatively higher level until the end of the wave (Fig 4A). Diversity displayed small peaks at the beginning of waves and was higher amongst travel-related cases when compared to domestic cases in about 68% of weeks during the Alpha, Delta, and Omicron waves (Fig 4B). The second half of the Delta wave was characterised by higher levels of diversity in domestic cases. The observed number of lineages was higher during the Delta wave and the first Omicron wave (21K, BA.1 and its sub-lineages) when compared to the Alpha wave and the second Omicron wave both in travel-related and domestic cases (21L, BA.2 and its sub-lineages) (S5A Fig). For most of the weeks under consideration, the number of unique lineages was higher amongst travel-related cases (S5B Fig). The weekly number of unique lineages in travel-related cases displayed a substantial peak relative to domestically circulating lineages at the beginning of the first Omicron wave (S5A and S5B Fig). The evenness index sharply decreased during the Alpha wave when 99% of cases belonged to the B.1.1.7 lineage (see S1 Table), but this rebounded before the Delta wave, and remained relatively stable afterwards with no single lineage becoming prevalent (S5C Fig). The evenness index was higher in imported cases in about 66% of overall weeks during the Alpha, Delta, and Omicron waves (S5D Fig).

Fig 4. The diversity of SARS-CoV-2 lineages in domestic and travel-related cases in Estonia.

Fig 4

(A) Shannon diversity index. Points denote individual weekly observations. The line denotes the autoregressive model which has been fitted to the data, and the shaded ribbon denotes a 95% credible interval, N = 14,341; (B) the effect-size of the Shannon index, imported cases compared to domestic cases. The line denotes the effect size as derived from the model fit which is shown in Panel A, and the shaded ribbon denotes a 95% credible interval. The model summary is presented in Table 3 in the S2 Appendix.

The association of RT-PCR Cq values with VOC

We observed that people who had become infected with the Delta VOC had lower average RT-PCR Cq values of the ORF1a, S gene, and N gene than people who had been infected with the Alpha or Omicron (16.6 versus 18.2 or 18.2 respectively) VOC (Table 1). Two of the Delta clades (21I and 21J) displayed considerably lower average RT-PCR Cq values (16.0, 95% credible interval (CI) [15.4, 16.8] and 16.8, 95% CI [16.2, 17.3] respectively), adjusted for age and vaccination status, than all other clades, such as 21K Omicron (18.7, 95% CI [17.8, 19.2]) or 20I Alpha (18.9 95% CI [18.5, 19.3]) (see S4 Fig). The 21A Delta clade displayed higher Cq values than did all other clades (except EU1) (22.5, 95% CI [21.2, 23.8]).

The association between hospitalisation and clade and vaccination

We found that the probability of hospitalisation was strongly related to age and COVID-19 vaccination status (Fig 5A). Overall, the probability of hospitalisation was very low for people who were under forty years of age but started to increase after that, whereas vaccinated people had a lower probability of being hospitalised when compared to unvaccinated people. The beneficial effect of any vaccination increased considerably with age, as the odds of hospitalisation decreased with age in the vaccinated population when compared to the unvaccinated (OR = 0.71, 95% CI [0.58, 0.84] at forty years of age; OR = 0.37, 95% CI [0.3, 0.45] at sixty, and OR = 0.19, 95% CI [0.13, 0.27] at eighty. This association between age and vaccination effect was similar across all common clades (Fig 5B). The probability of hospitalisation was higher for males over fifty than it was for females of the same age regardless of vaccination status (S6A Fig). The odds of hospitalisation for males when compared to females was, for fifty year-olds, OR = 1.35, 95% CI [1.17, 1.54] and, for seventy-five year-olds, OR = 2.39, 95% CI: [1.85, 2.99] (S6B Fig). Pairwise comparisons of the clades showed that 21K and 21L Omicron-infected people had a lower probability of hospitalisation in age groups which started at the age of forty, independent of any vaccination status when compared to people of the same age who had been infected with any other common clades (Fig 5C and S7 and S8 Figs). In the population of twenty year-olds and below, 21K and 21L Omicron-infected people had a similar hospitalisation level of probability when compared to other clades (Fig 5C and S7 and S8 Figs). Pairwise comparisons of clades other than 21K and 21L Omicron showed a similar probability of hospitalisation (see S7 and S8 Figs for all pairwise comparisons).

Fig 5. The hospitalisation of SARS-CoV-2-positive individuals in Estonia.

Fig 5

(A) the probability of hospitalisation is associated with vaccination status, age, and clade. Posterior summaries of the conditional effect of the clade, vaccination status, and age in logistic regression when adjusted for sex, the number of weekly cases, and population vaccination coverage, N = 23,456; (B) log the odds ratio of hospitalisation in vaccinated individuals when compared to unvaccinated SARS-CoV-2-positive individuals; (C) log the odds ratio in terms of the hospitalisation of unvaccinated SARS-CoV-2-positive individuals who have been infected with Omicron 21K (upper panels) or 21L (lower panels) when compared to individuals who have been infected with other common SARS-CoV-2 clades. The line denotes the model’s best estimate, and the ribbon denotes a 95% credible interval. The solid line denotes the span of ages which have been observed for each clade, while the dashed line denotes unobserved ages implied by the underlying model. The dotted line denotes log odds = 0. Panels B and C are based on the same model as for A, while the model summary is presented in Table 4 in the S2 Appendix.

Discussion

In this countrywide study, we characterise the circulating variants in Estonia between March 2020 and March 2022 by genotyping about 7% of SARS-CoV-2-positive samples.

We demonstrate the following:

1) the prevalence of the clades in Estonia resembled those in other European countries

2) different VOC waves varied in terms of Pango lineage diversity, with the highest levels of diversity being seen in the Delta wave and the lowest in the Alpha wave

3) the probability of hospitalisation was associated with clade, as two Omicron clades (21K and 21L) displayed lower hospitalisation rates when compared to all other common clades

4) the protective effect of the vaccination against severe disease, with such disease being associated with hospitalisation, was seen in individuals who were aged forty and above, with the effect strongly increasing with age and being independent of the involved clade.

Changes in the most highly-affected population groups during the pandemic

The genotypic surveillance of SARS-CoV-2 in Estonia presents trends which generally match those of the rest of Europe, being and is characterised by the prevalence of clades 20A and 20B in 2020, and then in 2021–2022 by three successive waves of SARS-CoV-2 which were dominated by Alpha, Delta, and Omicron VOC in that order (https://www.ecdc.europa.eu/en/covid-19). As for those population subgroups which were affected, as with others [17], we observed that individuals who were aged between 20–59 years were over-represented in Estonia (>30%) relative to their proportion in the general population (26%). Again as with others [18, 19], we observed an age-group prevalence shift from individuals who were aged forty and above during the first waves of the pandemic to individuals who were below forty, with the most substantial increase being seen during the Omicron waves in individuals who were aged nineteen and below. We could speculate that such shifts can be attributed to several measured and non-measured factors, such as asymptomatic infections and/or milder symptoms in testing younger people, the early vaccination of older at-risk age groups when compared to people in the younger age groups, and public health efforts to manage the pandemic, to mitigation measure enforcement, and also to comply with such measures. The introduction of antigen quick-testing in schools in November 2021 may also have substantially contributed to the measured increase in the prevalence of under-nineteens showing an increase in infections.

Lineage diversity during the waves

Lineage diversity in terms of the Alpha wave was distinct from that of all other waves, including the pre-VOC wave, and the Delta and Omicron waves, through the overwhelming dominance of one single lineage: B.1.1.7. The B.1.1.7 lineage was more highly-transmissible than its pre-VOC predecessors, resulting in its worldwide spread, and this became dominant in a large number of countries [2022]. Omicron waves displayed a similar pattern to Alpha within our study timeframe, albeit with a substantial decrease in diversity. Diversity was at its highest during the Delta wave and, in contrast to the Alpha and Omicron waves, remained high throughout the wave. We could speculate that higher Delta diversity was associated with higher virus loads, as Delta infections were shown to carry about six times more viral RNA when compared to Alpha [23], something which is supported by our findings for the two Delta clades which displayed the lowest Cq values. Interestingly, we observed an apparent increase in the diversity of the imported variants before or at the beginning of new waves. Variants which were present only in those individuals who reported that they had recently travelled and variants which were first detected in individuals who had reported travelling represent 70% of all variants which were circulating in Estonia during the same period, suggesting 70% efficiency in intercepting new variants entering the country.

Hospitalisation

As is the case with others [24, 25], our analysis of hospitalisation data suggests that the risk of hospitalisation was lower in the case of two Omicron clades when compared to all other common clades which were detected in Estonia during the study period. There were no differences in hospitalisation rates between all the other clades. We can see slightly increased hospitalisation rates in relation to Omicron-infected young people (those under the age of twenty) when compared to other clades, with this being a trend which has been spotted by others [24]. Importantly, our data suggest that vaccination substantially reduced the probability of hospitalisation, especially in older populations which were at a higher risk of contracting severe levels of illness, irrespective of the involved virus clade. The vaccination effect on hospitalisation rates was strongly related to age, and tended to wane in younger age groups. We can speculate that the increasing protective effect of vaccination on hospitalisation rates in connection with age, particularly in those who were aged forty and above, can at least partially be attributed to a higher adaptive immune response to SARS-Cov-2 [26].

The significance and implementation of SARS-CoV-2 sequencing

As in many countries, Estonia initiated SARS-CoV-2 sequencing in order to characterise the mutation patterns of the virus, and to complement the epidemiological data of outbreak management, including contact tracing, against viral genetic information. In 2021 the detection of new variants and potentially more virulent strains which were circulating around the world became the main focus of sequencing. This information was constantly integrated into decision-making by the Estonian Health Board and the government. Simultaneously, it is pivotal to acknowledge that, most of the time, we were sequencing the minimum level—or even higher—of samples as recommended by ECDC, enabling us to describe the variants almost in real-time in the background of the constantly and rapidly evolving COVID-19 pandemic.

We emphasise that, based upon an in-depth analysis of SARS-CoV-2 sequencing outcomes, it is possible to devise a sequencing strategy for future crises. For instance, the SARS-CoV-2 sequencing project made it possible for us to work out, from scratch, an effective sequencing approach at the national level, however, the downstream analysis further highlighted the significance of the proper stratification of samples in order to sufficiently cover various population groups and societal segments. Countries around the globe, including Estonia, expended substantial resources in conducting SARS-CoV-2 sequencing. Therefore, it is crucial to assess the optimal means by which such information can be acquired in the future. Our study provides valuable information for formulating areas of strategy in the event of a future pandemic which resembles the COVID-19 pandemic. Data which have been generated during this project could be integrated into modelling efforts which are aimed at enhancing levels of preparedness for future pandemics or epidemics.

Strengths and limitations

The major strength of this study includes complementing our sequencing data with hospitalisation and vaccination data which has been retrieved from our national health databases, giving us a unique opportunity to explore the associations between viral genetics and disease outcomes. Our study has several limitations, including the uneven coverage of sample metadata, with less metadata being available for those samples which were collected at the beginning of the pandemic; and time-dependent confounding from multiple factors which can be related to different sets of public health mitigation measures which were being implemented during the consecutive virus waves, along with acquired immunity from SARS-CoV-2 infections, vaccine availability, and vaccination coverage, which together may have obscured the associations between clades and hospitalisation rates. Some of the analyses were affected by small sample sizes, such as when we pooled all the various vaccines in order to assess vaccine protection levels. Additionally, due to the small sample size, we could not reliably analyse the association of SARS-CoV-2 clades with the use of intensive care units or with mortality rates in Estonia.

Conclusions

This national SARS-CoV-2 sequencing study from the first two years of the COVID-19 pandemic (March 2020 to March 2022) revealed the following:

1) disease waves in Estonia were driven by the Alpha, Delta, and Omicron clades.

2) those variants which belonged to the Omicron clade caused substantially less severe levels of illness and hospitalisation when compared to preceding variants.

3) the COVID-19 vaccination effect on hospitalisation rates was independent of the involved SARS-CoV-2 clade.

During the COVID-19 pandemic, such data helped state agencies to understand how SARS-CoV-2 spread around Estonia and how measures to tackle the pandemic could be implemented. The study also made it possible to follow the evolution of the virus during the pandemic. When the Omicron wave came into Estonia, the virus spread faster, but the disease had become significantly milder by then so strict restrictions to control the spread of the virus were no longer necessary.

Supporting information

S1 Table. Distribution of SARS-CoV-2 Pango lineages by the epidemiological wave.

Epidemiological waves were defined by the Estonian Health Board. Values are counts and percentages relative to the column are shown in parentheses.

(DOCX)

pone.0303176.s001.docx (40.8KB, docx)
S1 Fig. Age-group prevalence and dynamics of SARS-Cov-2 infections.

(A) the study population age group distribution relative to the 2022 general population. Points denote the best estimate of the aggregated binomial model, N = 26,618. Thick and thin lines denote 67% and 95% credible intervals, respectively. Dotted horizontal lines denote age group prevalence in the general population. (B) trends in the study population age group distribution during the study period. Lines denote the best estimate of the simple binomial model fitted to the weekly age group prevalence. Dashed lines denote unobserved dates implied by the underlying model. Ribbons denote a 95% credible interval. Solid horizontal lines denote SARS-Cov-2 waves. Panel A model summary is presented in Table 5 in S2 Appendix. Panel B model summary is presented in Table 6 in S2 Appendix.

(TIF)

pone.0303176.s002.tif (594.9KB, tif)
S2 Fig. Trends of sex balance among age groups.

Lines denote the best estimate of the Bernoulli model, N = 24,343. The dashed lines denote unobserved dates implied by the underlying model. Ribbons denote a 95% credible interval. Solid horizontal lines denote SARS-Cov-2 waves. The model summary is presented in Table 7 in S2 Appendix.

(TIF)

pone.0303176.s003.tif (390.8KB, tif)
S3 Fig. The proportion of domestic and travel-related cases.

(A) the weekly proportions of domestic- and travel-related cases and cases where such info was missing. (B) the association of travel-related cases with VOC indicated by the Bernoulli model, adjusted for the proportion of cases with missing info and population vaccination coverage, N = 14,341. (C) VOC association with countries or territories of travel-related cases indicated by the aggregated binomial model, N = 13,603. Y-axis labels are country ISO codes, and “Other” includes countries or territories associated with less than 30 cases. In panels B and C, points denote the model’s best estimate, and thin and thick lines denote 66% and 95% credible intervals, respectively. Panel B model summary is presented in Table 8 in S2 Appendix. Panel C model summary is presented in Table 9 in S2 Appendix.

(TIF)

pone.0303176.s004.tif (1.6MB, tif)
S4 Fig. Association of average RT-PCR Cq values with SARS-CoV-2 clades.

Posterior summaries of the average Cq value of ORF1a, S gene, and N gene, obtained from a linear model adjusted for age, sex, and vaccination status, skew-normal distribution, N = 5,689. S gene dropouts were treated as missing data and only ORF1a and N gene Ct values were used for these samples. Point denotes the model’s best fit and error bars denote a 95% credible interval. The model summary is presented in Table 10 in S2 Appendix.

(TIF)

pone.0303176.s005.tif (175.5KB, tif)
S5 Fig. Diversity of SARS-CoV-2 lineages in domestic and travel-related cases in Estonia.

(A) the observed number of Pango lineages. (B) the difference in the observed number of Pango lineages, imported cases compared to domestic cases. (C) evenness index. (D) the effect size of the evenness index in imported cases compared to domestic cases. In panels A and C, points denote individual weekly observations. The line denotes the autoregressive model best fit and the shaded ribbon denotes a 95% credible interval. In panels B and D, the line denotes effect size derived from the model fit shown in panels A or C, respectively, and the shaded ribbon denotes a 95% credible interval. Panel A and B model summary is presented in Table 11 in S2 Appendix. Panel C and D model summary is presented in Table 12 in S2 Appendix.

(TIF)

pone.0303176.s006.tif (654.1KB, tif)
S6 Fig. SARS-CoV-2 infected males show a higher probability of hospitalization independent of vaccination status.

(A) the probability of hospitalization is associated with sex, age, and vaccination status. (B) the odds ratio of hospitalization of males relative to females is associated with age, but independent of vaccination status. Posterior distributions of the estimated marginal means were obtained from the same model as in Fig 5. The line denotes the model’s best estimate and the ribbon denotes a 95% credible interval. The dotted line denotes the log odds ratio = 0. The model summary is presented in Table 4 in S2 Appendix.

(TIF)

pone.0303176.s007.tif (522.4KB, tif)
S7 Fig. Lower hospitalization of Omicron-infected people among unvaccinated population.

The odds ratios of hospitalization among the unvaccinated population, pairwise comparisons of the SARS-CoV-2 clades, associated with age, averaged over both sexes. Posterior distributions of the estimated marginal means were obtained from the same model as in Fig 5. The line denotes the model’s best estimate and the ribbon denotes a 95% credible interval. The dotted line denotes the log odds ratio = 0. The model summary is presented in Table 4 in S2 Appendix.

(TIF)

pone.0303176.s008.tif (683.3KB, tif)
S8 Fig. Lower hospitalization of Omicron-infected people among the vaccinated population.

The odds ratios of hospitalization among the vaccinated population, pairwise comparisons of the SARS-CoV-2 clades, associated with age, averaged over both sexes. Posterior distributions of the estimated marginal means were obtained from the same model as in Fig 5. The line denotes the model’s best estimate and the ribbon denotes a 95% credible interval. The dotted line denotes the log odds ratio = 0. The model summary is presented in Table 4 in S2 Appendix.

(TIF)

pone.0303176.s009.tif (686.1KB, tif)
S1 Appendix. Supporting methods.

(DOCX)

pone.0303176.s010.docx (28.3KB, docx)
S2 Appendix. Supporting tables.

(XLSX)

pone.0303176.s011.xlsx (36.4KB, xlsx)

Data Availability

The raw sequence data of analyzed samples were submitted to the European Nucleotide Archive under Project accessions: Accession Description PRJEB51354 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51025 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51498 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50998 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51124 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51230 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51679 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52383 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51023 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51683 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51735 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52298 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51503 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51595 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52047 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51597 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51738 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51007 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51126 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52438 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51231 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51352 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52254 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51348 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47109 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47229 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47713 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50296 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50797 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47099 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47111 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48428 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48514 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49181 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50529 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50538 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47106 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47112 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47331 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47583 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47944 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48700 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50534 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50537 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47326 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47434 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47436 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47570 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48138 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48892 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50275 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50429 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50485 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB46782 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47105 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47120 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47598 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48077 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48558 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48886 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50528 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50535 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50546 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50677 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47110 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47180 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47183 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47194 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47585 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47873 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48696 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48711 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50312 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50486 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50540 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50559 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47121 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48640 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48772 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49981 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50007 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50859 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47123 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49812 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50186 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47125 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48515 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49070 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49805 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49810 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50341 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50524 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50525 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50531 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50675 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50997 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47101 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47102 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47992 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48559 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49076 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49978 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50886 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47116 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47118 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47190 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47871 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48440 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48701 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48771 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48873 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49167 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49807 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50311 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50327 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50436 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50678 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49988 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50316 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50526 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50549 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47100 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47103 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47104 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47113 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47114 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47115 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47119 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47145 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47227 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47674 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48429 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50189 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50192 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50680 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50860 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52934 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52588 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52841 Whole genome sequencing of SARS-CoV-2 in Estonia.

Funding Statement

This work was funded by the Ministry of Education and Research, Republic of Estonia [contract number 1.1-6.2/21/298] and by the University of Tartu. This work was funded partially by the European Union through HORIZON Coordination and Support Actions [grant agreement 101079349] "Boosting the One Health Research Excellence and Management Capacity of the Estonian University of Life Sciences". Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or European Health and Digital Executive Agency. Neither the European Union nor the granting authority can be held responsible for them. This work was funded partially by the European Union through EU4Health Programme (EU4H) [grant agreement 101102733] "Delivering a Unified Research Alliance of Biomedical and public health Laboratories against Epidemics". Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or European Health and Digital Executive Agency. Neither the European Union nor the granting authority can be held responsible for them. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Leung K, Shum MH, Leung GM, Lam TT, Wu JT. Early transmissibility assessment of the N501Y mutant strains of SARS-CoV-2 in the United Kingdom, October to November 2020. Eurosurveillance. 2021;26. doi: 10.2807/1560-7917.ES.2020.26.1.2002106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Updated rapid risk assessment from ECDC on the risk related to the spread of new SARS-CoV-2 variants of concern in the EU/EEA–first update. Eurosurveillance. 2021;26. doi: 10.2807/1560-7917.es.2021.26.3.2101211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Sanyaolu A, Okorie C, Marinkovic A, Haider N, Abbasi AF, Jaferi U, et al. The emerging SARS-CoV-2 variants of concern. Therapeutic Advances in Infectious Disease. 2021;8: 204993612110243. doi: 10.1177/20499361211024372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Araf Y, Akter F, Tang Y, Fatemi R, Parvez MdSA, Zheng C, et al. Omicron variant of SARS‐CoV‐2: Genomics, transmissibility, and responses to current COVID‐19 vaccines. Journal of Medical Virology. 2022;94: 1825–1832. doi: 10.1002/jmv.27588 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Wolter N, Jassat W, Walaza S, Welch R, Moultrie H, Groome M, et al. Early assessment of the clinical severity of the SARS-CoV-2 omicron variant in South Africa: A data linkage study. The Lancet. 2022;399: 437–446. doi: 10.1016/S0140-6736(22)00017-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Liu Y, Rocklöv J. The effective reproductive number of the Omicron variant of SARS-CoV-2 is several times relative to Delta. Journal of Travel Medicine. 2022;29. doi: 10.1093/jtm/taac037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Lewnard JA, Hong VX, Patel MM, Kahn R, Lipsitch M, Tartof SY. Clinical outcomes associated with SARS-CoV-2 Omicron (B.1.1.529) variant and BA.1/BA.1.1 or BA.2 subvariant infection in Southern California. Nature Medicine. 2022;28: 1933–1943. doi: 10.1038/s41591-022-01887-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Eden J-S, Rockett R, Carter I, Rahman H, Ligt J de, Hadfield J, et al. An emergent clade of SARS-CoV-2 linked to returned travellers from Iran. Virus evolution. 2020;6: veaa027. doi: 10.1093/ve/veaa027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Sjoberg DD, Whiting K, Curry M, Lavery JA, Larmarange J. Reproducible summary tables with the gtsummary package. The R Journal. 2021;13: 570–580. doi: 10.32614/RJ-2021-053 [DOI] [Google Scholar]
  • 10.Stan Development Team. RStan: The R interface to Stan. 2022. Available: https://mc-stan.org/ [Google Scholar]
  • 11.Bürkner P-C. Advanced Bayesian multilevel modeling with the R package brms. The R Journal. 2018;10: 395–411. doi: 10.32614/RJ-2018-017 [DOI] [Google Scholar]
  • 12.Kay M. tidybayes: Tidy data and geoms for Bayesian models. 2022. doi: 10.5281/zenodo.1308151 [DOI] [Google Scholar]
  • 13.Lenth RV. Emmeans: Estimated marginal means, aka least-squares means. 2022. Available: https://github.com/rvlenth/emmeans [Google Scholar]
  • 14.Oksanen J, Simpson GL, Blanchet FG, Kindt R, Legendre P, Minchin PR, et al. Vegan: Community ecology package. 2022. Available: https://CRAN.R-project.org/package=vegan [Google Scholar]
  • 15.Wickham H, Averick M, Bryan J, Chang W, McGowan LD, François R, et al. Welcome to the tidyverse. Journal of Open Source Software. 2019;4: 1686. doi: 10.21105/joss.01686 [DOI] [Google Scholar]
  • 16.Wickham H. ggplot2: Elegant graphics for data analysis. Springer-Verlag New York; 2016. Available: https://ggplot2.tidyverse.org [Google Scholar]
  • 17.Monod M, Blenkinsop A, Xi X, Hebert D, Bershan S, Tietze S, et al. Age groups that sustain resurging COVID-19 epidemics in the United States. Science. 2021;371. doi: 10.1126/science.abe8372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.De Salazar PM, Gómez-Barroso D, Pampaka D, Gil JM, Peñalver B, Fernández-Escobar C, et al. Lockdown measures and relative changes in the age-specific incidence of SARS-CoV-2 in Spain. Epidemiology and Infection. 2020;148. doi: 10.1017/s0950268820002551 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Malmgren J, Guo B, Kaplan HG. Continued proportional age shift of confirmed positive COVID-19 incidence over time to children and young adults: Washington state March—August 2020. Borrow R, editor. PLOS ONE. 2021;16: e0243042. doi: 10.1371/journal.pone.0243042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Davies NG, Abbott S, Barnard RC, Jarvis CI, Kucharski AJ, Munday JD, et al. Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science. 2021;372. doi: 10.1126/science.abg3055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.O’Toole Á, Scher E, Underwood A, Jackson B, Hill V, McCrone JT, et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evolution. 2021;7. doi: 10.1093/ve/veab064 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Zárate S, Taboada B, Muñoz-Medina JE, Iša P, Sanchez-Flores A, Boukadida C, et al. The Alpha variant (B.1.1.7) of SARS-CoV-2 failed to become dominant in Mexico. Medina RA, editor. Microbiology Spectrum. 2022;10. doi: 10.1128/spectrum.02240-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Earnest R, Uddin R, Matluk N, Renzette N, Turbett SE, Siddle KJ, et al. Comparative transmissibility of SARS-CoV-2 variants Delta and Alpha in New England, USA. Cell Reports Medicine. 2022;3: 100583. doi: 10.1016/j.xcrm.2022.100583 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Nyberg T, Ferguson NM, Nash SG, Webster HH, Flaxman S, Andrews N, et al. Comparative analysis of the risks of hospitalisation and death associated with SARS-CoV-2 Omicron (B.1.1.529) and Delta (B.1.617.2) variants in England: A cohort study. The Lancet. 2022;399: 1303–1312. doi: 10.1016/S0140-6736(22)00462-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Bager P, Wohlfahrt J, Bhatt S, Stegger M, Legarth R, Møller CH, et al. Risk of hospitalisation associated with infection with SARS-CoV-2 Omicron variant versus Delta variant in Denmark: An observational cohort study. The Lancet Infectious Diseases. 2022;22: 967–976. doi: 10.1016/S1473-3099(22)00154-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Pierce Carl A., Paula Preston-Hurlburt Yile Dai, Clare Burn Aschner Natalia Cheshenko, Galen Benjamin et al. Immune responses to SARS-CoV-2 infection in hospitalized pediatric and adult patients. Science Translational Medicine. 2020;12: eabd5487. doi: 10.1126/scitranslmed.abd5487 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Nihad AM Al-Rashedi

12 Feb 2024

PONE-D-23-26349SARS-CoV-2 clade dynamics and their associations with hospitalization during the first two years of the COVID-19 pandemicPLOS ONE

Dear Dr. Päll,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Mar 28 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Nihad A.M Al-Rashedi

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We suggest you thoroughly copyedit your manuscript for language usage, spelling, and grammar. If you do not know anyone who can help you do this, you may wish to consider employing a professional scientific editing service.  

Whilst you may use any professional scientific editing service of your choice, PLOS has partnered with both American Journal Experts (AJE) and Editage to provide discounted services to PLOS authors. Both organizations have experience helping authors meet PLOS guidelines and can provide language editing, translation, manuscript formatting, and figure formatting to ensure your manuscript meets our submission guidelines. To take advantage of our partnership with AJE, visit the AJE website (http://learn.aje.com/plos/) for a 15% discount off AJE services. To take advantage of our partnership with Editage, visit the Editage website (www.editage.com) and enter referral code PLOSEDIT for a 15% discount off Editage services.  If the PLOS editorial team finds any language issues in text that either AJE or Editage has edited, the service provider will re-edit the text for free.

Upon resubmission, please provide the following: 

● The name of the colleague or the details of the professional service that edited your manuscript

● A copy of your manuscript showing your changes by either highlighting them or using track changes (uploaded as a *supporting information* file)

● A clean copy of the edited manuscript (uploaded as the new *manuscript* file)

3. Please note that funding information should not appear in any section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript.

4. Your ethics statement should only appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please delete it from any other section. 

5. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. 

Additional Editor Comments:

​Your manuscript, "SARS-CoV-2 clade dynamics and their associations with hospitalization during the first two years of the COVID-19 pandemic" (PONE-D-23-26349), has been assessed by our reviewers. Although it is of interest, we are unable to consider it for publication in its current form. The reviewers have raised a number of points that we believe would improve the manuscript and may allow a revised version to be published in PLOS ONE.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Päll et al. present a comprehensive study on the dynamics of SARS-CoV-2 variants during the first two years of the COVID-19 pandemic in Estonia, examining their correlation with hospitalization and exploring the potential effects of vaccination across different age and sex groups. The article is well-organized, maintaining a good flow and clarity despite the abundance of data. The methods, results, and conclusions are consistently presented. Several findings align with global publications, providing clear data and evidence that contribute to broader conclusions applicable to various geographic regions during the pandemic. However, I have a few comments:

In the summary, the statement "The odds of hospitalization in the vaccinated population decreased with age" might be prone to misinterpretation. The wording could be clearer and less confusing. It's essential to contextualize this statement. The conclusion in the summary appears better worded, emphasizing that the impact of vaccination might be more pronounced in older individuals, as evidenced by a decrease in hospitalization rates. In the results section, the wording slightly changes when compared to the unvaccinated population. However, there is room for improvement to prevent potential confusion.

Reviewer #2: The study by Päll et al. aims to identify the circulating SARS-CoV-2 variants and subsequently relate their diversity and hospitalization outcomes from the first two years of the COVID-19 pandemic in Estonia. The authors concluded that the Omicron variant is linked to a lower probability of hospitalization, and vaccination is protective against severe disease progression in people aged 40 years and older. Overall, the manuscript is well-written. However, there are several minor concerns that need to be addressed by the authors.

1. Please include justification for why this study is needed in the Introduction section.

2. Table 1 is confusing. Do the numbers in parentheses represent percentages or standard deviations? Please clarify.

3. Please consider separating the Discussion section into several paragraphs for better readability.

4. One of the interesting findings of this study is that the protective effect of vaccination against severe disease, associated with hospitalization, was observed from age 40 years onwards and this effect increased with age. Please elaborate more on this. Is there a possible explanation for this phenomenon?

5. Please include a paragraph on how the data in this study could be beneficial to the scientific community or government policymaking.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 May 10;19(5):e0303176. doi: 10.1371/journal.pone.0303176.r002

Author response to Decision Letter 0


8 Apr 2024

Dear Nihad A.M Al-Rashedi,

thank you and the reviewers for the comments and suggestions. Here we try to address the issues raised during the review.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

We updated title page formatting, referencing to figures and tables, and reference formatting according to PLOS One style.

2. We suggest you thoroughly copyedit your manuscript for language usage, spelling, and grammar.

The manuscript was copyedited by a professional service:

Ref: Maarja Yörük

Tõlkebüroo Esteet OÜ

+372 56995633

www.esteet.ee

3. Please note that funding information should not appear in any section or other areas of your manuscript. We will only publish funding information present in the

Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript.

All funding-related text was removed from the Manuscript.

4. Your ethics statement should only appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please delete it from any other section.

We included the Ethics statement section under the Methods section and removed the ethics statement from other sections.

5. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting- information.

We included S1 and S2 Appendix captions at the end of the Manuscript. Additionally, we restructured Supporting Information – moved the S1 Table to a separate file, moved the S2 Table to the S2 Appendix and updated in-text references accordingly. We moved the S1 Table, and S1-S8 Fig captions to the end of the Manuscript.

Reviewer #1: Päll et al. present a comprehensive study on the dynamics of SARS-CoV-2 variants during the first two years of the COVID-19 pandemic in Estonia, examining their correlation with hospitalization and exploring the potential effects of vaccination across different age and sex groups. The article is well-organized, maintaining a good flow and clarity despite the abundance of data. The methods, results, and conclusions are consistently presented. Several findings align with global publications, providing clear data and evidence that contribute to broader conclusions applicable to various geographic regions during the pandemic.

However, I have a few comments:

1) In the summary, the statement "The odds of hospitalization in the vaccinated population decreased with age" might be prone to misinterpretation. The wording could be clearer and less confusing. It's essential to contextualize this statement. The conclusion in the summary appears better worded, emphasizing that the impact of vaccination might be more pronounced in older individuals, as evidenced by a decrease in hospitalization rates. In the results section, the wording slightly changes when compared to the unvaccinated population. However, there is room for improvement to prevent potential confusion.

Our main goal in this sequencing study, regarding vaccination, was to analyse if vaccination benefit is related to virus clade, for which we could not find support. Whereas hospitalisation was strongly related to age and Omicron caused fewer hospitalisations. Vaccination benefits regarding hospitalisation were not the same in all age groups, instead in the over-forties population, where hospitalisation became considerable, vaccination benefit was reduced with younger age, correlating with findings by others that adaptive immune response to SARS-CoV-2 increases with age. To make these results clearer, we updated the text, respectively, in the Abstract:

“Hospitalisation increased markedly with age in the over-forties, and was negligible in the under-forties. Vaccination decreased the odds of hospitalisation in over-forties. The effect of vaccination on hospitalisation rates was strongly dependent upon age but was clade-independent.”

in the Results section:

“We found that the probability of hospitalisation was strongly related to age and COVID-19 vaccination status (Fig 5A). Overall, the probability of hospitalisation was very low for people who were under forty years of age but started to increase after that, whereas vaccinated people had a lower probability of being hospitalised when compared to unvaccinated people. The beneficial effect of any vaccination increased considerably with age, as the odds of hospitalisation decreased with age in the vaccinated population when compared to the unvaccinated (OR = 0.71, 95% CI [0.58, 0.84] at forty years of age; OR = 0.37, 95% CI [0.3, 0.45] at sixty, and OR = 0.19, 95% CI [0.13, 0.27] at eighty. This association between age and vaccination effect was similar across all common clades (Fig 5B).”

and the Discussion:

“Importantly, our data suggest that vaccination substantially reduced the probability of hospitalisation, especially in older populations which were at a higher risk of contracting severe levels of illness, irrespective of the involved virus clade. The vaccination effect on hospitalisation rates was strongly related to age, and tended to wane in younger age groups. We can speculate that the increasing protective effect of vaccination on hospitalisation rates in connection with age, particularly in those who were aged forty and above, can at least partially be attributed to a higher adaptive immune response to SARS-Cov-2 [26].”

Reviewer #2: The study by Päll et al. aims to identify the circulating SARS-CoV-2 variants and subsequently relate their diversity and hospitalization outcomes from the first two years of the COVID-19 pandemic in Estonia. The authors concluded that the Omicron variant is linked to a lower probability of hospitalization, and vaccination is protective against severe disease progression in people aged 40 years and older. Overall, the manuscript is well-written. However, there are several minor concerns that need to be addressed by the authors.

1. Please include justification for why this study is needed in the Introduction section.

We rewrote the end of the Introduction to better address the justification of this study: “We undertook this study in order to be able to understand how distinct SARS-CoV-2 clades influenced the severity of COVID-19 and hospitalisation, and which other factors - including immunisation against SARS-CoV-2 - served to contribute to this process.”

2. Table 1 is confusing. Do the numbers in parentheses represent percentages or standard deviations? Please clarify.

The dimension of the value in parentheses is indicated in the first column of Table 1, it’s either an SD, % or IQR, depending on the type of the data. We updated the Table 1 legend to improve table readability.

3. Please consider separating the Discussion section into several paragraphs for better readability.

Thank you for the suggestion, we split the Discussion into separate paragraphs.

4. One of the interesting findings of this study is that the protective effect of vaccination against severe disease, associated with hospitalization, was observed from age 40 years onwards and this effect increased with age. Please elaborate more on this. Is there a possible explanation for this phenomenon?

We don’t have a clear explanation for this, nor does our data allow us to address this question directly. However, there seems to be a higher adaptive immune response to SARS-Cov-2 in older individuals (doi:10.1126/scitranslmed.abd5487), which seems to be a good working hypothesis. We amended the Discussion accordingly:

“We can speculate, that the increasing protective effect of vaccination on hospitalization with age, particularly in those over 40 years old, can be at least partly attributed to higher adaptive immune response to SARS-Cov-2 [26].”

5. Please include a paragraph on how the data in this study could be beneficial to the scientific community or government policymaking.

We amended the Discussion with the ‘Significance and implementation of SARS-CoV-2 sequencing section’, to better address aspects of the study relevant to government policymaking:

“As in many countries, in Estonia, we initiated SARS-CoV-2 sequencing to characterize the mutation patterns of the virus, and to complement the epidemiological data of outbreak management, including contact tracing, with viral genetic information. In 2021, the detection of new variants and potentially more virulent strains circulating in the world became the main focus of the sequencing. This information was constantly integrated into the decision-making by the Estonian Health Board and the Government. Simultaneously, it is pivotal to acknowledge that most of the time we sequenced the minimum or even higher proportion of samples recommended by ECDC, enabling us to describe the variants almost in real-time in the background of the constantly and rapidly evolving COVID-19 pandemic. In addition, as shown here, we assessed the viral diversity and the relationships of SARS-CoV-2 clades with hospitalization and vaccination.

We emphasize that based on an in-depth analysis of SARS-CoV-2 sequencing outcomes, it is possible to devise a sequencing strategy for future crises. For instance, the SARS-CoV-2 sequencing project enabled us to work out from scratch an effective sequencing approach at the national level, and downstream analysis brought out the significance of proper stratification of samples to sufficiently cover various population groups and societal segments, that should be prioritized for sequencing. Countries around the globe, including Estonia, expended substantial resources in conducting SARS-CoV-2 sequencing; hence, it is crucial to assess the optimal means by which such information can be acquired in the future. Our study provides valuable information for formulating strategies in the event of a future pandemic resembling COVID-19. The data generated during this project could be integrated into modelling efforts aimed at enhancing preparedness for future pandemics or epidemics.

Yours sincerely,

Taavi Päll

Research fellow of medical virology

Department of Microbiology,

Institute of Biomedicine and Translational Medicine,

University of Tartu,

Ravila 19

50411 Tartu,

Estonia

Email: taavi.pall@ut.ee

Mob.: +372 56452732

Attachment

Submitted filename: response-to-reviewers-2024-04-08.docx

pone.0303176.s012.docx (23.1KB, docx)

Decision Letter 1

Nihad AM Al-Rashedi

22 Apr 2024

SARS-CoV-2 clade dynamics and their associations with hospitalization during the first two years of the COVID-19 pandemic

PONE-D-23-26349R1

Dear Dr. Päll,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Nihad A.M Al-Rashedi

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I consider the authors have made revisions that enhance the quality of the writing, making it suitable for publication

Reviewer #2: The authors have revised the manuscript accordingly. Additionally, the manuscript is now more readable after grammar editing. I suggest accepting it for publication in PLOS ONE.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

Acceptance letter

Nihad AM Al-Rashedi

29 Apr 2024

PONE-D-23-26349R1

PLOS ONE

Dear Dr. Päll,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Nihad A.M Al-Rashedi

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Distribution of SARS-CoV-2 Pango lineages by the epidemiological wave.

    Epidemiological waves were defined by the Estonian Health Board. Values are counts and percentages relative to the column are shown in parentheses.

    (DOCX)

    pone.0303176.s001.docx (40.8KB, docx)
    S1 Fig. Age-group prevalence and dynamics of SARS-Cov-2 infections.

    (A) the study population age group distribution relative to the 2022 general population. Points denote the best estimate of the aggregated binomial model, N = 26,618. Thick and thin lines denote 67% and 95% credible intervals, respectively. Dotted horizontal lines denote age group prevalence in the general population. (B) trends in the study population age group distribution during the study period. Lines denote the best estimate of the simple binomial model fitted to the weekly age group prevalence. Dashed lines denote unobserved dates implied by the underlying model. Ribbons denote a 95% credible interval. Solid horizontal lines denote SARS-Cov-2 waves. Panel A model summary is presented in Table 5 in S2 Appendix. Panel B model summary is presented in Table 6 in S2 Appendix.

    (TIF)

    pone.0303176.s002.tif (594.9KB, tif)
    S2 Fig. Trends of sex balance among age groups.

    Lines denote the best estimate of the Bernoulli model, N = 24,343. The dashed lines denote unobserved dates implied by the underlying model. Ribbons denote a 95% credible interval. Solid horizontal lines denote SARS-Cov-2 waves. The model summary is presented in Table 7 in S2 Appendix.

    (TIF)

    pone.0303176.s003.tif (390.8KB, tif)
    S3 Fig. The proportion of domestic and travel-related cases.

    (A) the weekly proportions of domestic- and travel-related cases and cases where such info was missing. (B) the association of travel-related cases with VOC indicated by the Bernoulli model, adjusted for the proportion of cases with missing info and population vaccination coverage, N = 14,341. (C) VOC association with countries or territories of travel-related cases indicated by the aggregated binomial model, N = 13,603. Y-axis labels are country ISO codes, and “Other” includes countries or territories associated with less than 30 cases. In panels B and C, points denote the model’s best estimate, and thin and thick lines denote 66% and 95% credible intervals, respectively. Panel B model summary is presented in Table 8 in S2 Appendix. Panel C model summary is presented in Table 9 in S2 Appendix.

    (TIF)

    pone.0303176.s004.tif (1.6MB, tif)
    S4 Fig. Association of average RT-PCR Cq values with SARS-CoV-2 clades.

    Posterior summaries of the average Cq value of ORF1a, S gene, and N gene, obtained from a linear model adjusted for age, sex, and vaccination status, skew-normal distribution, N = 5,689. S gene dropouts were treated as missing data and only ORF1a and N gene Ct values were used for these samples. Point denotes the model’s best fit and error bars denote a 95% credible interval. The model summary is presented in Table 10 in S2 Appendix.

    (TIF)

    pone.0303176.s005.tif (175.5KB, tif)
    S5 Fig. Diversity of SARS-CoV-2 lineages in domestic and travel-related cases in Estonia.

    (A) the observed number of Pango lineages. (B) the difference in the observed number of Pango lineages, imported cases compared to domestic cases. (C) evenness index. (D) the effect size of the evenness index in imported cases compared to domestic cases. In panels A and C, points denote individual weekly observations. The line denotes the autoregressive model best fit and the shaded ribbon denotes a 95% credible interval. In panels B and D, the line denotes effect size derived from the model fit shown in panels A or C, respectively, and the shaded ribbon denotes a 95% credible interval. Panel A and B model summary is presented in Table 11 in S2 Appendix. Panel C and D model summary is presented in Table 12 in S2 Appendix.

    (TIF)

    pone.0303176.s006.tif (654.1KB, tif)
    S6 Fig. SARS-CoV-2 infected males show a higher probability of hospitalization independent of vaccination status.

    (A) the probability of hospitalization is associated with sex, age, and vaccination status. (B) the odds ratio of hospitalization of males relative to females is associated with age, but independent of vaccination status. Posterior distributions of the estimated marginal means were obtained from the same model as in Fig 5. The line denotes the model’s best estimate and the ribbon denotes a 95% credible interval. The dotted line denotes the log odds ratio = 0. The model summary is presented in Table 4 in S2 Appendix.

    (TIF)

    pone.0303176.s007.tif (522.4KB, tif)
    S7 Fig. Lower hospitalization of Omicron-infected people among unvaccinated population.

    The odds ratios of hospitalization among the unvaccinated population, pairwise comparisons of the SARS-CoV-2 clades, associated with age, averaged over both sexes. Posterior distributions of the estimated marginal means were obtained from the same model as in Fig 5. The line denotes the model’s best estimate and the ribbon denotes a 95% credible interval. The dotted line denotes the log odds ratio = 0. The model summary is presented in Table 4 in S2 Appendix.

    (TIF)

    pone.0303176.s008.tif (683.3KB, tif)
    S8 Fig. Lower hospitalization of Omicron-infected people among the vaccinated population.

    The odds ratios of hospitalization among the vaccinated population, pairwise comparisons of the SARS-CoV-2 clades, associated with age, averaged over both sexes. Posterior distributions of the estimated marginal means were obtained from the same model as in Fig 5. The line denotes the model’s best estimate and the ribbon denotes a 95% credible interval. The dotted line denotes the log odds ratio = 0. The model summary is presented in Table 4 in S2 Appendix.

    (TIF)

    pone.0303176.s009.tif (686.1KB, tif)
    S1 Appendix. Supporting methods.

    (DOCX)

    pone.0303176.s010.docx (28.3KB, docx)
    S2 Appendix. Supporting tables.

    (XLSX)

    pone.0303176.s011.xlsx (36.4KB, xlsx)
    Attachment

    Submitted filename: response-to-reviewers-2024-04-08.docx

    pone.0303176.s012.docx (23.1KB, docx)

    Data Availability Statement

    The raw sequence data of analyzed samples were submitted to the European Nucleotide Archive under Project accessions: Accession Description PRJEB51354 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51025 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51498 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50998 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51124 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51230 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51679 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52383 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51023 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51683 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51735 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52298 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51503 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51595 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52047 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51597 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51738 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51007 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51126 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52438 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51231 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51352 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52254 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB51348 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47109 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47229 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47713 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50296 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50797 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47099 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47111 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48428 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48514 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49181 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50529 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50538 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47106 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47112 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47331 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47583 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47944 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48700 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50534 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50537 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47326 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47434 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47436 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47570 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48138 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48892 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50275 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50429 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50485 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB46782 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47105 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47120 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47598 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48077 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48558 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48886 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50528 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50535 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50546 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50677 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47110 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47180 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47183 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47194 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47585 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47873 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48696 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48711 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50312 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50486 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50540 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50559 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47121 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48640 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48772 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49981 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50007 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50859 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47123 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49812 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50186 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47125 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48515 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49070 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49805 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49810 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50341 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50524 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50525 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50531 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50675 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50997 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47101 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47102 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47992 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48559 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49076 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49978 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50886 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47116 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47118 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47190 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47871 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48440 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48701 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48771 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48873 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49167 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49807 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50311 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50327 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50436 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50678 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB49988 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50316 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50526 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50549 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47100 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47103 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47104 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47113 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47114 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47115 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47119 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47145 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47227 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB47674 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB48429 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50189 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50192 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50680 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB50860 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52934 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52588 Whole genome sequencing of SARS-CoV-2 in Estonia PRJEB52841 Whole genome sequencing of SARS-CoV-2 in Estonia.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES