Skip to main content
Cambridge University Press - PMC COVID-19 Collection logoLink to Cambridge University Press - PMC COVID-19 Collection
. 2021 Jul 30;149:e177. doi: 10.1017/S0950268821001461

Phylogenetic analysis of SARS-CoV-2 lineage development across the first and second waves in Eastern Germany in 2020: insights into the cause of the second wave

Buqing Yi 1,, Anna R Poetsch 2,3, Marlena Stadtmüller 1, Fabian Rost 4,5, Sylke Winkler 4,6, Alexander H Dalpke 1
PMCID: PMC8365105  PMID: 34325753

Abstract

In Germany, Eastern regions had a mild first wave of coronavirus disease 2019 (COVID-19) from March to May 2020, but were badly hit by a second wave later in autumn and winter. It is unknown how the second wave was initiated and developed in Eastern Germany where the number of COVID-19 cases was close to zero in June and July 2020. We used genomic epidemiology to investigate the dynamic of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) lineage development across the first and second waves in Eastern Germany. With detailed phylogenetic analyses we could show that SARS-CoV-2 lineages prevalent in the first and second waves in Eastern Germany were different, with several new variants including four predominant lineages in the second wave, having been introduced into Eastern Germany between August and October 2020. The results indicate that the major driving force behind the second wave was the introduction of new variants.

Key words: Phylogenetic analysis, SARS-CoV-2, second wave


In Germany, the first wave of the coronavirus disease 2019 (COVID-19) pandemic (March to May 2020) showed visible regional differences: it was much milder in Eastern regions (Saxony, Saxony-Anhalt, Berlin, Brandenburg and Thuringia) compared to most other regions in Germany. However, the severity of the second wave (August to December 2020) was similar in most regions in Germany. It is unclear how the second wave started in Eastern Germany where in June and July 2020 the number of COVID-19 cases was close to zero (Fig. 1A). We, therefore, performed phylogenetic analysis of the predominant variants of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in the first and second waves in Eastern Germany. By dissecting the difference between the first wave and the second wave, we expect the information achieved through this study could provide insights into the cause of the second wave and can possibly help developing suitable strategies for preventing similar scenarios in future.

Fig. 1.

Fig. 1.

Analysis of SARS-CoV-2 lineages predominant in the first and second waves in Eastern Germany, March to December 2020. (A) 7-day incidence rate per 100 000 inhabitants in Eastern Germany. First wave: March to May; second wave: August to December. (B) Summary of detected total SARS-CoV-2 lineage numbers in each month. (C) Frequency of detection for each SARS-CoV-2 lineage in each month in Eastern Germany (range: 0–0.74, representing 0–74%; 0 is shown with deep blue, indicating no detection of the relevant variant in that month). Note: To achieve a better resolution, a few lineages that were detected only once across 2020 and with a frequency of <0.01 were omitted from the heatmap. (D) Phylogenetic and time tree of SARS-CoV-2 genomes from Eastern Germany, March to December 2020. Each genome is denoted with Pangolin-lineage (PANGO Lineage). The names of lineages that were predominant in the first or second wave are colour labelled. The four lineages from the second wave B.1.177, B.1.258, B. 1.221 and B.1.160 had been circulating in multiple other European countries since June [1].

For surveillance purpose, randomly selected SARS-CoV-2 positive samples from each state in Germany were sequenced by the Robert Koch Institute or by sequencing facilities of local universities. All sequences that passed stringent quality control were uploaded to GISAID [2]. We used GISAID sequences from regions in Eastern Germany dating between March and December 2020 in this study (data collected on 28 February 2021; Table S1 in the Supplementary material available on the Cambridge Core website). The number of genomes in each month was: 74 (March), 102 (April), 19 (May), 48 (June), 18 (July), 41 (August), 47 (September), 105 (October) and 112 (December) (only a few genomes were sequenced in November because the testing labs were extremely overloaded by then, so the data of November were not included in the analysis). The data of 7-day-incidence rate per 100 000 inhabitants were obtained for the states in Eastern Germany from the Robert Koch Institute (https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Daten/Fallzahlen_Daten.htm), and the average values were visualised in Figure 1A. Lineage group assignment of SARS-CoV-2 genomes was performed using the software package Phylogenetic Assignment of Named Global Outbreak LINeages (Pangolin) [3]. Phylogenetic maximum likelihood and time trees were constructed using the SARS-CoV-2-specific procedures taken from github.com/nextstrain/ncov [4, 5].

The first wave in Eastern Germany reached its peak in April 2020 (Fig. 1A). Based on the frequency of detection in April (Fig. 1C and D), the SARS-CoV-2 lineages predominant in the first wave were: B.1, B.1.1.29, A and B, with respective frequencies of 46%, 21%, 9% and 7% (shown as 0.46, 0.21, 0.09 and 0.07 in Fig. 1C). The second wave reached its peak in December 2020 (Fig. 1A). Based on the frequency of detection in December (Fig. 1C and D), the most prevalent lineages in the second wave were different from that of the first wave: B.1.258, B.1.177, B.1.160 and B.1.221, with respective frequencies of 32%, 25%, 9% and 7%. All lineages in the first and second waves were defined in one batch with the pangoLEARN_version 2021-02-21. These four lineages B.1.258, B.1.177, B.1.160 and B.1.221 from the second wave were neither detected in the first wave in Eastern Germany (Fig. 1C and D), nor possibly derived from the local first wave lineages through mutant accumulation since the 7-day incidence rate in June and July in Eastern Germany was close to zero, which means there was almost no virus circulating in the local population. B.1.258, B.1.177, B.1.160 and B.1.221 were first identified in other European countries before April 2020 (https://cov-lineages.org/pango_lineages.html), and have a known spreading history in multiple other European countries in June and July, such as in Spain [1]. In Eastern Germany, B.1.258 was first detected in October; B.1.177 was first detected in August; B.1.160 was first detected in August and B.1.221 was first detected in September (Fig. 1C and D).

From August until October 2020 was the summer/autumn holiday season in Eastern Germany, and a lot of regional and international travels took place during this period. Our analysis indicates that various new lineages were introduced into Eastern Germany from August to October 2020 (Fig. 1B and C). For example, in August, 20 new lineages were first detected in Eastern Germany, such as B.1.160, B.1.1.234, B.1.1.277, B.1.1.305, B.1.1.39, B.1.416 and B.1.177. In total, more than 40 new variants were introduced into Eastern Germany during the holiday season (Fig. 1C), including the four predominant lineages B.1.258, B.1.177, B.1.160 and B.1.221, which paved the base for the second wave.

Interestingly, only a few of these new variants were responsible for most local cases in December when the second wave reached its peak value: the four predominant new variants (B.1.258; B.1.177; B.1.160 and B.1.221) were estimated to account for more than 70% of the cases based on their frequency of detection. These findings suggest that several control measures, such as test on the airport, might have prevented the local transmission of several new variants. However, from August to October 2020, the lineages B.1.258, B.1.177, B.1.160 and B.1.221 were prevalent in multiple European countries [1], which means the chances of the introduction of these lineages were higher compared to other variants.

In conclusion, the introduction of various SARS-CoV-2 lineages from August to October 2020 was the major driving force for the development of the second wave in Eastern Germany, instead of expansion of local circulating lineages from the first wave.

Acknowledgements

We thank all researchers who are working around the clock to generate and share genome data on GISAID (http://www.gisaid.org) on which the analysis is based. We specifically thank colleagues at the Institute of Medical Microbiology and Virology, Technische Universität Dresden, for their work in performing SARS-CoV-2 sample testing and sequencing sample preparing, and we thank the Robert Koch Institute and Dresden Concept Genome Center for their sequencing efforts. This project was in part co-financed with tax funds on the basis of the budget passed by the Saxony state parliament (Saxonian COVID-19 Research Consortium SaxoCOV).

Ethical standards

This study did not directly involve patients and does not require approval by an ethics committee.

Code availability

The code used for phylogenetic analysis is available at github.com/nextstrain/ncov.

Supplementary material

For supplementary material accompanying this paper visit http://dx.doi.org/10.1017/S0950268821001461.

S0950268821001461sup001.doc (555.5KB, doc)

click here to view supplementary material

Conflict of interest

None.

Data availability statement

The data used in this study is publicly available [2].

References

  • 1.Hodcroft EBet al. (2021) Spread of a SARS-CoV-2 variant through Europe in the summer of 2020. Nature 595, 707–712. doi: 10.1038/s41586-021-03677-y. [DOI] [PubMed] [Google Scholar]
  • 2.Shu Y and McCauley J (2017) GISAID: global initiative on sharing all influenza data – from vision to reality. EuroSurveillance 22(13), 30494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Rambaut Aet al. (2020) A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nature Microbiology 5, 1403–1407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sagulenko P, Puller V and Neher RA (2018) TreeTime: maximum-likelihood phylodynamic analysis. Virus Evolution 4, vex042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hadfield Jet al. (2018) Nextstrain: real-time tracking of pathogen evolution. Bioinformatics (Oxford, England) 34, 4121–4123. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

For supplementary material accompanying this paper visit http://dx.doi.org/10.1017/S0950268821001461.

S0950268821001461sup001.doc (555.5KB, doc)

click here to view supplementary material

Data Availability Statement

The data used in this study is publicly available [2].


Articles from Epidemiology and Infection are provided here courtesy of Cambridge University Press

RESOURCES