Abstract
We show that the SARS-CoV-2 B.1.1.7 lineage is highly disseminated in Portugal, with the odds of B.1.1.7 proportion increasing at an estimated 89% (95% confidence interval: 83–95%) per week until week 3 2021. RT-PCR spike gene target late detection (SGTL) can constitute a useful surrogate to track B.1.1.7 spread, besides the spike gene target failure (SGTF) proxy. SGTL/SGTF samples were associated with statistically significant higher viral loads, but not with substantial shift in age distribution compared to non-SGTF/SGTL cases.
Keywords: SARS-CoV-2, B.1.1.7 lineage, RT-PCR, spike gene target failure (SGTF), spike gene target late detection (SGTL), laboratory surveillance, genome sequencing
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) lineage B.1.1.7, also designated variant of concern (VOC) 202012/01 or 501Y.V1, has shown a pronounced frequency increase in the United Kingdom (UK) since November 2020 and has rapidly expanded its geographical range worldwide [1-5]. The SARS-CoV-2 B.1.1.7 lineage harbours a 21765–21770 genomic deletion (spike Δ69–70) that affects the detection of the spike (S) gene by some real-time polymerase chain reaction (RT-PCR) assays (e.g. TaqPath COVID-19, ThermoFisher, Waltham, MA, United States, targeting N, ORF1ab and S genes), leading to what has been termed ‘spike gene target failure (SGTF)’ or ‘spike gene drop out’ [1,6]. This coincidental occurrence has provided a good proxy for monitoring trends of B.1.1.7 [1,7-10].
We investigated the proportion of SGTF cases to gain insight on B.1.1.7 frequency and geographical spread in Portugal.
A detailed description of the Material and Methods is presented in the Supplement, including RT-PCR procedures and rationale for sample classification, genome sequencing, bioinformatics and statistical analysis.
Spike gene target failure and spike gene target late detection RT-PCR data as proxies for monitoring B.1.1.7 circulation
We took advantage of a large SARS-CoV-2 TaqPath COVID-19 RT-PCR data set comprehensively collected at the community level between week 49 2020 and week 3 2021 by Unilabs (Oporto), a large private laboratory with 287 testing sites distributed throughout the country.
We assumed that SGTF can be a reliable indicator of B.1.1.7 circulation in our country, considering that: (i) Portugal was among the top destinations for air travellers from the UK during early autumn [11]; (ii) the laboratories at the Portuguese National Institute of Health Dr. Ricardo Jorge (INSA) detected multiple independent B.1.1.7 introductions since early December 2020, among the B.1.1.7-associated cases confirmed by sequencing [12], as at 5 February 2021 and (iii) B.1.1.7 was identified by sequencing in 91.9% (79/86) of known SGTF-positive samples up to week 3 2021.
Between week 49 2020 and week 3 2021, Unilabs performed 170,658 SARS-CoV-2 RT-PCR tests using the TaqPath COVID-19 assay, which roughly corresponds to 8% of all SARS-CoV-2 RT-PCR tests done in Portugal in the same period. Of the 36,651 positive results, 3,367 (9.2%) corresponded to SGTF tests, as defined by a positive test with non-detectable S gene and cycle threshold (Ct) values of ≤ 30 for N and ORF1ab targets. During the same period, we also unexpectedly detected that in 1.5% (561/36,651) of TaqPath COVID-19 positive samples Ct values for the S gene were > 5 units higher than the maximum Ct value obtained for the other two targets (N and ORF1ab) of the assay (again, counts only included positive samples with ≤ 30 Ct for N and ORF1ab targets). The mean Ct difference between S gene and N/ORF genes for these samples was consistently around 5–8 Ct values (median: 6.60; interquartile range (IQR): 5.79–7.59), so we tentatively designated this RT-PCR profile as ‘spike gene target late detection’ (SGTL).
The delayed detection (i.e. higher Ct values) might be due to stochastic and low frequent probe misannealing, rendering detectable PCR signals only when a high number of S target amplicons are present in the reaction. B.1.1.7 was identified by sequencing in 11 of 12 SGTL positive samples, with the 21765–21770 genomic deletion (spike Δ69–70) being finely inspected and unequivocally confirmed in all SGTL samples. These data, together with the parallel scenario observed for SGTF and STGL regarding frequency increase and viral loads (see below), supports that SGTL can constitute an additional proxy to detect and monitor B.1.1.7 lineage. This phenomenon of late S gene detection has been observed in England, and diagnostic laboratories have been advised of such possible alternative presentation of B.1.1.7 [13].
Continued increase in the proportion of spike gene target failure and spike gene target late detection samples
The proportion of both SGTF and SGTL cases has continually risen since the beginning of December (week 49 2020) (Figure 1), reaching a total of 22.0% and 2.8% of all TaqPath COVID-19 positive samples, respectively, in week 3 2021. Since week 53 2020, the aggregated proportion of SGTF plus SGTL (hereafter defined as SGTF/SGTL) cases increased eightfold, reaching 24.7% in week 3 2021. The odds of SGTF/SGTL proportion increased at a 90% (95% confidence interval (CI): 85–96) rate per week. We forecast—assuming no change in the increasing rate—that the proportion of SGTF/SGTL cases can reach up to 68% (95% CI: 65–71) of positive cases by week 6 (Figure 2).
The proportion of SGTF/SGTL cases that are B.1.1.7 was estimated as 0.918 (95% CI: 0.845–0.964) based on a sub-sample of sequencing cases (90 B.1.1.7 cases in 98 known SGTF/SGTL sequenced samples). Thus, assuming this constant proportion of 0.918, the corresponding estimates for B.1.1.7 lineage are: (i) a proportion among SARS-CoV-2 detections of 12.3% and 22.7% at weeks 2 and 3, (ii) a 89% (95% CI: 83–96) odds increase rate per week and (iii) a forecast of a proportion of 65% (95% CI: 62–68) among SARS-CoV-2 detections on week 6 (Figure 2). This high growth rate (almost doubling each week) parallels what has been reported in other countries, namely in the UK [3] and Denmark [14]. Of note, given that the frequency of SGTF/SGTL that are B.1.1.7 is expected to increase over time (as observed in the UK [13]), we do not rule out that there is a potential bias in our estimates because of the use of a constant value (0.918) for the B.1.1.7 proxy.
Nevertheless, supporting the accuracy of the application of SGTF/SGTL as a proxy for B.1.1.7 and of the estimated B.1.1.7 relative frequency in Portugal during the study period, recent sequencing of 495 randomly collected SARS-CoV-2 RT-PCR positive samples across the country during week 2 identified 14.5% of B.1.1.7 sequences [12].
As shown in Supplementary Figure S1, SGTF/SGTL cases are dispersed throughout mainland Portugal, indicating that the B.1.1.7 lineage is highly disseminated and there is active community transmission. A general lockdown was implemented in week 2 2021, raising expectations about the impact that the expected reduction in the number of COVID-19 cases would have on the increasing rate of the relative frequency of the B.1.1.7 lineage. After the increase in frequency observed during the last week of 2020 and the first 3 weeks of 2021, and upon the implementation of public health measures, we observed a decelerating trend, with the SGTF/SGTL proportion remaining below 50% in week 7 2021.
Spike gene target failure and spike gene target late detection samples are associated with higher viral loads
We also investigated RT-PCR Ct values in SGTF and SGTL vs non-SGTF/SGTL positive samples. Both SGTF and SGTL samples had significantly lower median Ct values of N and ORF1ab gene targets (ca 3.5 and 1.8 Ct units, respectively) compared with samples where S gene was unbiasedly detected (Figure 3). This observation not only corroborates previous findings that B.1.1.7 SGTF samples are more likely to present higher viral loads [2,7], but also supports our finding that SGTL might be an additional surrogate to identify Δ69–70-bearing variants.
Spike gene target failure/spike gene target late detection cases are not associated with a substantial shift in age composition
Our analysis of SGTF and SGTL in different age groups did not indicate a substantial shift in the age composition when comparing them with non-SGTF/SGTL cases. In fact, although statistically different (p < 0.001), the age distributions for cases in the SGTF/SGTL (median: 37; IQR: 22–50) and non-SGTF/SGTL (median: 39; IQR: 24–53) groups did not differ substantially (Figure 4). Also, for cases in both groups, the highest frequencies were observed for individuals aged 20 to 49 years.
Development of an interactive dashboard to aid public health decision-making
In order to facilitate real-time monitoring of SGTF/SGTL cases, an interactive dashboard and data cube, relying on Unilabs Intelli4Covid (BigData platform), was developed by Unilabs. The dashboard was shared with INSA to support timely public health decision-making by enabling early identification of geographical regions with an estimated increased incidence and circulation of lineage B.1.1.7. In fact, the release of data regarding the estimated proportion of B.1.1.7 cases up to week 2 2020 (and forecasted for the following 3 weeks) had an immediate impact, as it triggered the political decision to strengthen the ongoing confinement measures, namely through school closures in week 3 2021.
Conclusions
In the present study, we evaluated the SARS-CoV-2 B.1.1.7 lineage dissemination in Portugal between week 49 2020 and week 3 2021. In particular, we observed that SGTL can be a proxy to identify B.1.1.7 lineage, in addition to SGTF. By almost doubling each week, the estimated B.1.1.7 proportion reached ca 23% at week 3, and at this time it was forecasted to reach 65% by week 6. Physical distancing measures implemented in weeks 2 and 3 strongly decelerated the concerning growth rate, with the proportion of SGTF and SGTL remaining below 50% until week 7 2021. In our dataset, patients whose samples exhibited either SGTF or SGTL effect in the TaqPath COVID-19 test were more likely to have high viral loads at the time of sampling. Age distribution of SGTF/SGTL cases did not seem to indicate a substantial shift in the age composition, as compared to non-SGTF/SGTL cases.
Portugal faced a high intensity transmission of SARS-CoV-2, being among the countries with the highest 14-day notification rate of newly diagnosed COVID-19 cases per 100,000 inhabitants, as of week 4 2021. Our data suggests that the highly transmissible B.1.1.7 lineage is spreading widely and progressively increasing in frequency in Portugal. This reinforces the need to implement robust public health measures adapted to this new variant in order to mitigate the impact of COVID-19 in terms of hospitalisations and deaths.
Acknowledgements
This study is partially co-funded by Fundação para a Ciência e Tecnologia and Agência de Investigação Clínica e Inovação Biomédica (234_596874175) on behalf of the Research 4 COVID-19 call. Some infrastructural resources used in this study come from GenomePT project (POCI-01-0145-FEDER-022184), supported by COMPETE 2020 - Operational Programme for Competitiveness and Internationalisation (POCI), Lisboa Portugal Regional Operational Programme (Lisboa2020), Algarve Portugal Regional Operational Programme (CRESC Algarve2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF), and by Fundação para a Ciência e a Tecnologia (FCT).
Supplementary Data
Conflict of interest: CS, LM, AMG, MP, JPA, MV, RS, ARS, MC and LC are employees of the private laboratory Unilabs as technicians, bioinformatics or executive manager (LM) and do not have any influence on the decisions made by the health authorities or government.
Authors’ contributions: CS, LM, AMG, MP, JPA, MV, RS, ARS, MC and LC were involved in the routine diagnosis of SARS-CoV-2 infection at Unilabs laboratory, data collection and interpretation, dashboard design and implementation. PC, PP and AP supported the epidemiological investigation. VB, JI, SD and LV were involved in wet-lab sequencing procedures and/or bioinformatic analysis. RG supported the laboratory surveillance. SS and BN performed statistical analysis and data interpretation. VB, CS, JPA, BN and JPG conceived the study. VB, JI and JPG were the main contributors for manuscript writing. All authors performed data analysis and interpretation and revised the manuscript.
References
- 1.Public Health England (PHE). Investigation of novel SARS-COV-2 variant: Variant of Concern 202012/01: Technical briefing document on novel SARS-CoV-2 variant. London: PHE; 2020. Available from: https://www.gov.uk/government/publications/investigation-of-novel-sars-cov-2-variant-variant-of-concern-20201201
- 2.European Centre for Disease Prevention and Control (ECDC). Risk related to spread of new SARS-CoV-2 variants of concern in the EU/EEA – 29 December 2020. ECDC: Stockholm; 2020. Available from: https://www.ecdc.europa.eu/sites/default/files/documents/COVID-19-risk-related-to-spread-of-new-SARS-CoV-2-variants-EU-EEA.pdf
- 3.Volz E, Mishra S, Chand M, Barrett JC, Johnson R, Geidelberg L, et al. Transmission of SARS-CoV-2 Lineage B. 1.1. 7 in England: Insights from linking epidemiological and genetic data; medRxiv 2021. 10.1101/2020.12.30.20249034 [DOI]
- 4. Galloway SE, Paul P, MacCannell DR, Johansson MA, Brooks JT, MacNeil A, et al. Emergence of SARS-CoV-2 B.1.1.7 Lineage - United States, December 29, 2020-January 12, 2021. MMWR Morb Mortal Wkly Rep. 2021;70(3):95-9. 10.15585/mmwr.mm7003e2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rambaut A, Loman N, Pybus O, Barclay W, Barrett J, Carabelli A, et al. Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations. Virological.org; 2020. Available from: https://virological.org/t/preliminary-genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-the-uk-defined-by-a-novel-set-of-spike-mutations/563
- 6. Kemp SA, Datir RP, Collier DA, Ferreira IATM, Carabelli A, Harvey W, et al. Recurrent emergence and transmission of a SARS-CoV-2 Spike deletion ΔH69/V70. bioRxiv. 2020;. 10.1101/2020.12.14.422555 [DOI] [Google Scholar]
- 7.Kidd M, Richter A, Best A, Mirza J, Percival B, Mayhew M, et al. S-variant SARS-CoV-2 is associated with significantly higher viral loads in samples tested by ThermoFisher TaqPath RT-QPCR; medRxiv 2020. 10.1101/2020.11.10.20228528 [DOI] [PMC free article] [PubMed]
- 8.Gravagnuolo AM, Faqih L, Cronshaw C, Wynn J, Burglin, Klapper P, Wigglesworth M. Epidemiological Investigation of New SARS-CoV-2 Variant of Concern 202012/01 in England; medRxiv 2021. 10.1101/2021.01.14.21249386 [DOI] [PMC free article] [PubMed]
- 9.Public Health England (PHE). Investigation of novel SARS-COV-2 variant. Variant of Concern 202012/01: Technical briefing 2. London: PHE; 2020. Available from: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/949639/Technical_Briefing_VOC202012-2_Briefing_2_FINAL.pdf
- 10.Bal A, Destras G, Gaymard A, Regue H, Semanas Q, d’Aubarede C, et al. Two-step strategy for the identification of SARS-CoV-2 variants co-occurring with spike deletion H69-V70, Lyon, France, August to December 2020; medRxiv 2020. 10.1101/2020.11.10.20228528 [DOI] [PMC free article] [PubMed]
- 11.O’Toole Á, Hill V, Pybus OG, Watts A, Bogoch II, Khan K, et al. Tracking the international spread of SARS-CoV-2 lineages B.1.1.7 and B.1.351/501Y-V2. Virological.org; 2021. Available from: https://virological.org/t/tracking-the-international-spread-of-sars-cov-2-lineages-b-1-1-7-and-b-1-351-501y-v2/592/1 [DOI] [PMC free article] [PubMed]
- 12.National Institute of Health (INSA) Dr. Ricardo Jorge. Genetic diversity of the novel coronavirus SARS-CoV-2 (COVID-19) in Portugal. Lisbon: INSA; 2021. [Accessed 5 Feb 2021]. Available from: https://insaflu.insa.pt/covid19
- 13.Public Health England (PHE). Investigation of novel SARS-COV-2 variant. Variant of Concern 202012/01: Technical briefing 5. London: PHE; 2021. Available from: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/957504/Variant_of_Concern_VOC_202012_01_Technical_Briefing_5_England.pdf
- 14.Staten Serum Institut (SSI). Status for udvikling af SARS-CoV-2 Variants of Concern (VOC) i Danmark. [Status of development of SARS-CoV-2 Variants of Concern (VOC) in Denmark]. Copenhagen: SSI; 2020. Danish. Available from: https://files.ssi.dk/covid19/virusvarianter/status/status-virusvarianter-10022021-2q3w
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.