Abstract
Recombinant sequences of the SARS-CoV-2 Omicron variant were detected in surveillance samples collected in north-western Finland in January 2022. We detected 191 samples with an identical genome arrangement in weeks 3 to 11, indicating sustained community transmission. The recombinant lineage has a 5’-end of BA.1, a recombination breakpoint between orf1a and orf1b (nucleotide position 13,296–15,240) and a 3’-end of BA.2 including the S gene. We describe the available genomic and epidemiological data about this currently circulating recombinant XJ lineage.
Keywords: SARS-CoV-2, variant, recombinant, omicron, XJ-lineage
Ten suspected recombinant sequences were initially detected by the national genomic surveillance of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in Finland in surveillance samples collected in January 2022 (weeks 3 and 4) in the Länsi-Pohja hospital district in north-western Finland. The Länsi-Pohja hospital district serves six small municipalities with a total population of 59,000, located near the border to Sweden. As inhabitants of the border municipalities frequently move between the two countries, there was a risk of cross-border spread. Intensified surveillance revealed at least 191 domestic cases, and database search retrieved five international cases by mid-March 2022 (week 11), indicating sustained community transmission and leading to Pango designation (Phylogenetic Assignment of Named Global Outbreak) of the XJ recombinant lineage.
We provide here a description of the confirmation and genomic characterisation of the novel XJ recombinant, describe the early cases and transmission events and discuss the available epidemiological data.
Genomic investigation
Recombinant SARS-CoV-2 genome sequences were observed in independent Illumina sequencing runs of surveillance samples in 2022 from week 3 onwards (sequencing and sequence analyses were conducted as described in [1-3]). Sequence analysis suggested a recombination breakpoint between nucleotide positions 13,296 and 15,240. The recombination was confirmed by long-read sequencing (Oxford Nanopore Technology) of two PCR amplicons spanning the recombination breakpoint (Figure 1).
Figure 1.
Schematic representation of the SARS-CoV-2 BA.1/BA.2 recombinant genome arrangement
SARS-CoV-2: severe acute respiratory syndrome coronavirus 2.
The recombinant genomes contain all lineage-defining mutations of BA.1 until the synonymous T13195C mutation at the 3’end of the ORF1a coding region (blue), while they lack the BA.1-specific C15240T mutation in ORF1b coding region, and contain all BA.2 lineage-defining mutations starting from C15714T in ORF1b (red), suggesting a recombination breakpoint between nucleotides 13,195 and 15,240. The recombination was confirmed by long-read nanopore sequencing of two PCR amplicons spanning the recombination breakpoint (forward primer 12793-GGAGGTAGGTTTGTACTTGCACTG-12817 and either one of two reverse primers 15558-TGCATTAACATTGGCCGTGACA-15536 or 15960-GGCCCCTAGGATTCTTGATGGA-15938). The sequencing library was prepared using the Nanopore SQK-LSK110 kit and the sequencing was carried out using a MinION R9.4.1 flowcell.

Ninety-six of 106 XJ recombinants with nearly complete genome sequences (fewer than 1,000 ambiguous nucleotides) were included in the phylogenetic analysis. The analysis suggested that 33 of the 96 Finnish lineage XJ genomes, including the earliest sequences, were identical to each other, while the sequences sampled later tended to have more diversity (Figure 2).
Figure 2.
Phylogenetic trees of all SARS-CoV-2 lineage XJ recombinant genomes that had fewer than 1,000 ambiguous nucleotides, Finland, January–February 2022 (n = 96)
SARS-CoV-2: severe acute respiratory syndrome coronavirus 2.
The genomes were aligned using Nextalign [11] and the tree was inferred using maximum likelihood method implemented in MEGA6 software. Clade-specific mutations are shown in each branch. Sequences detected outside Finland are marked with filled circles.
In order to identify the potential parental sequences for the recombinant genomes, we compared the genome regions flanking the recombination breakpoint with all 2,338 BA.1 and BA.2 sequences available in the GISAID database collected until 31 Jan 2022 from Finland. We detected 23 BA.1.17 and two BA.2 sequences identical to, respectively, the 5’ and 3’ ends of the recombinant genome. The BA.1.17 sequences had two signature mutations, C3241T and G5924A (ORF1a:V1887I), while the BA.2 sequences had one, C27945T (ORF8:Q18*), introducing a premature stop codon and resulting in a truncated ORF8 protein. Notably, both BA.2 and 11 of the 23 BA.1.17 sequences observed in the national genomic surveillance originated from the relatively small Länsi-Pohja hospital district, representing two of the nine BA.2 and 11 of the 571 BA.1 samples collected in the area during weeks 1 to 4 in 2022 (Figure 3). Using BLAST, we detected identical sequences for the 5’ and the 3’ ends flanking the recombination breakpoint in globally sequenced samples.
Figure 3.
Potential parental genomes of SARS-CoV-2 lineage XJ recombinants, Länsi-Pohja district, Finland, January–February 2022 (n = 19)
SARS-CoV-2: severe acute respiratory syndrome coronavirus 2.
The genomes were aligned using Nextalign [11] and the tree was inferred using maximum likelihood method implemented in MEGA6 software. The potential parental genomes are shown in blue for BA.1.17 and red for BA.2. For the sake of clarity, only the earliest sequences of the recombinant lineage XJ (black) and those of the potential parents (lineage BA.1.17 (blue) and BA.2 (red)) detected in Länsi-Pohja hospital district were included in the analysis.
Identification of early cases and transmission events
We traced the first 10 cases of infection with SARS-CoV-2 lineage XJ detected in the country in weeks 3 and 4 of 2022 and established epidemiological links between them. Five cases worked or studied at the same educational unit. Two cases lived in the same apartment building, one of whom attended a kindergarten that was later linked to two family clusters. The sampling date of the first cases fell within a range of four days in January 2022, therefore none of the identified cases were considered as an index case.
The median age of all 191 confirmed cases with XJ detected in Finland between week 3 and 11 was 42 years (range: 0–93). Data on COVID-19 vaccination status and clinical data were available for 43 confirmed cases with XJ. Of these, 35 had received at least two doses of COVID-19 vaccine, with the latest dose administered within 6 months before confirmed infection. Twenty-seven had received Comirnaty (BioNTech-Pfizer), three Spikevax (Moderna) and five had received a combination of Comirnaty and Spikevax, Comirnaty and Vaxzevria (Astra Zeneca) or Spikevax and Vaxzevria. No remarkable differences in the clinical course of disease have been identified so far. One death was recorded in an elderly individual with underlying disease.
Distribution of the recombinant
In response to the initial identification of the recombinant sequences from week 3/2022 onwards, and because the tracing of early cases and transmission events suggested community transmission, genomic surveillance of variants was intensified in the affected area in week 7. The number of sequenced samples was increased to detect as many cases as possible and to monitor changes in the prevalence of the XJ variant with increased sensitivity [4]. A large number of recombinant sequences in the Länsi-Pohja hospital district was observed in week 7 and onwards. Forty-eight cases were detected in week 7, representing 39% of the SARS-CoV-2 samples sequenced in that week (Figure 4). By week 11, 191 recombinant cases were confirmed through sequencing, the great majority of which were from Länsi-Pohja hospital district. Single cases were subsequently also detected in surveillance samples from several cities across the country including Helsinki, the capital. Based on sequences available in the GISAID database, XJ sequences were further identified in Sweden (n = 2), the United Kingdom (n = 2), France (n = 1) and Israel (n = 1) (Figure 2). A sample collection date as early as week 3/2022 was recorded for a sample taken in a Swedish town bordering Länsi-Pohja, indicating that cross-border spread may have occurred at an early stage.
Figure 4.
SARS-CoV-2 variant distribution, Länsi-Pohja hospital district, Finland, week 1–11/2022 (n = 191 XJ among 1,094 sequences)
SARS-CoV-2: severe acute respiratory syndrome coronavirus 2.

Discussion
Rapid changes in the global SARS-CoV-2 variant composition have occurred several times during the coronavirus disease (COVID-19) pandemic since its first emergence in the end of 2019 [5,6]. The SARS-CoV-2 variant Omicron (Pango lineage designation B.1.1.529) emerged in late 2021, displacing the dominant Delta variant (B.1.617.2) in most parts of the world within a few weeks [7,8]. Recombination is a widespread evolutionary mechanism among viruses, including coronaviruses, giving rise to genomic diversity independent of mutations. It has been associated with diverse phenotypical changes including expansion of the viral host range, reviewed in [9]. With several coronavirus lineages circulating in the population, recombinants are likely to emerge and are now increasingly reported from different countries [10].
The national genomic surveillance of SARS-CoV-2 in Finland was initiated jointly by the National Institute for Health and Welfare (THL), the University of Helsinki and the Institute for Molecular Medicine Finland (FIMM) in the end of 2020 to support the timely detection of circulating variants and the global effort to monitor and assess the evolution of SARS-CoV-2. In addition, three other clinical laboratories, TYKS Laboratoriot, FIMLAB and Vita Laboratoriot, participate in the national genomic surveillance of SARS-CoV-2 in Finland. The XJ recombinant described here was detected through the national surveillance. It has arisen from BA.1 and BA.2 parental lineages and harbours the S gene of the widely circulating BA.2 lineage without any novel mutations. Since no changes to the receptor binding region, recognised antibody binding sites or sites involved in the proteolytic cleavage of the spike protein have been introduced in the XJ recombinant, it is plausible that it will display phenotypical properties similar to the parental lineages. Little is known about the ORF8:Q18* mutation found in the XJ lineage, its impact on infection remains to be elucidated. Further studies addressing transmissibility, disease severity and immune evasion are needed to assess the public health impact that the recombinant may have.
We did not identify an index case, leaving the origin of the XJ recombinant unresolved. The presence of both of the potential parental lineages in the geographical region where the XJ lineage was first detected, suggests that the recombination event may have occurred in the Länsi-Pohja region or in its proximity. However, since genomes identical to these potential parental sequences have also been detected elsewhere, we cannot exclude the possibility of introduction from another country.
Conclusion
The recombinant XJ virus has established continuous community transmission in north-western Finland and has dispersed to other parts of Finland and abroad. Global genomic variant surveillance plays a pivotal role in timely tracking of variants and for our increased understanding of the evolutionary dynamics of the pandemic SARS-CoV-2 virus.
Sequence data accessibility
GISAID accession numbers for the first eight cases: EPI_ISL_10266425, EPI_ISL_10266426, EPI_ISL_10266423, EPI_ISL_10266424, EPI_ISL_10266429, EPI_ISL_10266427, EPI_ISL_10266428, EPI_ISL_10266422.
The raw sequence data (Illumina) are available at: https://www.ncbi.nlm.nih.gov/bioproject/825706
Ethical statement
Ethical approval was not required for this study because no identifiable patient data were described.
Acknowledgements
We would like to acknowledge the contribution of the Institute for Molecular Medicine Finland (FIMM), a fundamental partner in the joint national genomic surveillance effort of SARS-CoV-2 in Finland. We further acknowledge CSC-IT Center for Science, Finland, for providing computational resources. Sequencing were performed by FIMM Genomics NGS Sequencing unit at University of Helsinki supported by HiLIFE and Biocenter Finland.
Funding statement
This study was supported by the Academy of Finland (grant number 336490, 339510), VEO - European Union’s Horizon 2020 (grant number 874735), Jane and Aatos Erkko Foundation and Helsinki University Hospital Funds (TYH2021343).
Conflict of interest: None declared.
Authors’ contributions: EL, TS, OV, CSK and NI designed the study and drafted the manuscript. TS and HV were responsible for sequencing and sequence analyses, EL, SB and KL participated in variant analyses. SB, KL, LS, JI, JR, JT, TT and PS participated in data collection while EL, SB, KL and NI analysed the data. All authors participated in the interpretation of the data and reviewed the manuscript.
References
- 1. Kant R, Nguyen PT, Blomqvist S, Erdin M, Alburkat H, Suvanto M, et al. Incidence trends for SARS-CoV-2 Alpha and Beta variants, Finland, Spring 2021. Emerg Infect Dis. 2021;27(12):3137-41. 10.3201/eid2712.211631 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Truong Nguyen PT, Plyusnin I, Sironen T, Vapalahti O, Kant R, Smura T. HAVoC, a bioinformatic pipeline for reference-based consensus assembly and lineage assignment for SARS-CoV-2 sequences. BMC Bioinformatics. 2021;22(1):373. 10.1186/s12859-021-04294-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Vauhkonen H, Truong P, Kant R, Plyusnin I, Erdin M, Kurkela S, et al. Introduction and rapid spread of SARS-CoV-2 Omicron variant and the dynamics of its sub-lineages BA.1 and BA.1.1, December 2021, Finland, 23 March 2022. PREPRINT (Version 1). Research Square. Available from: 10.21203/rs.3.rs-1480433/v1 10.21203/rs.3.rs-1480433/v1 [DOI]
- 4.European Centre for Disease Prevention and Control (ECDC). Guidance for representative and targeted genomic SARS-CoV-2 monitoring . ECDC: Stockholm; 3 May 2021. Available from: https://www.ecdc.europa.eu/en/publications-data/guidance-representative-and-targeted-genomic-sars-cov-2-monitoring
- 5.World Health Organization (WHO). COVID-19 – China. Disease outbreak news. Geneva: WHO; 12 January 2020. Available from: https://www.who.int/emergencies/disease-outbreak-news/item/2020-DON233
- 6.World Health Organization (WHO). Tracking SARS-CoV-2 variants. Geneva: WHO. [Accessed 9.2.20223]. Available from: https://www.who.int/en/activities/tracking-SARS-CoV-2-variants
- 7. Viana R, Moyo S, Amoako DG, Tegally H, Scheepers C, Althaus CL, et al. Rapid epidemic expansion of the SARS-CoV-2 Omicron variant in southern Africa. Nature. 2022;603(7902):679-86. 10.1038/s41586-022-04411-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.World Health Organization (WHO). Weekly epidemiological update on COVID-19. Geneva: WHO: 11 January 2022. Available from: https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---11-january-2022
- 9. Pérez-Losada M, Arenas M, Galán JC, Palero F, González-Candelas F. Recombination in viruses: mechanisms, methods of study, and evolutionary consequences. Infect Genet Evol. 2015;30:296-307. 10.1016/j.meegid.2014.12.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.GitHub. Potential BA.1/BA.2 recombinant lineage in Finland [66 seq as of 2022-03-25]. [Accessed 8.4.2022]. Available from: https://github.com/cov-lineages/pango-designation/issues/449
- 11. Aksamentov I, Roemer C, Hodcroft EB, Neher RA. Nextclade: clade assignment, mutation calling and quality control for viral genomes. J Open Source Softw. 2021;6(67):3773. 10.21105/joss.03773 [DOI] [Google Scholar]


