Microbiology Resource Announcements. 2021 Feb 25;10(8):e00100-21. doi: 10.1128/MRA.00100-21

Genome Sequence of a SARS-CoV-2 Strain from Bangladesh That Is Nearly Identical to United Kingdom SARS-CoV-2 Variant B.1.1.7

Mohammad Enayet Hossain a, M Mahfuzur Rahman a, M Shaheen Alam a, Yeasir Karim a, Ananya Ferdous Hoque a, Sezanur Rahman a, Mohammed Ziaur Rahman a, Mustafizur Rahman a
Editor: Simon Roux b
PMCID: PMC7909083  PMID: 33632858

ABSTRACT

The coding-complete genome sequence of a coronavirus strain, SARS-CoV-2/human/BGD/G039392/2021, obtained from a symptomatic male patient with coronavirus disease 2019 (COVID-19) in Dhaka, Bangladesh, is reported. The strain G039392 is 99.9% identical to the UK variant B.1.1.7.

ANNOUNCEMENT

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a positive-sense single-stranded RNA virus from the Betacoronavirus genus of the broad family Coronaviridae. In Bangladesh, SARS-CoV-2, the causative agent of the coronavirus disease 2019 (COVID-19) pandemic, was first reported on 8 March 2020. Here, we report the complete sequence of SARS-CoV-2 strain G039392, which was identified on 6 January 2021 from a 50-year-old symptomatic male patient in Dhaka, Bangladesh, and is 99.9% identical to the UK variant B.1.1.7.

As part of the countrywide COVID-19 laboratory network, the International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b), in collaboration with the Government of Bangladesh, has been testing for SARS-CoV-2 since March 2020. Following the recent emergence of novel variants in the United Kingdom, South Africa, and Brazil (1), we began monitoring SARS-CoV-2 variants. Between 15 December 2020 and 21 January 2021, a total of 5,250 nasopharyngeal swab samples were screened for SARS-CoV-2 by real-time reverse transcriptase PCR using RdRp (open reading frame 1ab [ORF1ab]) and N gene-specific primers and probes (2). The iTaq universal probes one-step kit (Bio-Rad Laboratories, CA, USA) was used in the Bio-Rad CFX96 Touch real-time PCR system. Threshold cycle (CT) values of ≤37 were considered positive. Of 988 positive samples, 191 with CT values of ≤25 were selected for variant surveillance by amplification and Sanger sequencing of the spike protein gene. The spike gene sequence data showed that the spike protein of one strain was nearly identical to that of the UK variant, and this strain was selected for complete genome sequencing.

In brief, total RNA was extracted from nasopharyngeal swab samples using the chemagic viral NA/gDNA kit (PerkinElmer, MA, USA), and reverse transcription was performed using the high-capacity cDNA reverse transcription kit (Thermo Fisher Scientific, CA, USA) following the manufacturer’s protocol. Ninety-eight gene segments were amplified by GoTaq G2 Hot Start Taq polymerase (Promega Corp., WI, USA) with specific primer sets (3) to cover the whole genome. All PCR products were analyzed by agarose gel electrophoresis with SYBR Safe staining (Thermo Fisher Scientific) and purified using the ExoSAP‐IT kit (Affymetrix, OH, USA). Sequencing was performed in an Applied Biosystems 3500XL genetic analyzer using the BigDye Terminator v3.1 cycle sequencing ready reaction kit (PerkinElmer). The chromatogram sequencing files were inspected using Chromas v2.23 (Technelysium, QLD, Australia), and the consensus sequences were prepared using SeqMan II (DNASTAR, WI, USA). Multiple sequence alignment was performed using the BioEdit v7.2 program (4).
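
The screening cutoffs described above can be sketched in a few lines of Python. This is a minimal illustration and not the authors' workflow: the function name `classify_sample`, the use of `None` for a sample with no amplification, and the example CT values are all hypothetical. Only the two cutoffs (CT ≤37 for a positive result, CT ≤25 for variant surveillance) and the sample counts are taken from the text.

```python
from typing import Optional

POSITIVE_CT_CUTOFF = 37.0    # CT <= 37 was considered positive (from the text)
SEQUENCING_CT_CUTOFF = 25.0  # CT <= 25 was required for variant surveillance (from the text)


def classify_sample(ct: Optional[float]) -> str:
    """Classify one swab by CT value; None means no amplification (hypothetical convention)."""
    if ct is None or ct > POSITIVE_CT_CUTOFF:
        return "negative"
    if ct <= SEQUENCING_CT_CUTOFF:
        return "positive, eligible for sequencing"
    return "positive"


# Hypothetical CT values, for illustration only.
example_cts = [18.4, 24.9, 31.2, 37.0, 38.5, None]
labels = [classify_sample(ct) for ct in example_cts]

# Counts reported in the text.
screened, positive, sequenced = 5250, 988, 191
positivity_pct = 100 * positive / screened   # share of screened samples that were positive
sequenced_pct = 100 * sequenced / positive   # share of positives selected for spike sequencing

print(labels)
print(f"positivity: {positivity_pct:.1f}%, sequenced: {sequenced_pct:.1f}% of positives")
```

With the reported counts, this works out to roughly 18.8% positivity, with about 19.3% of positive samples going on to spike gene sequencing.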

The assembled SARS-CoV-2/human/BGD/G039392/2021 viral genome consists of 29,796 nucleotides (GC content, 37.98%) with 99.9% coverage, compared with reference strain Wuhan-Hu-1 (GenBank accession number NC_045512). The genome was also compared with UK variant VOC-202012/01 (GISAID accession number EPI_ISL_601443), which indicated that Bangladeshi strain G039392 had all 23 mutations that were identified in UK variant B.1.1.7 (Table 1). In addition, G039392 contained two nonsynonymous mutations (P78S and K460R) in the helicase protein and one nucleotide deletion (A28274^ [with ^ denoting a deletion]) at the start position of the N gene that were absent in the UK variant. However, the deletion (A28274^) might have no impact on translation of the N gene, because three adenine residues immediately precede this position (AAA starting at position 28271), so one of them can take the place of the deleted base and complete the start codon (ATG).
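
The reasoning about the A28274^ deletion can be checked with a small Python sketch. The sequence below is a hypothetical toy example chosen to have three adenines directly upstream of an ATG; it is not taken from the deposited genome, and `delete_base` is an illustrative helper. The sketch shows that deleting any single adenine from such a run produces the same sequence and leaves an intact ATG followed by the same downstream codons.

```python
def delete_base(seq: str, index: int) -> str:
    """Return seq with the single base at 0-based index removed."""
    return seq[:index] + seq[index + 1:]


# Hypothetical toy sequence: three adenines immediately upstream of an ATG start codon.
toy = "CTAAAATGTCTGAT"
start = toy.find("ATG")   # 0-based index of the A in the start codon
run_start = start - 3     # first adenine of the upstream AAA

# Delete each adenine of the four-base run (AAA plus the A of ATG) in turn.
variants = {delete_base(toy, i) for i in range(run_start, start + 1)}

# Every deletion gives the same sequence, so the set has one member.
only = next(iter(variants))
new_start = only.find("ATG")

print(variants)
print(only[new_start:] == toy[start:])  # coding sequence from ATG onward is unchanged
```

Because the four deletions are indistinguishable at the sequence level, the position assigned to a deletion inside a homopolymer run is a matter of convention.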

TABLE 1.

Comparison of mutations between the UK and Bangladeshi strains

| Gene segment (position) and nucleotide (a) | Amino acid (a) | UK strain (b) | Bangladeshi strain (b) |
|---|---|---|---|
| 5′ untranslated region | | | |
| C241T | | Y | Y |
| ORF1a (266 to 13483) | | | |
| C913T | | Y | Y |
| C3037T | | Y | Y |
| C3267T | T1001I | Y | Y |
| T5266A | | N | Y |
| C5388A | A1708D | Y | Y |
| C5986T | | Y | Y |
| T6954C | I2230T | Y | Y |
| TCTGGTTTT11288^ | SGF3675^ | Y | Y |
| RdRp (13442 to 13468, 13468 to 16236) | | | |
| C14408T | P323L | Y | Y |
| C14676T | | Y | Y |
| C15279T | | Y | Y |
| T16176C | | Y | Y |
| Helicase (16237 to 18039) | | | |
| C16468T | P78S | N | Y |
| A17615G | K460R | N | Y |
| S (21,563 to 25,384) | | | |
| ACATGT21765^ | HV69^ | Y | Y |
| TTA21991^ | Y144^ | Y | Y |
| A23063T | N501Y | Y | Y |
| C23271A | A570D | Y | Y |
| A23403G | D614G | Y | Y |
| C23604A | P681H | Y | Y |
| C23709T | T716I | Y | Y |
| T24506G | S982A | Y | Y |
| G24914C | D1118H | Y | Y |
| ORF8 (27,894 to 28,259) | | | |
| C27972T | Q27* | Y | Y |
| G28048T | R52I | Y | Y |
| A28111G | Y73C | Y | Y |
| N (28,274 to 29,533) | | | |
| A28274^ | | N | Y |
| GAT28280CAT | D3L | Y | Y |
| GGG28881AAC | RG203KR | Y | Y |
| T28924G | | N | Y |
| C28977T | S235F | Y | Y |

(a) The positions and changes are indicated by comparison with the Wuhan-Hu-1 strain. ^, deletion; *, stop codon; −, synonymous mutation.

(b) Y, present; N, absent.
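
The nucleotide labels in Table 1 follow a simple pattern: reference base(s), position, then either the new base(s) or ^ for a deletion. The Python sketch below parses that pattern and lists the changes found only in the Bangladeshi strain. It is an illustration under stated assumptions: `Mutation`, `parse_mutation`, and the regular expression are hypothetical names introduced here, and only a subset of the table rows is included.

```python
import re
from typing import NamedTuple, Optional

# <reference bases><position><new bases, or ^ for a deletion>
MUTATION_RE = re.compile(r"^([ACGT]+)(\d+)([ACGT]+|\^)$")


class Mutation(NamedTuple):
    ref: str            # reference base(s) in Wuhan-Hu-1
    position: int       # position as written in the label
    alt: Optional[str]  # replacement base(s), or None for a deletion


def parse_mutation(label: str) -> Mutation:
    """Parse a Table 1 style label such as 'C241T' or 'TTA21991^'."""
    match = MUTATION_RE.match(label)
    if match is None:
        raise ValueError(f"unrecognized mutation label: {label!r}")
    ref, pos, alt = match.groups()
    return Mutation(ref, int(pos), None if alt == "^" else alt)


# A subset of Table 1 rows: label -> (present in UK strain, present in Bangladeshi strain)
rows = {
    "C241T": (True, True),
    "T5266A": (False, True),
    "TCTGGTTTT11288^": (True, True),
    "C16468T": (False, True),
    "A17615G": (False, True),
    "A23063T": (True, True),
    "A28274^": (False, True),
    "GGG28881AAC": (True, True),
}

# Keep only the changes marked N for the UK strain and Y for the Bangladeshi strain.
bangladesh_only = [parse_mutation(m) for m, (uk, bgd) in rows.items() if bgd and not uk]
for mut in bangladesh_only:
    kind = "deletion" if mut.alt is None else "substitution"
    print(mut.position, kind, len(mut.ref), "nt")
```

Storing the table as label-to-presence pairs keeps the comparison a one-line filter, and the same parser handles single-base changes, multi-base substitutions, and deletions.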

The availability of genomic data for the circulating SARS-CoV-2 isolates from different parts of the world will serve as a valuable resource for monitoring emerging new variants.

Data availability.

The sequence has been deposited in the GenBank database under the accession number MW531680. The GISAID EpiCoV coronavirus SARS-CoV-2 database (www.gisaid.org) accession number for the sequence is EPI_ISL_890237.

ACKNOWLEDGMENTS

The SARS-CoV-2 variant monitoring is a part of the “COVID-19 testing and tracing in Bangladesh” study funded by the Bill and Melinda Gates Foundation (investment INV-017556) and was approved by the icddr,b institutional review board. The icddr,b acknowledges with gratitude the commitment of the Bill and Melinda Gates Foundation to its research efforts. The icddr,b is also grateful to the governments of Bangladesh, Canada, Sweden, and the United Kingdom for providing core/unrestricted support.

REFERENCES

1. Galloway SE, Paul P, MacCannell DR, Johansson MA, Brooks JT, MacNeil A, Slayton RB, Tong S, Silk BJ, Armstrong GL, Biggerstaff M, Dugan VG. 2021. Emergence of SARS-CoV-2 B.1.1.7 lineage: United States, December 29, 2020–January 12, 2021. MMWR Morb Mortal Wkly Rep 70:95–99. doi: 10.15585/mmwr.mm7003e2.
2. Chinese Center for Disease Control and Prevention. 2020. Laboratory testing for COVID-19. Chinese Center for Disease Control and Prevention, Beijing, China. http://www.chinacdc.cn/en/COVID19/202003/P020200323390321297894.pdf.
3. Deng X, Gu W, Federman S, Du Plessis L, Pybus OG, Faria NR, Wang C, Yu G, Bushnell B, Pan C-Y, Guevara H, Sotomayor-Gonzalez A, Zorn K, Gopez A, Servellita V, Hsu E, Miller S, Bedford T, Greninger AL, Roychoudhury P, Starita LM, Famulare M, Chu HY, Shendure J, Jerome KR, Anderson C, Gangavarapu K, Zeller M, Spencer E, Andersen KG, MacCannell D, Paden CR, Li Y, Zhang J, Tong S, Armstrong G, Morrow S, Willis M, Matyas BT, Mase S, Kasirye O, Park M, Masinde G, Chan C, Yu AT, Chai SJ, Villarino E, Bonin B, Wadford DA, Chiu CY. 2020. Genomic surveillance reveals multiple introductions of SARS-CoV-2 into northern California. Science 369:582–587. doi: 10.1126/science.abb9263.
4. Hall TA. 1999. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser 41:95–98.
