Genomic characterization of a novel SARS-CoV-2

Rozhgar A Khailany; Muhamad Safdar; Mehmet Ozaslan

doi:10.1016/j.genrep.2020.100682

. 2020 Apr 16;19:100682. doi: 10.1016/j.genrep.2020.100682

Genomic characterization of a novel SARS-CoV-2

Rozhgar A Khailany ^a,^⁎, Muhamad Safdar ^b, Mehmet Ozaslan ^c

PMCID: PMC7161481 PMID: 32300673

Abstract

A new severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) associated with human to human transmission and extreme human sickness has been as of late announced from the city of Wuhan in China. Our objectives were to mutation analysis between recently reported genomes at various times and locations and to characterize the genomic structure of SARS-CoV-2 using bioinformatics programs. Information on the variation of viruses is of considerable medical and biological impacts on the prevention, diagnosis, and therapy of infectious diseases. To understand the genomic structure and variations of the SARS-CoV-2. The study analyzed 95 SARS-CoV-2 complete genome sequences available in GenBank, National MicrobiologyData Center (NMDC) and NGDC Genome Warehouse from December-2019 until 05 of April-2020. The genomic signature analysis demonstrates that a strong association between the time of sample collection, location of sample and accumulation of genetic diversity. We found 116 mutations, the three most common mutations were 8782C>T in ORF1ab gene, 28144T>C in ORF8 gene and 29095C>T in the N gene. The mutations might affect the severity and spread of the SARS-CoV-2. The finding heavily supports an intense requirement for additional prompt, inclusive investigations that combine genomic detail, epidemiological information and graph records of the clinical features of patients with COVID-19.

Keywords: SARS-CoV-2, Genomic characterization, Mutation, COVID-19

Abbreviations: SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; COVID-19, Coronavirus disease 2019; NGDC, National Genomics Data Center; NMDC, National Microbiology Data Center; WHO, World Health Organization; EMBOSS, The European Molecular Biology Open Software Suite; BLAST, Basic Local Alignment Search Tool; UTR, Untranslated region; CDC, Centers of Disease Control and Prevention; MERS, Middle East Respiratory Syndrome; NCBI, National Center for Biotechnology Information; NSP, nonstructural protein; ORF, Open Reading Frame

1. Introduction

The current outbreak of coronavirus disease (COVID-19) that was first reported from Wuhan, China, in December 2019. This epidemic had spread to 206 countries and territories around the world and 2 international conveyances with 1,203,459 confirmed cases, including 64,754 deaths, as of April 05, 2020, so the World Health Organization declared it as a Public Health Emergency of worldwide (https://www.worldometers.info/coronavirus/). Similarly, Middle East respiratory syndrome coronavirus (MERS-CoV) had become a worldwide health concern. MERS-CoV originally reported in 2012 (De Wit et al., 2016). It affected more than 2000 people in 27 countries and 4 sub-continents in the Middle East. While the epidemic of SARS affected 26 countries and resulted in more than 8000 cases in 2003 (De Wit et al., 2016). Since then, a small number of cases have occurred as a result of laboratory accidents or, possibly, through animal-to-human transmission (Guangdong, China) (De Wit et al., 2016).

This study analyzed and discussed available published genome until April 05, 2020, for a better understanding of the genomic variation and characterization of a novel coronavirus (COVID-19). This virus is transmitted from person to person via droplet transmission (Li et al., 2020; Ozaslan et al., 2020). Therefore, the virus is spreading easily in overcrowded areas. Most patients experience only mild to moderate symptoms, such as high body temperature in conjunction with some respiratory symptoms such as cough, sore throat, and headache. Some people may have severe symptoms like pneumonia and acute respiratory distress syndrome (Chen et al., 2020). Also, individuals with underlying complications such as heart disease, chronic lung disease, or diabetes potentially display more severe symptoms (Adhikari et al., 2020). Preventive measures such as masks, frequent hand washing, staying home when sick, avoid public contact, and quarantines are being recommended for reducing the transmission. To date, no specific antiviral treatment is proven effective, hence, infected people initially rely on symptomatic treatments that showed encouraging profile for blocking the new coronavirus in early clinical trials.

Importantly, the genome size of the SARS-CoV-2 varies from 29.8 kb to 29.9 kb and its genome structure followed the specific gene characteristics to known CoVs; the 5′ more than two-thirds of the genome comprises orf1ab encoding orf1abpolyproteins, while the 3′ one third consists of genes encoding structural proteins including surface (S), envelope (E), membrane (M), and nucleocapsid N proteins (Fig. 1 ). Additionally, the SARS-CoV-2 contains 6 accessory proteins, encoded by ORF3a, ORF6, ORF7a, ORF7b, and ORF8 genes (Fig. 1) (Li et al., 2005; Oostra et al., 2007).

Fig. 1 — Structure of the SARS-CoV-2 genome.

Recently, the development of high-throughput sequencing has provided datasets of high-quality, complete genome sequences for viral isolates collected in a relatively unbiased manner, regardless of virulence or other unusual characteristics. Analyses of the genome sequence data combined with large-scale antigenic typing have given insights into the pattern of global spread, the genetic diversity during seasonal epidemics, and the dynamics of subtype evolution. SARS-CoV-2 such as the NCBI Severe acute respiratory syndrome coronavirus 2 database (http://www.nhc.gov.cn/jkj/s7915/202001/e4e2d5e6f01147e0a8df3f6701d49f33.shtml) and NGDC Genome Warehouse (bigd.big.ac.cn/gwh/) make the genomic information publicly available, together with epidemiological data for the sequenced isolates. The data sharing requires users to agree to collaborate with, and appropriately credit, all data contributors. A notable success of this initiative has been the contribution of countries, such as China, Philippines, and Japan, etc. which have previously been reticent about placing data in the public domain. The WHO also supports the endeavor of rapid publication of all available sequences for coronaviruses and there is hope that comprehensive submission to public databases will soon become a reality.

The finding heavily supports an intense requirement for additional prompt, inclusive investigations that combine genomic detail, epidemiological information and graph records of the clinical features of patients with COVID-19 (Payne et al., 2018). In the future, mining these resources and establishing a statistical framework based on epidemiological, antigenic, and genetic information could provide further insights into the rules that govern the emergence and establishment of antigenically novel variants and improve the potential for SARS-CoV-2 prevention and control (Ge et al., 2013; Yang et al., 2015). In this study, we investigated the extent of molecular variation between the recently sequenced genomes of SARS-CoV-2.

2. Methodology

We have downloaded 94 publicly available genomes from Genbank up to 12 March. Among 94 genomes, some of the genomes were not used for the analysis due to unusually high variants with gaps. NC_045512 genome sequence was used for reference and the genomic coordinate in this study is based on this reference genome (Lu et al., 2020). Therefore, genomic coordinates must be adjusted to compare with previous studies (Koyama et al., 2020)

Each genome was first aligned to NC_045512 using the EMBOSS needle with a default gap penalty of 10 and an extension penalty of 0.5. Then, differences in comparison with NC_045512 were extracted to create variants (Table 1, Table 2 ) (Rice et al., 2000). Based on protein annotations, nucleotide level variants were converted into amino acid codon variants for alignments when its location within a gene was identified (Arvestad, 2018). Nucleotide mutations in the genomes were revealed.

Table 1.

Coding mutation list detected in SARS-CoV-2 genomes.

Accession	Location-date	Nucleotide variation	Gene	Amino acid change	Mutation type
MT240479	04-03-2020/Pakistan Gilgit	1 1497G>A	Orf1ab		Synonymous mutation
MN996527	30/Dec/2019-China Wuhan	21316G>A	Orf1ab	D7018N	Missense
MN996527	30/Dec/2019-China Wuhan	24292A>G	S		Synonymous mutation
LC528232	10/Feb/2020-Japan	11083T>G	Orf1ab	L3606F	Missense
LC528232	10/Feb/2020-Japan	29642C>T	ORF10		Synonymous mutation
LR757995	05/Jan/2020-China Wuhan	28144T>C	ORF8	L84S	Missense
LR757998	12/26/2019-China Wuhan	6968C>A	Orf1ab	L2235I	Missense
LR757998	12/26/2019-China Wuhan	11749T>A	Orf1ab		Synonymous mutation
MN938384	1/10/2020-China Shenzhen	8782C>T	Orf1ab		Synonymous mutation
MN938384	1/10/2020-China Shenzhen	28144T>C	ORF8	L84S	Missense
MN938384	1/10/2020-China Shenzhen	29095C>T	N		Synonymous mutation
MN975262	11/Jan/2020-China	8782C>T	Orf1ab		Synonymous mutation
MN975262	11/Jan/2020-China	9534C>T	Orf1ab	T3090I	Missense
MN975262	11/Jan/2020-China	29095C>T	N		Synonymous mutation
MN975262	11/Jan/2020-China	28144T>C	ORF8	L84S	Missense
MN975262	11/Jan/2020-China	8782C>T	Orf1ab		Synonymous mutation
MN985325	19/Jan/2020-USA WA	28144T>C	ORF8	L84S	Missense
MN994467	23/Jan/2020-USA CA	1548G>A	Orf1ab	S428N	Missense
MN994467	23/Jan/2020-USA CA	8782C>T	Orf1ab		Synonymous mutation
MN994467	23/Jan/2020-USA CA	26729T>C	M		Synonymous mutation
MN994467	23/Jan/2020-USA CA	28077G>C	ORF8	V62L	Missense
MN994467	23/Jan/2020-USA CA	28144T>C	ORF8	L84S	Missense
MN994467	23/Jan/2020-USA CA	28792A>C	N		Synonymous mutation
MN994467	23/Jan/2020-USA CA	1912C>T	Orf1ab		Synonymous mutation
GWHABKF00000001	23/Dec/2019-China Wuhan	3778A>G	Orf1ab		Synonymous mutation
GWHABKF00000001	23/Dec/2019-China Wuhan	8388A>G	Orf1ab	N2708S	Missense
GWHABKF00000001	23/Dec/2019-China Wuhan	8987T>A	Orf1ab	F2908I	Missense
GWHABKK00000001	30/Dec/2019-China Wuhan	24325A>G	S		Synonymous mutation
GWHABKK00000001	30/Dec/2019-China Wuhan	21316G>A	Orf1ab	D7018N	Missense
GWHABKH00000001	30/Dec/2019-China Wuhan	6996T>C	Orf1ab	I2244T	Missense
GWHABKJ00000001	01/Jan/2019-China Wuhan	7866G>T	Orf1ab	G2534V	Missense
GWHABKM00000001	30/Dec/2019-China Wuhan	21137A>G	Orf1ab	K6958R	Missense
GWHABKM00000001	30/Dec/2019-China Wuhan	7016G>A	Orf1ab	G2251S	Missense
GWHABKO00000001	30/Dec/2019-China Wuhan	8001A>C	Orf1ab	D2579A	Missense
GWHABKO00000001	30/Dec/2019-China Wuhan	9534C>T	Orf1ab	T3090I	Missense
MT188341	05/Mar/2020-USA MN	6035A>G	Orf1ab		Synonymous mutation
MT188341	05/Mar/2020-USA MN	8782C>T	Orf1ab		Synonymous mutation
MT188341	05/Mar/2020-USA MN	16467A>G	Orf1ab		Synonymous mutation
MT188341	05/Mar/2020-USA MN	18060C>T	Orf1ab		Synonymous mutation
MT188341	05/Mar/2020-USA MN	21386insT	Orf1ab		Insertion
MT188341	05/Mar/2020-USA MN	21388-21390insTT	Orf1ab		Insertion
MT188341	05/Mar/2020-USA MN	23185C>T	S		Synonymous mutation
MT188341	05/Mar/2020-USA MN	28144T>C	ORF8	L84S	Missense
MT188339	09/Mar/2020-USA MN	8782C>T	Orf1ab		Synonymous mutation
MT188339	09/Mar/2020-USA MN	17423A>G	Orf1ab	Y5720C	Missense
MT188339	09/Mar/2020-USA MN	18060C>T	Orf1ab		Synonymous mutation
MT188339	09/Mar/2020-USA MN	21386C>T	Orf1ab		Synonymous mutation
MT188339	09/Mar/2020-USA MN	22432C>T	S		Synonymous mutation
MT188339	09/Mar/2020-USA MN	28144T>C	ORF8	L84S	Missense
MT121215	02/Feb/2020-China Shanghai	6031C>T	Orf1ab		Synonymous mutation
MT123290	05/Feb/2020-China Guangzhou	15597T>C	Orf1ab		Synonymous mutation
MT123290	05/Feb/2020-China Guangzhou	29095C>T	N		Synonymous mutation
MT126808	2/28/2020-Brazil	26144G>T	ORF3a	G251V	Missense
MT066175	31/Jan/2020-Taiwan	8782C>T	Orf1ab		Synonymous mutation
MT066175	31/Jan/2020-Taiwan	28144T>C	ORF8	L84S	Missense
MT093571	07/Feb/2020-Sweden	13225C>G	Orf1ab		Synonymous mutation
MT093571	07/Feb/2020-Sweden	13226T>C	Orf1ab		Synonymous mutation
MT093571	07/Feb/2020-Sweden	17423A>G	Orf1ab	Y5720C	Missense
MT093571	07/Feb/2020-Sweden	23952T>G	S		Synonymous mutation
MT066156	30/Jan/2020-Italy	11083T>G	Orf1ab	L3606F	Missense
MT066156	30/Jan/2020-Italy	26144G>T	ORF3a	G251V	Missense
LC522975	20/JAN/2020-JAPAN	8782C>T	Orf1ab		Synonymous mutation
LC522975	20/JAN/2020-JAPAN	29095C>T	N		Synonymous mutation
LC522975	20/JAN/2020-JAPAN	28144T>C	ORF8	L84S	Missense
LC522975	20/JAN/2020-JAPAN	2662C>T	ORF1ab		Synonymous mutation
LC522974	20/JAN/2020-JAPAN	8782C>T	ORF1ab		Synonymous mutation
LC522974	20/JAN/2020-JAPAN	29095C>T	N		Synonymous mutation
LC522974	20/JAN/2020-JAPAN	28144T>C	ORF8	L84S	Missense
LC522974	20/JAN/2020-JAPAN	2662C>T	ORF1ab		Synonymous mutation
LC522973	20/JAN/2020-JAPAN	8782C>T	ORF1ab		Synonymous mutation
LC522973	20/JAN/2020-JAPAN	29095C>T	N		Synonymous mutation
LC522973	20/JAN/2020-JAPAN	3792C>T	ORF1ab	A1176V	Missense
LC522973	20/JAN/2020-JAPAN	29095C>T	N		Synonymous mutation
LC522973	20/JAN/2020-JAPAN	2662C>T	ORF1ab		Synonymous mutation
LC522973	20/JAN/2020-JAPAN	28144T>C	ORF8	L84S	Missense
LC522972	20/JAN/2020-JAPAN	29303C>T	N	P344S	Missense
LC522972	20/JAN/2020-JAPAN	25810C>G	ORF3a	L140V	Missense
LC522972	20/JAN/2020-JAPAN	11557G>T	ORF1ab	E3764D	Missense
LC522972	20/JAN/2020-JAPAN	15324C>T	ORF1ab		Synonymous mutation
LC521925	21/JAN/2020-JAPAN	1912C>T	ORF1ab		Synonymous mutation
LC521925	21/JAN/2020-JAPAN	18512C>T	ORF1ab	P6083L	Missense
LC521925	21/JAN/2020-JAPAN	359_382del	ORF1ab	G32_L39del	Deletion
MN988713	21/JAN/2020-USA Chicago	24034C>T	S		Synonymous mutation
MN988713	21/JAN/2020-USA Chicago	26729T>C	M		Synonymous mutation
MN988713	21/JAN/2020-USA Chicago	8782C>T	ORF1ab		Synonymous mutation
MN988713	21/JAN/2020-USA Chicago	490T>A	ORF1ab	D75E	Missense
MN988713	21/JAN/2020-USA Chicago	3177C>T	ORF1ab	P971L	Missense
MN988713	21/JAN/2020-USA Chicago	28854C>T	N	S194L	Missense
MN988713	21/JAN/2020-USA Chicago	28077G>C	ORF8	V62L	Missense
MN988713	21/JAN/2020-USA Chicago	28144T>C	ORF8	L84S	Missense
MN997409	21/JAN/2020-USA Arizona	8782C>T	ORF1ab		Synonymous mutation
MN997409	21/JAN/2020-USA Arizona	29095C>T	N		Synonymous mutation
MN997409	21/JAN/2020-USA Arizona	11083G>T	ORF1ab	L3606F	Missense
MN997409	21/JAN/2020-USA Arizona	28144T>C	ORF8	L84S	Missense
MT072688	26/JAN/2020-USA: Massachussetts	24034C>T	S		Synonymous mutation
NMDC60013002-09	01/JAN/2019-China Wuhan	27493C>T	ORF7a	P34S	Missense
NMDC60013002-09	01/JAN/2019-China Wuhan	28253C>T	ORF8		Synonymous mutation
NMDC60013002-10	30/Dec/2019-China Wuhan	20679G>A	ORF1ab		Synonymous mutation
NMDC60013002-01	30/Dec/2019-China Wuhan	11764T>A	ORF1ab	N3833K	Missense
NMDC60013002-06	30/Dec/2019-China Wuhan	24325A>G	S		Synonymous mutation
NMDC60013002-04	05/Dec/2019-China Wuhan	28144T>C	ORF8	L84S	Missense

Open in a new tab

Table 2.

Non-coding mutation list detected in SARS-CoV-2 genomes.

Accession	Location-date	Nucleotide variation	UTR type
MT240479	04-03-2020/Pakistan Gilgit	241C>T	5 UTR
MT123290	05/Feb/2020-China Guangzhou	4A>T	5 UTR
MT007544	25/Jan/2020-Australia Victoria	29749-29759del	3 UTR
NMDC60013002-07	07/JAN/2019-China Wuhan	29869del	3 UTR
NMDC60013002-04	05/Dec/2019-China Wuhan	29856T>A	3 UTR
NMDC60013002-04	05/Dec/2019-China Wuhan	29854C>T	3 UTR
NMDC60013002-04	05/Dec/2019-China Wuhan	16C>T	5 UTR
MT049951	17/Jan/2019-China Yunnan	75C>A	5 UTR
LC522975	20/JAN/2020-JAPAN	29705G>T	3 UTR
GWHABKG00000001	30/Dec/2019-China Wuhan	124G>A	5 UTR
GWHABKG00000001	30/Dec/2019-China Wuhan	120T>C	5 UTR
GWHABKG00000001	30/Dec/2019-China Wuhan	119C>G	5 UTR
GWHABKG00000001	30/Dec/2019-China Wuhan	112T>G	5 UTR
GWHABKG00000001	30/Dec/2019-China Wuhan	111T>C	5 UTR
GWHABKG00000001	30/Dec/2019-China Wuhan	104T>A	5 UTR

Open in a new tab

3. Results

A hundred fifty-six total variants were found and 116 unique variants as shown in Table 1, Table 2. Among the 95 genomes we analyzed, 24 samples did not exhibit any variants except for missing starts and end base pairs. The distinct variants consist of 46 missense, 52 synonymous, 2 insertion, 1 deletion and 14 non-coding alleles in Fig. 1. The most common variants were 8782C>T(ORF1ab) in 13 samples, 28144T>C (ORF8) in 14 samples and 29095C>T (N) in 8samples. The occurrences of 8782C>T and 28144T>C coincide. 29095C>T is found in the subset of them. Both 8782C>T and 29095C>T are synonymous; however, 28144T>C causes amino acid to change L84S in ORF8. Notably, most of 8782C>T and 28144T>Cvariant substrains are found outside of Wuhan. For the 46 missense variants, 24 variants are found in ORF1ab, which is the longest ORF occupying 2/3 of the entire genome. ORF1ab is cleaved into many nonstructural proteins (NSP1-NSP16). Among NSP's, NSP3 has more variants in the analyzed samples. All noncoding mutations are located in 5′ UTR or 3′ UTR regions. In terms of base changes, the most frequently observed one is C>T as shown in Table 1, Table 2.

4. Discussion

The genetic information of any life is protected in its genome, and annotation is the initial step to interpret the sequence. The length of the SARS-CoV genome is over 30 Kb, while just a few coding genes appear not to accord with the general properties for the viral genome and the minimum grouping of hereditary data. In addition to these, it may have some non-structural proteins but lacking data at one place is needed. The absence probably results from their short existing-time before decomposition. In this study, we worked to find the extent of molecular variation between the recently sequenced genomes of SARS-CoV-2.

Numerous investigations have depicted that ORFs and ACE2 genes play a key role during novel coronavirus disease (Koyama et al., 2020; Kirchdoerfer and Ward, 2019; Van der Meer et al., 1998; Wan et al., 2020). So in our study, 156 total variants were found and 116 unique variants (Table 1, Table 2). Among the 95 genomes we analyzed, 24 samples did not exhibit any variants except for missing starts and end base pairs. Additionally, the distinct variants consist of 46 missense, 52 synonymous, 2 insertions, 1 deletion, 14 non-coding alleles (Table 1, Table 2). Most common variants were 8782C>T(ORF1ab) in 13 samples, 28144T>C (ORF8) in 14 samples and 29095C>T (N) in 8 samples.The occurrences of 8782C>T and 28144T>C coincide. 29095C>T is found in the subset of them. Both 8782C>T and 29095C>T are synonymous; however, 28144T>C causes amino acid to change L84S in ORF8. It is notable that most of 8782C>T and 28144T>C variant substrains are found outside of Wuhan. For the 46 missense variants, 24 variants are found in ORF1ab, which is the longest ORF occupying 2/3 of the entire genome. ORF1ab is cleaved into many nonstructural proteins (NSP1-NSP16). Among NSP's, NSP3 has more variants in the analyzed samples. All noncoding mutations are located in 3′UTR or 5′UTR. In terms of base changes, the most frequently observed one is C>T (Table 1, Table 2).

The replicase enzyme is displayed as two polyproteins (ORF1a and ORF1ab), which are prepared into 12 nonstructural proteins by three viral proteases (Van der Meer et al., 1998). This ORF1ab polyprotein includes the nsps 1–3 proteins. This area of ORF1ab is the most important factor among coronaviruses (Wan et al., 2020). Many researchers found the relationship between ORFs with COVID-19 i.e.8782C>T(ORF1ab) and 28144T>C (ORF8) are available among genomic databases (Kirchdoerfer and Ward, 2019; Koyama et al., 2020). Hence, it will be clinically significant to break down the biological function of the particular protein ORF1ab in SARS-CoV-2.

Orf8 protein of SARS-CoV-2 doesn't contain a known useful motif or region. A total motif VLVVL (amino corrosive 75–79) has been found in SARS-CoV orf8b which was appeared to trigger intracellular stress pathways and enact NOD-like receptor family pyrin region containing-3 (NLRP3) (Shi et al., 2019). Moreover, multiple arrangements with different coronavirus ORF8 sequences propose that L84 related to 28144T>C (L84S) isn't preserved (Koyama et al., 2020). Thusly, it will be critical to examine the biological function of the particular protein (orf8) in SARS-CoV-2.

ORF10 is a short protein or peptide of length 38 deposits. Koyama et al. depicted that COVID-19 is ORF10 which doesn't have any comparative proteins in the NCBI repository. This one of a kind protein can be used to distinguish the infection more rapidly than PCR based strategies (Koyama et al., 2020), but the further characterization of this protein is strongly required.

Another study demonstrated that NCBI had displayed new annotations for orf1ab as of late. NSP6 is the main contrast and it is considered as a putative protein (Koyama et al., 2020). So, they held the NSP annotations. They further referenced that 12 remarkable variations in NSP3 protein in ORF1ab. Thus concluded that there was a basic connection between the nsp3 association and the inception of coronavirus infection (Hurst et al., 2013a). Besides, they investigated that NSP3 contains the papain-like protease and is regarded as significant for SARS infection (Niemeyer et al., 2018). Variations found in subjects began from Wuhan are situated in either TM1 or Y space which is profoundly saved (Hurst et al., 2013a, Hurst et al., 2013b).

Sawicki et al. performed sequencing of ORF1 from a huge available data that was established in labs (Sawicki et al., 2005). The report distinguished single point transformations coming from nonsynonymous substitutions in nsps 4, 5, 10, 12, 14 and 16. The collective outcomes recommend that the ORF1b nsp 12, 14 and 16 proteins characterize particular cistrons, while the distorted ORF1a proteins nsps 4, 5, and 10 form a compartment together at the location of the coding sequence of ORF1ab (Graham et al., 2008). Notably, of the eight announced mutations in MHV, seven of the influenced amino acid deposits are correlated with SARS-CoV (Graham et al., 2008). This methodology may permit the determination of phenotypic travelers that will distinguish by protein interactions. Also, it will permit the progressions to be acquainted in SARS-CoV with deciding whether the ts phenotype can be reproduced in that foundation, with the chance of quickly building up a board of SARS-CoV. Curiously, not only is the slow-growth branch dominated by travelers, but the COVID-19 lineages appear to be phylogenetically related to each other, suggesting an exposure point for these individuals that are distinct from the rest of the population.

5. Conclusion

The fast increment of cases is giving more genomes that may give some visibility and proof of populace structure, especially of the chance of various presentations of COVID-19 into the human population. A comprehension of the biological reservoirs conveying these infections, and how the course to introduce has been carrying them into contact with human beings will be critical to comprehend future risks for novel diseases. This study showed how the disease spread among the travelers. This fight against COVID-19 will be a long one until we develop vaccines or effective treatments. However, we believe that collecting and sharing knowledge on variants will be effective. We should continue to be vigilant for the emergence of new variants or substrains and data should be gathered at one place for better understanding.

CRediT authorship contribution statement

Rozhgar A. Khailany:Conceptualization, Methodology, Software, Validation, Visualization, Writing - original draft, Writing - review & editing.Muhamad Safdar:Conceptualization, Visualization, Writing - original draft, Writing - review & editing.Mehmet Ozaslan:Conceptualization, Supervision, Visualization, Writing - review & editing.

Declaration of competing interest

The authors declare that they have no competing interests.

Acknowledgments

We highly appreciate many members of the frontline medical and nursing staff who demonstrated selfless and heroic devotion to duty in the face of this outbreak.

References

Adhikari S., Meng S., Wu Y., et al. Epidemiology, causes, clinical manifestation and diagnosis, prevention and control of coronavirus disease (COVID-19) during the early outbreak period: a scoping review. Infect Dis Poverty. 2020;9:29. doi: 10.1186/s40249-020-00646-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Arvestad L. alv: a console-based viewer for molecular sequence alignments. Journal of Open Source Software. 2018;3(31):955. [Google Scholar]
Chen N., Zhou M., Dong X., Qu J., Gong F., Han Y., et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395:507–513. doi: 10.1016/S0140-6736(20)30211-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
De Wit E., van Doremalen N., Falzarano D., Munster V.J. SARS and MERS: recent insights into emerging coronaviruses. Nat Rev Microbiol. 2016;14:523–534. doi: 10.1038/nrmicro.2016.81. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ge X.Y., et al. Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor. Nature. 2013;503:535–538. doi: 10.1038/nature12711. [DOI] [PMC free article] [PubMed] [Google Scholar]
Graham R.L., Sparks J.S., Eckerle L.D., Sims A.C., Denison M.R. SARScoronavirus replicase proteins in pathogenesis. Virus Res. 2008;133 doi: 10.1016/j.virusres.2007.02.017. (88–10) [DOI] [PMC free article] [PubMed] [Google Scholar]
Hurst K.R., Koetzner C.A., Masters P.S. Characterization of a critical interaction between the coronavirus nucleocapsid protein and nonstructural protein 3 of the viral replicase-transcriptase complex. J. Virol. 2013;87(16):9159–9172. doi: 10.1128/JVI.01275-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hurst Kelley R., Koetzner Cheri A., Paul S. J. Virol. 2013;87(16):9159. doi: 10.1128/JVI.01275-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kirchdoerfer R.N., Ward A.B. Structure of the SARS-CoV nsp12 polymerase bound to nsp7 and nsp8 co-factors. Nat. Commun. 2019;10:2342. doi: 10.1038/s41467-019-10280-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Koyama T., Platt D., Parida L. Variant analysis of COVID-19 genomes. Bull. World Health Organ. 2020 doi: 10.2471/BLT.20.253591. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li F., Li W., Farzan M., Harrison S.C. Structure of SARS coronavirus spike receptor-binding domain complexed with receptor. Science. 2005;309:1864–1868. doi: 10.1126/science.1116480. [DOI] [PubMed] [Google Scholar]
Li Q., Guan X., Wu P., Wang X., Zhou L., Tong Y., et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 2020 doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lu R., Zhao X., Li J., et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020;395:565–574. doi: 10.1016/S0140-6736(20)30251-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Niemeyer D., Mosbauer K., Klein E.M., Sieberg A., Mettelman R.C., Mielech A.M., et al. The papain-like protease determines a virulence trait that varies among members of the SARS-coronavirus species. PLoS Pathog. 2018;14(9) doi: 10.1371/journal.ppat.1007296. [DOI] [PMC free article] [PubMed] [Google Scholar]
Oostra M., de Haan C.A., Rottier P.J. The 29-nucleotide deletion present in human but not in animal severe acute respiratory syndrome coronaviruses disrupts the functional expression of open reading frame 8. J. Virol. 2007;81:13876–13888. doi: 10.1128/JVI.01631-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ozaslan M., Safdar M., Kilic I.H., Khailany R.A. Practical measures to prevent COVID-19: a mini-review. J. Biol. Sci. 2020;20 XX-XX. [Google Scholar]
Payne D.C., et al. Multihospital outbreak of a Middle East respiratory syndrome coronavirus deletion variant, Jordan: a molecular, serologic, and epidemiologic investigation. Open Forum Infect Di. 2018;5 doi: 10.1093/ofid/ofy095. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rice P., Longden I., Bleasby A. EMBOSS: the European molecular biology open software suite. Trends Genet. 2000;16(6):276–277. doi: 10.1016/s0168-9525(00)02024-2. [DOI] [PubMed] [Google Scholar]
Sawicki S.G., Sawicki D.L., Younker D., Meyer Y., Thiel V., Stokes H., Siddell S.G. Functional and genetic analysis of coronavirus replicase-transcriptase proteins. PLoSPathog. 2005;1(4):e39. doi: 10.1371/journal.ppat.0010039. [DOI] [PMC free article] [PubMed] [Google Scholar]
Shi C.S., Nabar N.R., Huang N.N., Kehrl J.H. SARS-Coronavirus Open Reading Frame-8b triggers intracellular stress pathways and activates NLRP3 inflammasomes. Cell Death Discov. 2019;5:101. doi: 10.1038/s41420-019-0181-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Van der Meer Y., van Tol H., Locker J.K., Snijder E.J. ORF1a-encoded replicase subunits are involved in the membrane association of the arterivirus replication complex. J. Virol. 1998;72:6689–6698. doi: 10.1128/jvi.72.8.6689-6698.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wan Y., Shang J., Graham R., Baric R.S., Li F. Receptor recognition by the novel coronavirus from Wuhan: an analysis based on decade-long structural studies of SARS coronavirus. J. Virol. 2020;94 doi: 10.1128/JVI.00127-20. (e00127-20) [DOI] [PMC free article] [PubMed] [Google Scholar]
Yang X.L., et al. Isolation and characterization of a novel bat coronavirus closely related to the direct progenitor of severe acute respiratory syndrome coronavirus. J. Virol. 2015;90:3253–3256. doi: 10.1128/JVI.02582-15. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0005] Adhikari S., Meng S., Wu Y., et al. Epidemiology, causes, clinical manifestation and diagnosis, prevention and control of coronavirus disease (COVID-19) during the early outbreak period: a scoping review. Infect Dis Poverty. 2020;9:29. doi: 10.1186/s40249-020-00646-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0010] Arvestad L. alv: a console-based viewer for molecular sequence alignments. Journal of Open Source Software. 2018;3(31):955. [Google Scholar]

[bb0015] Chen N., Zhou M., Dong X., Qu J., Gong F., Han Y., et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395:507–513. doi: 10.1016/S0140-6736(20)30211-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0020] De Wit E., van Doremalen N., Falzarano D., Munster V.J. SARS and MERS: recent insights into emerging coronaviruses. Nat Rev Microbiol. 2016;14:523–534. doi: 10.1038/nrmicro.2016.81. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0025] Ge X.Y., et al. Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor. Nature. 2013;503:535–538. doi: 10.1038/nature12711. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0030] Graham R.L., Sparks J.S., Eckerle L.D., Sims A.C., Denison M.R. SARScoronavirus replicase proteins in pathogenesis. Virus Res. 2008;133 doi: 10.1016/j.virusres.2007.02.017. (88–10) [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0035] Hurst K.R., Koetzner C.A., Masters P.S. Characterization of a critical interaction between the coronavirus nucleocapsid protein and nonstructural protein 3 of the viral replicase-transcriptase complex. J. Virol. 2013;87(16):9159–9172. doi: 10.1128/JVI.01275-13. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0040] Hurst Kelley R., Koetzner Cheri A., Paul S. J. Virol. 2013;87(16):9159. doi: 10.1128/JVI.01275-13. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0045] Kirchdoerfer R.N., Ward A.B. Structure of the SARS-CoV nsp12 polymerase bound to nsp7 and nsp8 co-factors. Nat. Commun. 2019;10:2342. doi: 10.1038/s41467-019-10280-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0050] Koyama T., Platt D., Parida L. Variant analysis of COVID-19 genomes. Bull. World Health Organ. 2020 doi: 10.2471/BLT.20.253591. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0055] Li F., Li W., Farzan M., Harrison S.C. Structure of SARS coronavirus spike receptor-binding domain complexed with receptor. Science. 2005;309:1864–1868. doi: 10.1126/science.1116480. [DOI] [PubMed] [Google Scholar]

[bb0060] Li Q., Guan X., Wu P., Wang X., Zhou L., Tong Y., et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 2020 doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0065] Lu R., Zhao X., Li J., et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020;395:565–574. doi: 10.1016/S0140-6736(20)30251-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0075] Niemeyer D., Mosbauer K., Klein E.M., Sieberg A., Mettelman R.C., Mielech A.M., et al. The papain-like protease determines a virulence trait that varies among members of the SARS-coronavirus species. PLoS Pathog. 2018;14(9) doi: 10.1371/journal.ppat.1007296. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0080] Oostra M., de Haan C.A., Rottier P.J. The 29-nucleotide deletion present in human but not in animal severe acute respiratory syndrome coronaviruses disrupts the functional expression of open reading frame 8. J. Virol. 2007;81:13876–13888. doi: 10.1128/JVI.01631-07. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb2000] Ozaslan M., Safdar M., Kilic I.H., Khailany R.A. Practical measures to prevent COVID-19: a mini-review. J. Biol. Sci. 2020;20 XX-XX. [Google Scholar]

[bb0085] Payne D.C., et al. Multihospital outbreak of a Middle East respiratory syndrome coronavirus deletion variant, Jordan: a molecular, serologic, and epidemiologic investigation. Open Forum Infect Di. 2018;5 doi: 10.1093/ofid/ofy095. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0090] Rice P., Longden I., Bleasby A. EMBOSS: the European molecular biology open software suite. Trends Genet. 2000;16(6):276–277. doi: 10.1016/s0168-9525(00)02024-2. [DOI] [PubMed] [Google Scholar]

[bb0095] Sawicki S.G., Sawicki D.L., Younker D., Meyer Y., Thiel V., Stokes H., Siddell S.G. Functional and genetic analysis of coronavirus replicase-transcriptase proteins. PLoSPathog. 2005;1(4):e39. doi: 10.1371/journal.ppat.0010039. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0100] Shi C.S., Nabar N.R., Huang N.N., Kehrl J.H. SARS-Coronavirus Open Reading Frame-8b triggers intracellular stress pathways and activates NLRP3 inflammasomes. Cell Death Discov. 2019;5:101. doi: 10.1038/s41420-019-0181-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0105] Van der Meer Y., van Tol H., Locker J.K., Snijder E.J. ORF1a-encoded replicase subunits are involved in the membrane association of the arterivirus replication complex. J. Virol. 1998;72:6689–6698. doi: 10.1128/jvi.72.8.6689-6698.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0110] Wan Y., Shang J., Graham R., Baric R.S., Li F. Receptor recognition by the novel coronavirus from Wuhan: an analysis based on decade-long structural studies of SARS coronavirus. J. Virol. 2020;94 doi: 10.1128/JVI.00127-20. (e00127-20) [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0115] Yang X.L., et al. Isolation and characterization of a novel bat coronavirus closely related to the direct progenitor of severe acute respiratory syndrome coronavirus. J. Virol. 2015;90:3253–3256. doi: 10.1128/JVI.02582-15. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Genomic characterization of a novel SARS-CoV-2

Rozhgar A Khailany

Muhamad Safdar

Mehmet Ozaslan

Abstract

1. Introduction

Fig. 1.

2. Methodology

Table 1.

Table 2.

3. Results

4. Discussion

5. Conclusion

CRediT authorship contribution statement

Declaration of competing interest

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Genomic characterization of a novel SARS-CoV-2

Rozhgar A Khailany

Muhamad Safdar

Mehmet Ozaslan

Abstract

1. Introduction

Fig. 1.

2. Methodology

Table 1.

Table 2.

3. Results

4. Discussion

5. Conclusion

CRediT authorship contribution statement

Declaration of competing interest

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases