Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2008 Jun 2;136(1):43–49. doi: 10.1016/j.virusres.2008.04.015

Complete nucleotide sequence of polyprotein gene 1 and genome organization of turkey coronavirus

Jianzhong Cao 1, Ching-Ching Wu 1, Tsang Long Lin 1,
PMCID: PMC7114415  PMID: 18514348

Abstract

The complete nucleotide sequence of polyprotein gene 1 and the assembled full-length genome sequence are presented for turkey coronavirus (TCoV) isolates 540 and ATCC. The TCoV polyprotein gene encoded two open reading frames (ORFs), which are translated into two products, pp1a and pp1ab, the latter being produced via −1 frameshift translation. TCoV polyprotein pp1a and pp1ab were predicted to be processed to 15 non-structure proteins (nsp2–nsp16), with nsp1 missing. ClustalW analysis revealed 88.99% identity and 96.99% similarity for pp1ab between TCoV and avian infectious bronchitis virus (IBV) at the amino acid level. The whole genome consists of 27,749 nucleotides for 540 and 27,816 nucleotides for ATCC, excluding the poly(A) tail. A total of 13 ORFs were predicted for TCoV. Five subgenomic RNAs were detected from ATCC-infected turkey small intestines by Northern blotting. The whole genome sequence had 86.9% identity between TCoV and IBV, supporting that TCoV is a group 3 coronavirus.

Abbreviations: nsp, non-structure protein; ORF, open reading frame; TCoV, turkey coronavirus; TRS, transcription-regulating sequence; UTR, untranslated region

Keywords: Turkey coronavirus, Polyprotein, Polymerase, coronavirus, Genome, Subgenomic

1. Introduction

Turkey coronavirus (TCoV) is a causative agent for bluecomb disease in turkey poults. The outbreak of the disease was first reported more than 40 years ago, and the viral agent responsible for the disease was identified as turkey coronavirus in 1973 (Ritchie et al., 1973). TCoV infects the small intestine of turkey poults and causes disruption of the infected tissue resulting in reduced surface area of intestine, reduced consumption of food and apparent decrease in body weight of infected turkeys. The mortality rate is low, however. The outbreaks of TCoV were mostly reported from turkey farms in the US and Europe (Cavanagh, 2005, Nagaraja and Pomeroy, 1997). Based on the antigenic relationship between TCoV and other coronaviruses, TCoV was classified with avian infectious bronchitis virus (IBV), which infects chicken, as a group 3 coronavirus within the Genus Coronavirus, Family Coronaviridae, and Order Nidovirales (Gonzalez et al., 2003).

Coronavirus genome contains a single, positive-strand RNA ((+) ssRNA) molecule, which is about 27–33 kilobases (kb) and has a cap at the 5′ end and poly(A) tail at the 3′ end (Boursnell et al., 1987, Lai and Stohlman, 1981). There are four structural genes encoded by all coronavirus genomes so far sequenced; these are spike protein (S), envelope protein (E), matrix protein (M), and nucleocapside protein (N). The genome organization of coronavirus is 5′-polymerase-S-E-M-N-3′. An untranslated region (UTR) is located at both the 5′ and 3′ ends of the genome. The production of structural proteins is through transcription of a set of co-terminal subgenomic mRNA (sgRNA). The molecular mechanisms of genome replication and transcription are not fully understood, but the discontinuous negative-strand extension model has gained wide acceptance (Sawicki and Sawicki, 1995, Sawicki et al., 2007).

The polymerase gene accounts for about two-thirds of the genome (20–22 kDa) and consists of two open reading frames (ORFs): ORF1a and ORF1b (Boursnell et al., 1987). The polymerase is necessary and sufficient for genome replication and transcription because purified viral RNA or in vitro transcribed viral RNA from cDNA construct are infectious when transfected into permissive cells (Yount et al., 2000). However, nucleocapsid protein greatly enhanced coronavirus genome replication (Almazan et al., 2004, Schelle et al., 2005), suggesting that nucleocapsid protein may have a regulatory role for coronavirus replication. When viral genomic RNA enters the host cell, ORF1a (pp1a) and polyprotein 1ab (pp1ab) are translated first, the latter being translated through a −1 frameshift translation mechanism (Bredenbeek et al., 1990, Brierley et al., 1989, Herold and Siddell, 1993, Lee et al., 1991). Coronavirus pp1ab contains a 3C-like proteinase (3CLpro) and a papain-like proteinase (PLP) that automatically cleave themselves from polyprotein and further process the polyprotein into more than 15–16 non-structure proteins (nsps) (Weiss et al., 1994) including an RNA-dependent RNA polymerase (RdRp, nsp12), an NTPase/Helicase (nsp13) for unwinding dsRNA, and three other proteins recently predicted to be a nuclease ExoN homolog (nsp14), an endoRNAse (nsp15), and a 2′-O-methyltransferase (2′-O-MT, nsp16) (Snijder et al., 2003). The biochemical functions of some of these enzymes were recently characterized (Bhardwaj et al., 2004, Ivanov et al., 2004a, Ivanov et al., 2004b, Ivanov and Ziebuhr, 2004, Minskaia et al., 2006, Snijder et al., 2003), and their in vivo roles are under investigation.

Recently, our lab reported the completion of the 3′ end sequence of four isolates of the TCoV (Lin et al., 2004, Loa et al., 2006). The sequence revealed four structure genes (E, M, N, and S) and four accessory ORFs designated as 3a, 3b, 5a, and 5b (Breslin et al., 1999a, Breslin et al., 1999b, Lin et al., 2004). Sequence analysis indicated that the sequences of M and N of TCoV shared over 80% sequence identity with that of IBV. However, the S gene shared less than 40% sequence similarity to any known coronavirus S genes (Lin et al., 2004). These results suggest that TCoV may have diverged from IBV during evolution. In this study, we continue to determine and analyze the nucleotide sequence of the polyprotein gene of TCoV and use bioinformatics to predict potential functional domains encoded by TCoV polyprotein 1ab (pp1ab). The polymerase gene sequence is then combined with structure gene sequence to assemble the full-length genome sequence for TCoV.

2. Materials and methods

2.1. Viruses

TCoV isolate ATCC (Minnesota strain) was obtained from American Type Culture Collection (ATCC, Manasass, VA). The TCoV isolate 540 used in the present study was recovered from fecal contents and intestines of turkey poults with acute coronaviral enteritis in Indiana, USA in 1994. The viruses were propagated in 22-day-old embryonating turkey eggs. The presence of TCoV in the intestines of embryos was confirmed by TCoV-specific immunofluorescence antibody assays and electron microscopy at the Indiana State Animal Disease Diagnostic Laboratory in West Lafayette, IN, USA (Lin et al., 2004). Viruses were purified from small intestine following published method (Loa et al., 2002) and either used immediately or stored at −80 °C for further use.

2.2. RNA isolation and cDNA synthesis

Viral genomic RNA was purified with RNApure reagent (GenHunter). Briefly, 0.2 ml of virus suspension was mixed with 1 ml of RNApure reagent followed by chloroform extraction. RNA was finally precipitated by isopropanol and washed with 70% ethanol. RNA pellet was air dried and dissolved in 30 μl of DEPC–H2O and used for cDNA synthesis by SuperScript RT II system with random hexmer or oligo dT18 (for 3′ RACE) (Invitrogen). The synthesized cDNA was treated with RNase A to digest viral RNA and then served as template for PCR.

2.3. PCR amplification

To clone the whole 1ab gene, the following strategies were employed. The first was to amplify a 900-bp conserved RdRp sequence based on Stephensen's report (Stephensen et al., 1999). Then, long-PCR was used to amplify the region between RdRp and the spike gene. Based on sequence results, bioinformatics analysis was used to design PCR primers to amplify the remaining sequence of ORF 1a gene. Expand LA PCR system (Roche) was used for all PCR amplification. The PCR reaction consisted of 1× PCR buffer, 1.7 mM MgCl2, 500 nM each of dNTPs, 200 pmol of each primer, 2 μl of cDNA, and 0.25 unit of DNA polymerase in a final volume of 50 μl. The PCR was performed on a Tetra machine (MJ Research) with the following conditions: initial denaturation at 94 °C for 3 min; denaturation at 93 °C for 10 s, annealing at 55 °C for 30 s, extension at 68 °C for 5–6 min; total of 30 cycles. The final extension at 68 °C was 10 min. PCR product was purified by Qiagen PCR purification Kit (Qiagen), cloned into pCRII-TOPO vector, and transformed into TOP10F cells (Invitrogen). The plasmid was prepared by QIAquick Spin Miniprep Kit (Qiagen) and submitted for DNA sequencing at Purdue Genomic Center (Purdue University, West Lafayette, IN, USA). At least two independent colonies were sequenced for each sequence. All PCR primers are listed in Supplementary Table S1 and are available upon request.

2.4. Amplification of 5′ and 3′ ends by RACE

To amplify the 5′ end of TCoV genome, 5′ RACE system for rapid amplification of cDNA ends (Invitrogen) was employed except that Expand LA polymerase was used in the PCR. Random primers were used to synthesize cDNA from ATCC and 540 RNA. The cDNA was treated with RNase mix and purified by GlassMax spin cartridge according to manufacture's protocol (Invitrogen). The 3′ end of cDNA was tailed with dCTP by TdT. After tailing with dCTP, PCR was performed with primers AAP (GGC CAC GCG TCG ACT AGT ACG GGI IGG GII GGG IIG) and IBPR2 (TGG CAC TAC CCC CTA CAA AC). The amplified PCR product was analyzed and cloned for sequencing in the same way as described in previous section for PCR amplification.

To amplify the 3′ end of TCoV genome, oligodT18 was used to synthesize cDNA from genomic RNA of ATCC and 540. After degradation of RNA with RNase mix, the cDNA was used as template for PCR with primers oligodT15 and AT3endF (TGGAATTTGATGATGAACC, 96 nt upstream of the stop codon of ATCC N gene). The PCR product was treated in the same way as above.

To obtain the leader–body junction sequence of each subgenomic mRNA (sgRNA), primers TCVF (ACTAAAGATAGATATTAATATATATCTATTGCACTAGCC) and TCVsgR1 (AAACCAAGATGCATTTCC) were used to amplify the 5′ end of sgRNA for 3, M, 5, and N. For amplify the 5′ end of sgRNA for S gene, TCVF and ATS174 (TCTGGCGGTCTCATAACATCTGGA) were used in PCR. PCR products were cloned into pCRII-TOPO for sequencing as described in previous section.

2.5. Sequence analysis

All DNA sequences were analyzed by DNAStar software (Madison, WI, USA) and ClustalW program (Thompson et al., 1994) or online softwares as indicated in the results. Frameshift pseudoknot was predicted using M-fold (Mathews et al., 1999).

2.6. Polyprotein mapping

Polyprotein mapping of TCoV 1ab polyprotein was based on predicted 3CLpro and PLP and their substrate preferences as described for IBV (Liu et al., 1998) and other coronaviruses (Hegyi and Ziebuhr, 2002, Kiemer et al., 2004). BLASTp program (NCBI: http://www.ncbi.nlm.nih.gov/blast/Blast.cgi) and pfman (www.expasy.org) were used to find sequence similarity and conserved domains in database. TMHMM was used to predict transmembrane domains (http://www.cbs.dtu.dk/services/TMHMM-2.0/). The nomenclature for pp1a and pp1ab mapping product (nsp) was according to Ziebuhr, 2005 and Ziebuhr et al., 2000.

2.7. Phylogenetic analysis

The alignments were performed using CLUSTALW (Thompson et al., 1994), and phylogenetic trees were drawn by DNAStar and program at http://www.genebee.msu.su/services/phtree_full.html.

Coronavirus sequences used in this article were from NCBI. Their GenBank accession numbers were:

BCoV, NC_003045; BtCoV, NC_008315; FCoV, NC_007025; HCoV-229E, NC_002645; HCoV-NL63, NC_005831; HCoV-OC43, NC_005147; HCoV-HKU1, NC_006577; IBV, NC_001451; MHV, NC_001846; PEDV, NC_003436; SARS-CoV, NC_004718; TGEV, NC_002306.

2.8. Northern blotting

About 10 μg of isolated total RNA from mock and ATCC-infected turkey small intestines were separated on 1% agarose gel and transferred onto nitrocellulose membrane. 32P-CTP-(GE healthcare) labeled N gene probe was prepared using High Prime DNA Labeling Kit (Roche) with N gene primers N102F and N102R. Membrane was prehybridized for 2 h at 68 °C and then hybridized overnight at 68 °C with 32P-labeled N gene probe. After hybridization, membranes were wrapped with Saran Wrap and exposed to X-ray film for signal development.

3. Results and discussion

3.1. Nucleotide sequence accession number

The sequences reported in this work have been deposited in the GenBank database under accession number EU022526 for TCoV-ATCC and EU022525 for TCoV-540.

3.2. Polyprotein gene of TCoV

The sequence of polymerase gene 1 of TCoV isolate ATCC contained 20,441-nts, excluding the 5′ UTR. Two ORFs were encoded by gene 1. ORF1a contained 11,874 nts (529–12,402) encoding a protein of 3957 aa (pp1a); ORF1b contained 7965 nt (12,477–20,441) encoding a protein of 2654 aa (pp1b) (Table 1 ). The polyprotein gene of the TCoV 540 isolate consisted of 20,401 nts excluding the 5′ UTR. ORF 1a was 11,838 nts (531–12,368), encoding pp1a of 3945 aa, and ORF 1b was 7959 nts (12,443–20,401), encoding a protein of 2652 aa (Table 2 ). Through −1 frameshift translation, pp1ab was predicted to contain 6637 aa for ATCC and 6623 aa for 540. The 3′ end of ORF1b overlapped with the 50 nts on the 5′ end of spike gene. There were 14 aa missing in 540 pp1ab when compared with ATCC pp1ab. They were distributed at 7 positions on pp1ab of the ATCC isolate, i.e. positions 922–923 (2 aa, nsp3); 930 (1 aa, nsp3); 971–973 (3 aa, nsp3); 2306–2307 (2 aa, nsp4); 3226–3229 (4 aa, nsp6); 4234 (1 aa, nsp12); 5095 (1 aa, nsp13). ClustalW comparison of the protein sequence between 540 and ATCC showed that the sequence identities of pp1a, pp1b, and pp1ab were 89.92%, 95.86%, and 92.26%, respectively. The overall similarities for pp1a, pp1b, and pp1ab were 97.4%, 98.91%, and 97.97%, respectively.

Table 1.

Open reading frames encoded in TCoV-ATCC genome

ORF Location Size (nt) Size (aa/kDa)
1a 529–12,402 11,874 3957/441.130
1b 12,477–20,441 7,965 2654/300.788
S (2) 20,392–24,003 3,612 1203/132.168
3a 24,003–24,176 174 57/6.680
3b 24,176–24,370 195 64/7.395
E (3c) 24,351–24,662 312 103/11.451
M (4a) 24,652–25,323 672 223/25.153
4b 25,324–25,608 285 94/11.180
4c 25,529–25,687 159 52/6.297
5a 25,684–25,881 198 65/7.502
5b 25,878–26,126 249 82/9.354
N (6a) 26,069–27,298 1,230 409/45.069
6b 27,307–27,531 225 74/8.744

Table 2.

Open reading frames encoded in TCoV-540 genome

ORF Location Size (nt) Size (aa/kDa)
1a 531–12,368 11,838 3945/439.617
1b 12,443–20,401 7,959 2652/300.298
S (2) 20,352–23,963 3,612 1203/132.106
3a 23,963–24,136 174 57/6.638
3b 24,136–24,330 195 64/7.412
E (3c) 24,311–24,610 300 99/11.051
M (4a) 24,610–25,278 669 222/25.141
4b 25,279–25,563 285 94/11.075
4c 25,484–25,654 171 56/6.280
5a 25,638–25,835 198 65/7.544
5b 25,832–26,074 243 80/9.323
N (6a) 26,017–27,246 1,230 409/45.005
6b 27,255–27,476 222 73/8.604

The frameshift “slippery sequence” UUUAAAC (Brierley et al., 1989) was identified for both ATCC and 540. Both sequences were located before the end of ORF1a. The sequences downstream of UUUAAAC were predicted to form a pseudoknot to support the translational frameshift (Brierley et al., 1989) (Supplementary Fig. S1). The frameshift position was predicted at C of UUUAAAC.

Comparison of pp1a and pp1ab of TCoV with those of other coronaviruese revealed that the TCoV polyprotein was predicted to be processed into 15 non-structure proteins (nsp2–nsp16; Fig. 1(b) and Table 3 ) by polyprotein-encoded viral proteinases. One 3C-like proteinase (3CLpro) was predicted to reside in nsp5 due to its conserved residues responsible for 3CLpro activity (Supplementary Fig. S2) (Ziebuhr et al., 2000); one papain-like proteinase (PLpro) was identified in nsp3 due to its conserved PLP residues (CHD) (Supplementary Fig. S3). Like another group 3 coronavirus, IBV, only one active PLpro was predicted for TCoV. The structure of TCoV nsp3 bears similar organization to nsp3 of IBV in that the Ac domain, X domain (ADPR), and Y domain were all present and arranged in the same order (Fig. 1(b)). Comparison of amino acid sequences of each nsp of TCoV with those of other coronaviruses predicted several putative enzymatic activities; among them, the enzymatic activity and potentials of nsp2, nsp5 (Supplementary Fig. S3), nsp13 (Supplementary Fig. S5), nsp14 (Supplementary Fig. S6), and nsp15 (Supplementary Fig. S7) were confirmed in other coronaviruses by experimentation (Bhardwaj et al., 2004, Eckerle et al., 2007, Fang et al., 2006, Graham et al., 2005, Kiemer et al., 2004). Nsp8, nsp9, and nsp10 were predicted to have RNA binding activity (Egloff et al., 2004, Matthes et al., 2006, Zhai et al., 2005). Nsp12 was predicted to be the major RdRp (Supplementary Fig. S4), though its activity has not been experimentally confirmed. Nsp16 was predicted to be 2′-O-methyltransferase (Supplementary Fig. S8).

Fig. 1.

Fig. 1

(a) Genome organization of TCoV. Diagram shows putative ORFs. UTRs, leader (L), and TRS are not to scale. Above, the genome organization of TCoV are shown the predicted five sgRNA in relative sizes. Genome organization of IBV-Beaudette (NC_001451) is displayed below for comparison with that of TCoV. (b) Mapping of TCoV polyprotein. Predicted non-structure proteins (nsp2–nsp16) for pp1a are shown in relative sizes (bottom panel). Nsp1 is missing from TCoV and nsp11 contains only 23 aa. The sequence is for ATCC isolate (accession number EU022526).

Table 3.

Polyprotein mapping for TCoV ATCC

Cleavage products Polyprotein Position on polyprotein Size (aa) Cleavage by Potential function
nsp2 Pp1ab/pp1a M1-G673 673 PLP
nsp3 Pp1ab/pp1a G674-G2267 1,594 PLP TM1, PLpro, ADRP
nsp4 Pp1ab/pp1a G2268-Q2781 514 PLP/3CLpro TM2
nsp5 Pp1ab/pp1a A2782-Q3088 307 3CLpro 3CLpro
nsp6 Pp1ab/pp1a S3089-Q3385 297 3CLpro TM3
nsp7 Pp1ab/pp1a S3386-Q3468 83 3CLpro
nsp8 Pp1ab/pp1a S3469-Q3678 210 3CLpro dsRNA binding; RdRp?
nsp9 Pp1ab/pp1a N3679-Q3789 111 3CLpro TM4; ssRNA binding
nsp10 Pp1a S3790-Q3934 145 3CLpro Zinc-binding; ssRNA binding
nsp11 Pp1ab S3935-G3957 23 3CLpro
nsp12 Pp1ab S3935-Q4875 941 3CLpro RdRp
nsp13 Pp1ab S4876-Q5476 601 3CLpro Helicase
nsp14 Pp1ab G5477-Q5997 521 3CLpro Exoribonuclease
nsp15 Pp1ab S5998-Q6335 338 3CLpro NendoU
nsp16 Pp1ab S6336-M6637 302 3CLpro 2′-O-Methyltransferase

3.3. Genome organization of TCoV

The first two full-length genome sequences were reported for TCoV prototype ATCC and field isolate 540. The complete genome sequences were obtained by assembly of polyprotein gene sequences that were determined by direct sequencing of cloned RT-PCR products in this report and published structure gene sequences of the same isolates from our lab (Lin et al., 2004, Loa et al., 2006). Both 5′ and 3′ UTR sequences were determined by RACE and used to assembly the full-length genomic sequence. The reported genomic sequences were 27,817 nucleotides (nt) for ATCC and 27,749 nt for 540, excluding poly(A) tail. For both TCoV isolates, the percentage of nucleotide composition was 29% for A, 33% for U, 22% for G, and 16% for C. A + U was 62%, indicating that the genome of TCoV was AU rich. The genome nucleotide sequence identity between 540 and ATCC was 92.8% by Clastal W. 540 and ATCC shared nucleotide sequence identity of 86.9% and 87.5% with that of IBV, respectively.

Analysis of genome organization of the TCoV-ATCC isolate revealed that there was a 64-nt (1–64) leader sequence within the 5′ UTR of 530 nt. As found in other coronaviruses (Brian and Baric, 2005), the 5′ UTR of TCoV encoded an ORF of 11 amino acids (Supplementary Table S2). Using the ORF finder at NCBI, it was revealed that there were 13 putative ORFs in the genomes of TCoV isolates ATCC and 540. These ORFs were 1a, 1b, 2 (spike), 3a, 3b, 3c (envelope), 4a (matrix), 4b, 4c, 5a, 5b, 6a (nucleocapsid), and 6b (Fig. 1(a); Table 1, Table 2). 4b came immediately after the matrix gene. 6b was immediately following N gene. By comparison with another group 3 coronavirus, IBV-Beaudette, it was found out that 4c and 6b were not present in IBV-Beaudette (Fig. 1(a)). The prediction of 6b was not expected. After N gene, the nucleotide sequences of TCoV and IBV-Beaudette were highly conserved (Supplementary Fig. S9). However, there was no ORF in this region of IBV, so the 3′ UTR of IBV was over 500 nt. In both isolates of TCoV, a 74-aa ORF (6b) was predicted in this region irrespective of nucleotide sequence conservation between TCoV and IBV. The prediction of ORF 6b reduced the potential 3′ UTR of TCoV to less than 301-nts as compared with 506-nts in IBV. Determination of whether or not proteins of 4b, 4c, and 6b were produced requires further experimental confirmation. A consensus octanucleotide motif GGAAGAGC was found 72-nt upstream of the poly(A) tail in 540 and ATCC genomes of the TCoV. In mouse hepatitis virus, the octanucleotide motif was found to be unnecessary for virus replication in vitro, but a deletion mutant showed reduced replication in mouse brain, suggesting that the octanucleotide motif affects pathogenesis (Goebel et al., 2007). A consensus transcriptional regulated sequence (TRS) (CUUAACAAA) was found located at the 3′ end of the genome leader (1–64 nt) and in front of each structure gene and major accessory gene with either an exact match (sg3–6) or one mismatch (sg2). A total of five sgRNA were predicted for production of structure and accessory proteins in the genome of TCoV.

3.4. Phylogenetic analysis of 1ab

Clastal W program was used to analyze the relationship between TCoV pp1ab and other coronavirus pp1ab. Table 4 was a summary of the amino acid sequence identity of nsps between TCoV ATCC and other coronaviruses. It was noticed that TCoV and IBV shared highest sequence identity for all nsps when compared with other coronaviruses. Tree-top software was used to draw phylogenetic trees (http://www.genebee.msu.su/services/phtree_full.html). Fig. 2 shows the result of phylogenetic analysis of pp1ab. The TCoV was grouped with the IBV in the group 3. A close examination of TCoV and IBV polyprotein pp1ab showed that the matrix distance within the two TCoV strains was longer (0.047) than that of TCoV and IBV (0.045). ClustalW analysis of pp1ab, 3CLpro, RdRp, and helicase of TCoV and IBV showed sequence similarity of 97.97%, 94.09%, 98.5%, and 97.66%, respectively.

Table 4.

Sequence identity of TCoV-ATCC nsps with other coronaviruses

TCoV-540 IBV SARS BCoV HCoV-OC43 MHV BtCoV HCoV-HKU1 FCoV TGEV PEDV HCoV-229E HCoV-NL63
nsp2 93.6 92.1 5.5 6.0 5.5 5.6 7.3 4.9 10.7 11.1 10.7 5.8 7.3
nsp3 88.7 87.4 16.9 17.6 16.7 17.3 19.9 17.9 15.4 16.2 16.9 14.8 15.1
nsp4 83.4 90.9 25.6 27.8 28.4 25.6 25.1 27.0 22.2 21.6 22.7 23.1 22.4
nsp5 89.3 88.3 37.3 41.6 42.6 40.6 38.2 39.9 38.4 39.7 41.1 38.4 37.3
nsp6 91.5 93.9 22.1 18.1 16.7 20.6 20.2 19.9 17.7 15.3 19.3 21.9 22.2
nsp7 98.8 97.6 39.8 47.0 45.8 41.0 42.2 44.6 41.0 42.2 37.3 41.0 41.0
nsp8 94.8 95.2 39.4 39.1 39.1 38.6 38.7 41.1 40.0 39.0 36.9 37.9 39.0
nsp9 95.5 99.1 40.5 41.8 40.9 37.3 40.0 38.2 32.4 34.2 34.3 31.2 30.3
nsp10 94.5 95.9 56.1 48.2 48.2 48.2 53.2 48.2 51.9 49.6 50.4 52.6 50.4
nsp12 95.0 96.1 59.9 59.3 59.2 59.1 59.1 58.8 57.2 57.0 56.6 56.1 56.0
nsp13 97.2 96.5 57.9 59.1 59.4 58.7 57.0 57.1 56.1 56.6 54.4 54.4 54.8
nsp14 95.4 96 52.4 53.4 53.4 54.1 51.1 54.7 49.5 49.5 52.2 50.4 50.3
nsp15 95.6 94.7 38.8 36.7 36.4 34.6 35.5 36.1 37.6 37.9 39.3 38.5 38.8
nsp16 94.0 90.7 52.7 51.5 50.5 52.5 48.0 50.8 47.7 47 52.5 50.7 54.0

Fig. 2.

Fig. 2

Phylogenetic relationship between TCoV and other coronaviruses pp1ab. The map was generated by Phylip at http://www.genebee.msu.su/services/phtree_full.html. Sequences of pp1ab of coronaviruses were used for analysis. The sequences accession numbers are listed at the end of the article.

3.5. Subgenomic mRNA detection for TCoV

Based on the location of TRS on the genome, it was predicted that 5 subgenomic mRNA would be produced for structure and accessory gene translation (Fig. 1).

To confirm predicted sgRNA production for TCoV, total RNA was isolated from mock or ATCC-infected turkey small intestines and used for Northern blotting with 32P-labeled PCR probe specific for the N gene. Fig. 3 shows 7 RNA bands detected in the ATCC infected sample, but not in the mock-infected sample, indicating the specificity of the probe. Based on predicted sizes for genomic and subgenomic RNA, one band was assigned to genomic RNA and five bands were assigned to sgRNA 2–6 for expression of S, E, M, 5, and N proteins. One extra band whose size was smaller than genomic RNA was assumed to be a defect interfering (DI) RNA. DI RNA has been detected in other coronavirus-infected cells and was assumed to be the template switch products during replication.

Fig. 3.

Fig. 3

Northern blotting of total RNA isolated from TCoV infected turkey small intestines. Total RNA (10 μg) was isolated from mock or ATCC infected turkey small intestines 3 days post infection, separated on 1% agarose gel, transferred onto nitrocellulose membrane, and detected with 32P labeled PCR probe corresponding to N gene. The sizes (kb) on the right indicate the predicted size of genomic and subgenomic RNA. X indicates assumed DI RNA.

Because TRS in sgRNA could be derived from template switch between leader and body TRS, we aimed to determine potential switch position by analyzing sequences flanking the TRS region in each sgRNA. Fig. 4 is a summary of partial sequences flanking the TRS region for each sgRNA. It was noticed that the TRS (CUUAACAAA) of the S gene sgRNA was identical to the TRS of the leader, but different from the body TRS by one nucleotide (CUgAACAAA). This suggested that the template switch was downstream of CUU on the leader TRS. The TRS of the remaining sgRNA was the same as for the leader and the body TRS, implying the template switch could have occurred anywhere within CUUAACAAA. As expected, genes 3a, 3b, and 3c (E) shared the same sgRNA for translation; genes 4a (M), 4b, and 4c shared the same sgRNA; genes 5a and 5b shared the same sgRNA; genes 6a (N) and 6b shared the same sgRNA. Determination of weather or not the predicted 3a, 3b, 4b, 4c, 5b, and 6b were expressed require experimental confirmation and hence their biological functions during replication and pathogenesis.

Fig. 4.

Fig. 4

Sequences flanking TRS region for TCoV sgRNA. The partial sequences display each sgRNA for ATCC isolate. For each sgRNA, partial genomic leader (gL) and body (gS, gE, gM, g5, and gN) sequences are displayed above and below sgRNA. The star (*) indicates identical nucleotide and the box indicates TRS region where template switch is assumed to occur.

4. Conclusion

In conclusion, our data of completed TCoV polyprotein gene sequence and the assembly of the first full-length genome of TCoV support the classification of TCoV as a group 3 coronavirus. The completed genome sequences of two TCoV isolates will aid our understanding of coronavirus in terms of molecular evolution and molecular pathogenesis. It will also provide a strong basis for the development of up-dated molecular diagnostics and recombinant or DNA-based vaccines for the control and prevention of TCoV infection in turkey flocks.

Footnotes

Appendix A

Supplementary data associated with this article can be found, in the online version, at doi:10.1016/j.virusres.2008.04.015.

Appendix A. Supplementary data

mmc1.doc (398.5KB, doc)

References

  1. Almazan F., Galan C., Enjuanes L. The nucleoprotein is required for efficient coronavirus genome replication. J. Virol. 2004;78(22):12683–12688. doi: 10.1128/JVI.78.22.12683-12688.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bhardwaj K., Guarino L., Kao C.C. The severe acute respiratory syndrome coronavirus Nsp15 protein is an endoribonuclease that prefers manganese as a cofactor. J. Virol. 2004;78(22):12218–12224. doi: 10.1128/JVI.78.22.12218-12224.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Boursnell M.E., Brown T.D., Foulds I.J., Green P.F., Tomley F.M., Binns M.M. Completion of the sequence of the genome of the coronavirus avian infectious bronchitis virus. J. Gen. Virol. 1987;68(Pt 1):57–77. doi: 10.1099/0022-1317-68-1-57. [DOI] [PubMed] [Google Scholar]
  4. Bredenbeek P.J., Pachuk C.J., Noten A.F., Charite J., Luytjes W., Weiss S.R., Spaan W.J. The primary structure and expression of the second open reading frame of the polymerase gene of the coronavirus MHV-A59: a highly conserved polymerase is expressed by an efficient ribosomal frameshifting mechanism. Nucleic Acids Res. 1990;18(7):1825–1832. doi: 10.1093/nar/18.7.1825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Breslin J.J., Smith L.G., Fuller F.J., Guy J.S. Sequence analysis of the matrix/nucleocapsid gene region of turkey coronavirus. Intervirology. 1999;42(1):22–29. doi: 10.1159/000024956. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Breslin J.J., Smith L.G., Fuller F.J., Guy J.S. Sequence analysis of the turkey coronavirus nucleocapsid protein gene and 3′ untranslated region identifies the virus as a close relative of infectious bronchitis virus. Virus Res. 1999;65(2):187–193. doi: 10.1016/S0168-1702(99)00117-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Brian D.A., Baric R.S. Coronavirus genome structure and replication. Curr. Top. Microbiol. Immunol. 2005;287:1–30. doi: 10.1007/3-540-26765-4_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Brierley I., Digard P., Inglis S.C. Characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an RNA pseudoknot. Cell. 1989;57(4):537–547. doi: 10.1016/0092-8674(89)90124-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Cavanagh D. Coronaviruses in poultry and other birds. Avian Pathol. 2005;34(6):439–448. doi: 10.1080/03079450500367682. [DOI] [PubMed] [Google Scholar]
  10. Eckerle L.D., Lu X., Sperry S.M., Choi L., Denison M.R. High fidelity of murine hepatitis virus replication is decreased in nsp14 exoribonuclease mutants. J. Virol. 2007;81(22):12135–12144. doi: 10.1128/JVI.01296-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Egloff M.P., Ferron F., Campanacci V., Longhi S., Rancurel C., Dutartre H., Snijder E.J., Gorbalenya A.E., Cambillau C., Canard B. The severe acute respiratory syndrome-coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world. Proc. Natl. Acad. Sci. U.S.A. 2004;101(11):3792–3796. doi: 10.1073/pnas.0307877101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Fang S., Chen B., Tay F.P., Ng B.S., Liu D.X. An arginine-to-proline mutation in a domain with undefined functions within the helicase protein (Nsp13) is lethal to the coronavirus infectious bronchitis virus in cultured cells. Virology. 2006 doi: 10.1016/j.virol.2006.08.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Goebel S.J., Miller T.B., Bennett C.J., Bernard K.A., Masters P.S. A hypervariable region within the 3′ cis-acting element of the murine coronavirus genome is nonessential for RNA synthesis but affects pathogenesis. J. Virol. 2007;81(3):1274–1287. doi: 10.1128/JVI.00803-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gonzalez J.M., Gomez-Puertas P., Cavanagh D., Gorbalenya A.E., Enjuanes L. A comparative sequence analysis to revise the current taxonomy of the family Coronaviridae. Arch. Virol. 2003;148(11):2207–2235. doi: 10.1007/s00705-003-0162-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Graham R.L., Sims A.C., Brockway S.M., Baric R.S., Denison M.R. The nsp2 replicase proteins of murine hepatitis virus and severe acute respiratory syndrome coronavirus are dispensable for viral replication. J. Virol. 2005;79(21):13399–13411. doi: 10.1128/JVI.79.21.13399-13411.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Hegyi A., Ziebuhr J. Conservation of substrate specificities among coronavirus main proteases. J. Gen. Virol. 2002;83(Pt 3):595–599. doi: 10.1099/0022-1317-83-3-595. [DOI] [PubMed] [Google Scholar]
  17. Herold J., Siddell S.G. An ‘elaborated’ pseudoknot is required for high frequency frameshifting during translation of HCV 229E polymerase mRNA. Nucleic Acids Res. 1993;21(25):5838–5842. doi: 10.1093/nar/21.25.5838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Ivanov K.A., Hertzig T., Rozanov M., Bayer S., Thiel V., Gorbalenya A.E., Ziebuhr J. Major genetic marker of nidoviruses encodes a replicative endoribonuclease. Proc. Natl. Acad. Sci. U.S.A. 2004;101(34):12694–12699. doi: 10.1073/pnas.0403127101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ivanov K.A., Thiel V., Dobbe J.C., van der Meer Y., Snijder E.J., Ziebuhr J. Multiple enzymatic activities associated with severe acute respiratory syndrome coronavirus helicase. J. Virol. 2004;78(11):5619–5632. doi: 10.1128/JVI.78.11.5619-5632.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Ivanov K.A., Ziebuhr J. Human coronavirus 229E nonstructural protein 13: characterization of duplex-unwinding, nucleoside triphosphatase, and RNA 5′-triphosphatase activities. J. Virol. 2004;78(14):7833–7838. doi: 10.1128/JVI.78.14.7833-7838.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kiemer L., Lund O., Brunak S., Blom N. Coronavirus 3CLpro proteinase cleavage sites: possible relevance to SARS virus pathology. BMC Bioinformatics. 2004;5:72. doi: 10.1186/1471-2105-5-72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lai M.M., Stohlman S.A. Comparative analysis of RNA genomes of mouse hepatitis viruses. J. Virol. 1981;38(2):661–670. doi: 10.1128/jvi.38.2.661-670.1981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Lee H.J., Shieh C.K., Gorbalenya A.E., Koonin E.V., La Monica N., Tuler J., Bagdzhadzhyan A., Lai M.M. The complete sequence (22 kilobases) of murine coronavirus gene 1 encoding the putative proteases and RNA polymerase. Virology. 1991;180(2):567–582. doi: 10.1016/0042-6822(91)90071-I. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Lin T.L., Loa C.C., Wu C.C. Complete sequences of 3’ end coding region for structural protein genes of turkey coronavirus. Virus Res. 2004;106(1):61–70. doi: 10.1016/j.virusres.2004.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Liu D.X., Shen S., Xu H.Y., Wang S.F. Proteolytic mapping of the coronavirus infectious bronchitis virus 1b polyprotein: evidence for the presence of four cleavage sites of the 3C-like proteinase and identification of two novel cleavage products. Virology. 1998;246(2):288–297. doi: 10.1006/viro.1998.9199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Loa C.C., Lin T.L., Wu C.C., Bryan T., Hooper T., Schrader D. Specific mucosal IgA immunity in turkey poults infected with turkey coronavirus. Vet. Immunol. Immunopathol. 2002;88(1/2):57–64. doi: 10.1016/S0165-2427(02)00135-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Loa C.C., Wu C.C., Lin T.L. Comparison of 3′-end encoding regions of turkey coronavirus isolates from Indiana, North Carolina, and Minnesota with chicken infectious bronchitis coronavirus strains. Intervirology. 2006;49(4):230–238. doi: 10.1159/000091470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Mathews D.H., Sabina J., Zuker M., Turner D.H. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J. Mol. Biol. 1999;288(5):911–940. doi: 10.1006/jmbi.1999.2700. [DOI] [PubMed] [Google Scholar]
  29. Matthes N., Mesters J.R., Coutard B., Canard B., Snijder E.J., Moll R., Hilgenfeld R. The non-structural protein Nsp10 of mouse hepatitis virus binds zinc ions and nucleic acids. FEBS Lett. 2006;580(17):4143–4149. doi: 10.1016/j.febslet.2006.06.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Minskaia E., Hertzig T., Gorbalenya A.E., Campanacci V., Cambillau C., Canard B., Ziebuhr J. Discovery of an RNA virus 3′->5′ exoribonuclease that is critically involved in coronavirus RNA synthesis. Proc. Natl. Acad. Sci. U.S.A. 2006;103(13):5108–5113. doi: 10.1073/pnas.0508200103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Nagaraja K.V., Pomeroy B.S. Coronaviral enteritis of turkeys (bluecomb disease) In: Calnek B.W., Barnes H.J., Beard C.W., Reid W.M., Yoda H.W., editors. Disease of Poultry. Iowa State University Press; Ames: 1997. [Google Scholar]
  32. Ritchie A.E., Deshmukh D.R., Larsen C.T., Pomeroy B.S. Electron microscopy of coronavirus-like particles characteristic of turkey bluecomb disease. Avian Dis. 1973;17(3):546–558. [PubMed] [Google Scholar]
  33. Sawicki S.G., Sawicki D.L. Coronaviruses use discontinuous extension for synthesis of subgenome-length negative strands. Adv. Exp. Med. Biol. 1995;380:499–506. doi: 10.1007/978-1-4615-1899-0_79. [DOI] [PubMed] [Google Scholar]
  34. Sawicki S.G., Sawicki D.L., Siddell S.G. A contemporary view of coronavirus transcription. J. Virol. 2007;81(1):20–29. doi: 10.1128/JVI.01358-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Schelle B., Karl N., Ludewig B., Siddell S.G., Thiel V. Selective replication of coronavirus genomes that express nucleocapsid protein. J. Virol. 2005;79(11):6620–6630. doi: 10.1128/JVI.79.11.6620-6630.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Snijder E.J., Bredenbeek P.J., Dobbe J.C., Thiel V., Ziebuhr J., Poon L.L., Guan Y., Rozanov M., Spaan W.J., Gorbalenya A.E. Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage. J. Mol. Biol. 2003;331(5):991–1004. doi: 10.1016/S0022-2836(03)00865-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Stephensen C.B., Casebolt D.B., Gangopadhyay N.N. Phylogenetic analysis of a highly conserved region of the polymerase gene from 11 coronaviruses and development of a consensus polymerase chain reaction assay. Virus Res. 1999;60(2):181–189. doi: 10.1016/S0168-1702(99)00017-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Thompson J.D., Higgins D.G., Gibson T.J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22(22):4673–4680. doi: 10.1093/nar/22.22.4673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Weiss S.R., Hughes S.A., Bonilla P.J., Turner J.D., Leibowitz J.L., Denison M.R. Coronavirus polyprotein processing. Arch. Virol. Suppl. 1994;9:349–358. doi: 10.1007/978-3-7091-9326-6_35. [DOI] [PubMed] [Google Scholar]
  40. Yount B., Curtis K.M., Baric R.S. Strategy for systematic assembly of large RNA and DNA genomes: transmissible gastroenteritis virus model. J. Virol. 2000;74(22):10600–10611. doi: 10.1128/jvi.74.22.10600-10611.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Zhai Y., Sun F., Li X., Pang H., Xu X., Bartlam M., Rao Z. Insights into SARS-CoV transcription and replication from the structure of the nsp7-nsp8 hexadecamer. Nat. Struct. Mol. Biol. 2005;12(11):980–986. doi: 10.1038/nsmb999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Ziebuhr J. The coronavirus replicase. Curr. Top. Microbiol. Immunol. 2005;287:57–94. doi: 10.1007/3-540-26765-4_3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Ziebuhr J., Snijder E.J., Gorbalenya A.E. Virus-encoded proteinases and proteolytic processing in the Nidovirales. J. Gen. Virol. 2000;81(Pt 4):853–879. doi: 10.1099/0022-1317-81-4-853. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.doc (398.5KB, doc)

Articles from Virus Research are provided here courtesy of Elsevier

RESOURCES