Abstract
We have previously reported the nucleotide sequences of gene 2 (spike (S) protein gene), gene 6 (nucleocapsid (N) protein gene), and the 3′ end untranslated region of a novel avian infectious bronchitis virus (IBV) strain, CU-T2 [Jia et al. (1995) Arch. Virol. 140, 259–271]. In the present report we describe the sequences of the remaining genes of this strain (gene 3, 4 and 5) with the exception of gene 1 (RNA polymerase gene). Gene 3 contained three open reading frames (ORFs), 3a, 3b and 3c of 174, 195 and 282 nucleotides (nt), respectively. Gene 4 (membrane (M) protein gene) consisted of 749 nt with a single ORF of 687 nt. Gene 5 contained two ORFs, 5a and 5b, with 198 and 249 nt, respectively. Thus, in total, there were 7349 nt from the 5′ end of S protein gene to the 3′ end of the CU-T2 genome. The overall nt sequence homologies between gene 3, 4, and 5 of CU-T2 and those of other strains were between 84.1–90.8%, 85.8–88.8% and 90.4–96.4%, respectively. The predicted amino acid (aa) sequence homologies revealed that gene 3b and 5b were more conserved than 3a, 3c and 5a. Each individual gene of CU-T2 strain (with the exception of the RNA polymerase gene) had a different level of homology with the homologous gene of other strains, suggesting that the evolution of IBV strains in general has been a complex, and as yet, poorly understood process.
© 1997 Elsevier Science B.V. All rights reserved.
Keywords: Coronaviridae, Coronavirus, IBV, Virus evolution, Cloning, Recombinant DNA, Membrane protein gene, Small membrane protein gene
Abbreviations: aa, amino acid(s); Ark, Arkansas; bp, base pair(s); Escherichia coli (E. coli); IBV, infectious bronchitis virus; kb, kilobase(s) or 1000 bp; M, membrane; Mass, Massachusetts; N, nucleocapsid; nt, nucleotide(s); oligo, oligodeoxyribonucleotide; ORF, open reading frame(s); S, spike; sM, small membrane; UTR, untranslated region.
1. Introduction
Infectious bronchitis virus (IBV), a member of Coronaviridae family, has a single-stranded positive-sense RNA genome, which is 27 kb in length (Boursnell et al., 1987). Gene 1 (RNA polymerase gene) is located on the 5′ end of the genome and is about 20 kb long. The remaining 7 kb of the genome consists of five genes: gene 2 (spike glycoprotein (S protein) gene), gene 3, gene 4 [membrane (M) protein gene], gene 5 and gene 6 (nucleocapsid (N) protein gene). In addition, an untranslated region (UTR) exists on the extreme 3′ end of the genome. Gene 3 and gene 5 contain three (3a, 3b, 3c) and two (5a, 5b) open reading frames (ORFs), respectively. With the exception of gene 3c, which was recently shown to encode small membrane (sM) protein (Liu and Inglis, 1991; Smith et al., 1990), the products of all the other ORFs remain unknown.
Although IBV is the lone virus in the antigenic group III of Coronaviridae (Holmes, 1990), more than twenty serotypes within IBV have been recognized worldwide on the basis of virus-neutralization (VN) test (Gelb et al., 1991). IBV strains may undergo substitution of large genomic fragments in multiple genes (Kusters et al., 1990; Wang et al., 1993; Jia et al., 1995). Although the individual genes (mostly S protein genes) of many IBV isolates have been sequenced (Binns et al., 1985, Binns et al., 1986a, Binns et al., 1986b; Boursnell et al., 1985; Cavanagh and Davis, 1988, Cavanagh and Davis, 1992; Cavanagh et al., 1992a; Jia et al., 1995; Wang et al., 1993; Williams et al., 1992, Williams et al., 1993), the complete sequence of the 3′ end of the genome (whole genome except the RNA polymerase gene) of only one strain (Beaudette strain) of serotype Massachusetts (Mass), and a Japanese strain KB8523 have been determined so far (Boursnell et al., 1987; Sutou et al., 1988). The IBV strain CU-T2 described in this report was a unique strain which had apparently originated from Arkansas (Ark) serotype, had undergone recombination in two genes, and expressed both Ark and Mass epitopes on the S1 protein (Jia et al., 1995). In the previous study, we had reported the sequences of the S protein gene, the N protein gene and the 3′-UTR of this strain. Here, we report the sequences of all the remaining 3′ end genes of CU-T2: gene 3, M protein gene and gene 5.
2. Experimental and discussion
2.1. Clones containing cDNA of CU-T2 genes
Cloning of gene 3, M protein gene and gene 5 of CU-T2 are described in Fig. 1 . Three overlapping phagemids, pBT2-t223, pBT2-t231 and pBT2-t237, were used for sequencing in this study. pBT2-t223 contained a 2172 bp fragment which overlapped the 3′ end of the S protein gene and most of the 5′ end of gene 3 (Fig. 1). pBT2-t231 had a 2100 bp insert which overlapped partially pBT2-t223, and contained the entire gene 3, M protein gene, and partial 5′ end of gene 5 (Fig. 1). pBT2-t237 contained a 3550 bp fragment which overlapped a region extending from the 3′ end of S protein gene to the 3′ end of N protein gene (Fig. 1).
Fig. 1.

cDNA clones of CU-T2. IBV strain CU-T2 (Jia et al., 1995) was propagated in 11-day-old specific-pathogen-free (SPF) embryonated chicken eggs. Allantoic fluid collected 40 h after virus inoculation was stored at −70°C. Virus was purified by sucrose density gradient centrifugation (Sutou et al., 1988), and genomic RNA extracted with phenol-chloroform was used for cDNA synthesis. An oligonucleotide primer complementary to a conserved region of the S genes of several Mass serotype strains containing an XhoI restriction site and an eight nucleotide spacer (5′-GAACTAGTCTCGAGGAAGGACGTGGGACTTTG-3′) was synthesized. This primer was found to bind to downstream 3′ end sequences of several genes (i.e., gene 3, M protein gene, and N protein gene) on the IBV genome, therefore it was used for the first strand cDNA synthesis. After blunt-ending the cDNA with T4 DNA polymerase, EcoRI adapters were added to the ends of the cDNAs. Following EcoRI and XhoI digestion, the cDNA fragments were cloned into EcoRI/XhoI sites in λ-ZAP vector (Stratagene, La Jolla, CA, USA). The cDNA library was packaged using Gigapack II Gold packaging extract (Stratagene). Escherichia coli (E. coli) containing the library were plated onto NZY plates and duplicated onto nitrocellulose filters. The positive plaques were identified by hybridization with a 5′ end-radiolabeled probe prepared from a StyI fragment derived from plasmid pBSM41M, which contained the S gene of Mass41. Phagemids containing CU-T2 genes were obtained by in vivo excision in (E. coli) strain XL-1 using a protocol furnished by Stratagene. A total of 27 positive clones were obtained. Three of them were characterized.
2.2. Sequences of gene 3, M protein gene, gene 5
Gene 3 of CU-T2 consisted of 711 nt, which contained three ORFs, 3a, 3b and 3c. The ORFs consisted of 174, 195 and 282 nt (potentially encode 58, 65 and 94 aa), respectively. 32 nt of the coding sequence of the S protein gene overlapped gene 3. M protein gene consisted of 749 nt, and with a single ORF of 687 nt (encoded 229 aa). There were 66 nt overlapping between the 3′ end of ORF 3c and 5′ end of M protein gene. There was a non-coding region consisting of 342 nt between 3′ end of M protein gene and gene 5. Gene 5 had 460 nt, and contained two ORFs (5a and 5b) which consisted of 198 and 249 nt (potentially encoded 66 and 83 aa), respectively. 159 nt of ORF 5b overlapped the 5′ end sequence of N protein gene (see Fig. 2 ).
Fig. 2.


Nucleotide sequences of gene 3, M protein gene, and gene 5 of IBV CU-T2. Heavy underlines indicate the putative start codons. Light lines above the sequence indicate the stop codons. The conserved nt sequence CTGAACAA or CTTAACAA, which is located at the starting site of each gene, is boxed. Three overlapping phagemids pBT2-t223, pBT2-t231 and pBT2-t237 containing CU-T2 genes were used for DNA sequencing which was carried out by the dideoxy method using a Sequenase 2.0 DNA sequencing kit (US Biolchemical, Cleveland, OH, USA). The nucleotide sequences reported here have been deposited with the GenBank. The accession Nos. are as follows: Gene 3 of IBV CU-T2, U46036; M protein gene (gene 4) of IBV CU-T2, U46035; Gene 5 of IBV CU-T2, U46037; Complete sequences of the 3′ end genome (whole genome except the RNA polymerase gene) of IBV CU-T2, U49858.
2.3. Comparison of the sequences
Six IBV strains whose nt sequences of the gene 3 are available, are listed in Table 1 . Of those strains, UK/183/66 and UK/68/84 are non-Mass serotype strains (Cavanagh et al., 1992b), whereas Portugal/322/82 strain belongs to Mass serotype (Cavanagh et al., 1992b). However, the latter strain and strain KB8523 were possible recombinants, with recombinant fragments located between 3′ end of S protein gene and gene 3. (Cavanagh et al., 1992b; Sutou et al., 1988). The overall nt sequence of CU-T2 gene 3 compared with sequences of the six strains listed in Table 1, which showed homologies between 84.1% and 90.8%. Except Portugal/322/82 and KB8523, however, there were only about 85% homologies between the CU-T2 and the prototype Mass strains (i.e., Mass41 and Beaudette) or the non-Mass and non-Ark serotype. It was earlier reported that some non-Mass IBV isolates contained deletions in the 3′ end of gene 3, and, as an exception, the Mass serotype Portugal/322/82 also had a similar deletion (Cavanagh and Davis, 1988; Cavanagh et al., 1992b). In this study, CU-T2 was found to have a 23 nt deletion in that region compared to a 32 nt deletion in Portugal/322/82, a 9 nt deletion in UK/68/84, and a 6 nt deletion in UK/183/66. Thus, results of the percentage homology and the deletions in gene 3 nt sequences suggested that Ark-like IBV has a relatively distant evolutionary relationship with the Mass serotype strains and non-Mass and non-Ark serotype. Besides the nt sequences, the deduced aa sequences of gene 3a, 3b and 3c of CU-T2 were also compared with published sequences (Table 1). The overall aa sequence homologies of 3a (between 81% and 86.2%) and 3c (between 84% and 90.4%) were close to those of the respective nt sequences. However, the homologies of putative 3b aa sequences were between 87.5% and 95.4%, which showed an average of 6.5% and 4.1% higher homologies than 3a and 3c, respectively. The result suggested that the 3b region is relatively more conserved compared to regions 3a and 3c. In addition, a hydrophobic region found at the amino-terminal region of the 3c protein, which was adjacent to a cysteine-rich region in CU-T2, was similar to that found in other coronaviruses (Siddell, 1995).
Table 1.
Percentage nt and aa homologies between coding regions of gene 3, M protein gene, and gene 5 of CU-T2 and other IBV strains
| IBV strains | Percent homology with CU-T2 |
||||||||
| Gene 3 |
M protein gene |
Gene 5 |
|||||||
| nt (%) | aa (%) |
nt (%) | aa (%) | nt (%) | aa (%) |
||||
| 3a | 3b | 3c | 5a | 5b | |||||
| Portugal/322/82a | 90.8 | 86.2 | 95.4 | 88.3 | N/A | N/A | N/A | N/A | N/A |
| KB8523b | 90.2 | 84.5 | 93.9 | 90.4 | 85.8 | 90.3 | 96.4 | 92.4 | 97.6 |
| UK/183/66a | 86.7 | 87.9 | 92.2 | 86.2 | N/A | N/A | N/A | N/A | N/A |
| Mass41c | 84.8 | 81.0 | 90.8 | 88.3 | N/A | N/A | N/A | N/A | N/A |
| Beaudetted | 84.6 | 84.5 | 89.2 | 87.2 | 88.8 | 90.7 | 90.4 | 87.9 | 95.2 |
| UK/68/84a | 84.1 | 86.2 | 87.5 | 84.0 | N/A | N/A | N/A | N/A | N/A |
| 68/2e | N/A | N/A | N/A | N/A | 86.6 | 92.0 | N/A | N/A | N/A |
Cavanagh et al., 1992a, Cavanagh et al., 1992b; bSutou et al. (1988); cNiesters et al. (1986); dBoursnell et al. (1987); eBinns et al. (1986b). N/A, not available.
Although the M protein gene of IBV was believed to be a conserved gene, comparison of nt sequences of CU-T2 M protein gene to those of three available IBV strains showed only 85.8%, 86.6% and 88.8% homology, respectively (Table 1). The aa sequences, however, showed higher homologies (90.3%, 92% and 90.7%, respectively). Three insertions (each encoding an additional aa) were found in the first 240 nt of M protein gene of CU-T2 compared to both Beaudette and KB8523 strains.
Overall nt sequence of gene 5 of CU-T2 showed 90.4% and 96.4% homology to those of Beaudette and KB8523 (Table 1). The aa sequence homology between gene 5a of CU-T2 and that of the other two strains was between 87.9% and 92.4%, and that of gene 5b was 95.2% and 97.6%, respectively (Table 1). The sequence of 5b showed 7.3% and 5.2% higher homologies than those of 5a, indicating that the 5b region was probably more conserved than the 5a region.
The sequences of gene 3, M protein gene and gene 5 of CU-T2, and those of S protein gene, N protein gene and the 3′-UTR reported previously by us (Jia et al., 1995), provided all the sequences of the genome except the RNA polymerase gene. A total of 7349 nt were found in a region beginning from the 5′ end of S protein gene to the 3′ end of CU-T2 genome, compared to 7355 and 7301 nt reported for KB8523 and Beaudette strains, respectively. The sequence CTGAACAA found at the starting sites of S protein gene and gene 3 of CU-T2, and the sequence CTTAACAA found at the start of M protein gene, gene 5 and N protein gene, respectively, were identical to those found in the identical sites in Beaudette and KB8523 strains. The overall sequence of the CU-T2 genome distal to the RNA polymerase gene was found to have 87% homology with the equivalent regions of the Beaudette and KB8523 strains (compared with 92% homology between the latter two strains themselves). This suggested once again that Ark-like IBV probably has a relatively distant evolutionary relationship with the Mass as well as non-Mass and non-Ark serotypes. Table 1 showed that the nt sequence homologies between gene 3, M protein gene and gene 5 of CU-T2 and Beaudette strain were 84.6%, 88.8% and 90.4%, respectively, while the homologies between CU-T2 and KB8523 were 90.2%, 85.8% and 96.4%. Furthermore, nt and aa sequences of other individual genes of CU-T2, Beaudette and KB8523 showed different degrees of homology (data not shown), implying that the evolution of the IBV strains might have involved a complex series of events, which are as yet poorly understood.
3. Conclusions
-
1.
The gene 3, M protein gene and gene 5 of IBV strain CU-T2 were cloned and sequenced. Gene 3 had 711 nt and three potential ORFs, 3a, 3b and 3c. The ORFs consisted of 174, 195 and 282 nt, respectively. M protein gene had 749 nt, and a single ORF of 687 nt. Gene 5 had 460 nt and two ORFs, 5a and 5b, of 198 and 249 nt, respectively. There was a non-coding region between M protein gene and gene 5 consisting of 342 nt. In total, there were 7349 nt from the 5′ end of S protein gene to the 3′ end of the CU-T2 genome.
-
2.
The nt and aa sequences of CU-T2 were compared with published sequences of seven other IBV strains. The overall nt sequence homologies between gene 3, M protein gene and gene 5 of CU-T2 and other strains were between 84.1% and 90.8%, 85.8% and 88.8%, 90.4% and 96.4%, respectively. The sequence of the CU-T2 genome distal to the RNA polymerase gene was found to have only an 87% homology with those of the Beaudette and KB8523 strains. The deletion observed in gene 3 nt sequences of CU-T2 suggested that Ark-like IBV has a relatively distant evolutionary relationship with the Mass and non-Mass and non-Ark serotype. In addition, the results of aa sequence comparisons suggested that the gene 3b and 5b were relatively more conserved than 3a, 3c and 5a.
-
3.
The nt sequence homologies between gene 3, M protein gene and gene 5 of CU-T2 and Beaudette strain were 84.6%, 88.8% and 90.4%, respectively, while homologies between the same three genes of CU-T2 and KB8523 were 90.2%, 85.8% and 96.4%. Furthermore, nt and aa sequences of other individual genes of CU-T2, Beaudette and KB8523 showed different degrees of homology, implying that the evolution of the IBV strains might have involved a complex series of events.
Acknowledgements
The clone containing Mass41 S gene was kindly provided by Dr. George Thiry, of Solvay Animal Health, Inc. This work was partially supported by a grant from Solvay Animal Health, Inc. We thank Dr. Donna Muscarella and Mr. Rodman Getchell for the review of this manuscript.
References
- Binns, M.M., Boursnell, M.E.G., Cavanagh, D., Pappin, D.J.C. and Brown, T.D.K. (1985) Cloning and sequencing of the gene encoding the spike protein of the coronavirus IBV. J. Gen. Virol. 66, 719–726. [DOI] [PubMed]
- Binns, M.M., Boursnell, M.E.G., Tomley, F.M. and Brown, T.D.K. (1986a) Comparison of the spike precursor sequences of coronavirus IBV strains M41 and 6/82 with that of IBV Beaudette. J. Gen. Virol. 67, 2825–2831. [DOI] [PubMed]
- Binns, M.M., Boursnell, M.E.G., Tomley, F.M. and Brown, T.D.K. (1986b) Nucleotide sequence encoding of the membrane protein of the IBV strain 6/82. Nucleic Acids Res. 14, 5558–5558. [PMC free article] [PubMed]
- Boursnell, M.E.G., Binns, M.M., Foulds, I.J. and Brown, T.D.K. (1985) Sequence of the nucleocapsid genes from two strains of avian infectious bronchitis virus. J. Gen. Virol. 66, 573–580. [DOI] [PubMed]
- Boursnell, M.E.G., Brown, T.D.K., Foulds, I.J., Green, P.F., Tomley, F.M. and Binns, M.M. (1987) Completion of the sequence of the genome of the coronavirus avian infectious bronchitis virus. J. Gen. Virol. 68, 55–77. [DOI] [PubMed]
- Cavanagh, D. and Davis, P.J. (1988) Evolution of avian coronavirus IBV: Sequence of the matrix glycoprotein gene and intergenic region of several serotypes. J. Gen. Virol. 69, 621–629. [DOI] [PubMed]
- Cavanagh, D. and Davis, P.J. (1992) Sequence analysis of strains of avian infectious bronchitis coronavirus isolated during the 1960s in the U.K. Arch. Virol. 130, 471–476. [DOI] [PMC free article] [PubMed]
- Cavanagh, D., Davis, P.J., Cook, K.J.A., Li, D., Kant, A. and Koch, G. (1992a) Location of the amino acid differences in the S1 spike glycoprotein protein of closely related serotypes of infectious bronchitis virus. Avian Pathol. 21, 33–43. [DOI] [PubMed]
- Cavanagh, D., Davis, P.J. and Cook, J.K.A. (1992b) Infectious bronchitis virus: evidence for recombination within the Massachusetts serotype. Avian Pathol. 21, 401–408. [DOI] [PubMed]
- Gelb Jr., J., Wolff, J.B. and Moran, C.A. (1991) Variant serotypes of infectious bronchitis virus isolated from commercial layer and broiler chickens. Avian Dis. 35, 82–87. [PubMed]
- Holmes, K.V. (1990) Coronaviridae and their replication. In: Fields, B.N. and Knipe, D.M. (Eds.), Virology. Raven Press, New York, NY.
- Jia, W., Karaca K., Parrish, C.R. and Naqi, S.A. (1995) A novel of avian infectious bronchitis virus resulting from recombination among three different strains. Arch. Virol. 140, 259–271. [DOI] [PMC free article] [PubMed]
- Kusters, J.G., Jager, E.G., Niesters, H.G.M. and Van der Zeijst, B.A.M. (1990) Sequence evidence for RNA recombination in field isolates of avian coronavirus infectious bronchitis virus. Vaccine 8, 605–608. [DOI] [PMC free article] [PubMed]
- Liu, D.X. and Inglis, S.C. (1991) Association of the infectious bronchitis virus 3c protein with the virion envelope. Virology 185, 911–917. [DOI] [PMC free article] [PubMed]
- Niesters, H.G.M., Zijderveld, A.J., Seifert, W.F., Lenstra, J.A., Bleumind-playm, N.M.C., Horzinek, M.C. and Van der Zeijst, B.A.M. (1986) Infectious bronchitis virus RNA D encodes three protein translation products. Nucleic Acids Res. 14, 3144–3144. [DOI] [PMC free article] [PubMed]
- Siddell, S. (1995) The small-membrane protein. In: Siddell, S. (Eds.), The Coronaviridae. Plenum Press, New York, pp. 181–189.
- Smith, A.R., Boursnell, M.E.G., Binns, M.M., Brown, T.D.K. and Inglis, S.G. (1990) Identification of new membrane-associated polypeptide specified by the coronavirus infectious bronchitis virus. J. Gen. Virol. 71, 3–11. [DOI] [PubMed]
- Sutou, S., Sato S., Okabe, T., Nakai, M. and Sasaki, N. (1988) Cloning and sequencing of genes encoding structural proteins of avian infectious bronchitis virus. Virology 165, 589–595. [DOI] [PubMed]
- Wang, L., Junker, D. and Collisson, E.W. (1993) Evidence of natural recombination within the S1 gene of infectious bronchitis virus. Virology 192, 710–716. [DOI] [PubMed]
- Williams, A.K., Wang, L., Sneed, L.W. and Collisson, E.W. (1992) Comparative analysis of the nucleocapsid genes of several strains of infectious bronchitis virus and other coronaviruses. Virus Res. 25, 213–222. [DOI] [PMC free article] [PubMed]
- Williams, A.K., Wang, L., Sneed, L.W. and Collisson, E.W. (1993) Analysis of a hypervariable region in the 3′ non-coding end of the infectious bronchitis virus genome. Virus Res. 28, 19–27. [DOI] [PMC free article] [PubMed]
