Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2015 Feb 25;50(3):514–517. doi: 10.1007/s11262-015-1183-z

Full genome sequence of guinea fowl coronavirus associated with fulminating disease

Mariette F Ducatez 1,, Etienne Liais 1, Guillaume Croville 1, Jean-Luc Guérin 1
PMCID: PMC7088915  PMID: 25712772

Abstract

Guinea fowl coronavirus (GfCoV), a recently characterized avian coronavirus, was identified from outbreaks of fulminating disease (peracute enteritis) in guinea fowl in France. The full-length genomic sequence was determined to better understand its genetic relationship with avian coronaviruses. The full-length coding genome sequence was 26,985 nucleotides long with 11 open reading frames and no hemagglutinin–esterase gene: a genome organization identical to that of turkey coronavirus [5′ untranslated region (UTR)—replicase (ORFs 1a, 1ab)—spike (S) protein—ORF3 (ORFs 3a, 3b)—small envelop (E or 3c) protein—membrane (M) protein—ORF5 (ORFs 4b, 4c, 5a, 5b)—nucleocapsid (N) protein (ORFs N and 6b)—3′ UTR]. This is the first complete genome sequence of a GfCoV and confirms that the new virus belongs to group gammacoronaviruses.

Electronic supplementary material

The online version of this article (doi:10.1007/s11262-015-1183-z) contains supplementary material, which is available to authorized users.

Keywords: Coronavirus, Nidovirus, Guinea fowl, Genome organization, Fulminating disease

Introduction

Coronaviruses (CoVs) are enveloped viruses with positive-sense, non-segmented RNA genomes of 25–32 kb. CoVs infect a wide range of hosts causing various degrees of morbidity and mortality. Group I CoVs (alphacoronaviruses) contain viruses that infect not only humans (HCoV-229E and HCoV-NL63) but also cats and dogs (with feline CoV and canine CoV, respectively), or pigs (with the porcine transmissible gastroenteritis virus, TGEV for example). Similarly, group II CoVs (betacoronaviruses) may infect humans (examples: HCoV-OC43, HCoV-HKU1, severe acute respiratory syndrome (SARS)-related CoVs or the recently emerged MERS-CoV), horses (with ECoV), or cattle (with BCoV). In contrast, group III CoVs (gammacoronaviruses) primarily infect birds: chickens, peafowl, and partridges harbour infectious bronchitis virus (IBV) while turkeys have turkey CoV (TCoV) and guinea fowl may be infected with guinea fowl CoV (GfCoV). Gammacoronavirus strains have however been isolated from a whale and a wild felid [1]. Group IV CoVs (deltacoronaviruses) have been detected in birds (with BuCoV, MuCoV, SpCoV, etc.), or pigs (with porcine deltacoronavirus) [2]. Interestingly CoVs of the groups I, II, and IV have been detected in Chiroptera (bats), thought to be the reservoir of CoVs [3, 4].

In the present study, we focused on a new member of the group III CoVs, GfCoV, and aimed at sequencing its full genome to better understand its molecular relationship with gammacoronaviruses.

Materials and methods

To determine the full genome of gammaCoV/guinea fowl/France/s/2011 (GfCoV/FR/2011), we first analysed the data generated on a MiSeq Illumina platform as previously described [5]. Briefly, pooled intestinal contents of experimentally infected guinea poults were clarified, ultracentrifuged, and treated with nucleases to concentrate encapsidated viral material. RNA was extracted, and a random RT-PCR was performed to generate unbiased PCR products of about 300 bp [5, 6]. The sequences generated that matched with avian CoVs sequences, as determined using GAAS software [7], were extracted for further analysis and visualized using integrative genomics viewer (IGV) with the closest blast hit as reference genome: TCoV MG10 (accession number: EU095850) [8]. Primers were designed based on the known sequence data to amplify missing genome fragments by PCR. Sanger sequencing was then performed with PCR primers. The full genome sequence was submitted to EMBL and was attributed the following accession number: [LN610099]. Sequence analysis was carried out using BioEdit version 7.0.8.0 [9], muscle for the alignment [10], and mega version 5.05 for the phylogeny [11].

Results and discussion

The gfCoV-generated sequences were assembled into one contiguous coding sequence of 26,985 nucleotides. The entire genome had a GC content of 38.3 %, identical to the turkey coronavirus (TCoV) MG10 genome [12]. GfCoV and TCoV genomes have the same organization: (i) a 5′ untranslated region (UTR), (ii) two large slightly overlapping ORFs coding for the replicase: 1a and 1ab, (iii) gene coding for the spike (S) protein, (iv) ORF3 (ORFs 3a, 3b), (v) gene coding for the small envelop (E or 3c) protein, (vi) gene coding for the membrane (M) protein, (vii) ORF5 (4b and 4c, 5a, 5b), (viii) genes coding for the nucleocapsid (N) protein (ORFs N and 6b), and (ix) 3′ UTR (Table 1). The multiprotein on single ORFs is generated by alternative translation. While the role of avian coronavirus (IBV) structural proteins is known: binding to RNA, nucleocapsid formation and role in cell-mediated immunity for N; virus budding site determination, role in virus particle assembly and in interferon-induction, interaction with viral nucleocapsid for M; association with viral envelop, role in virus particle assembly and putatively in apoptosis for E; binding to cellular receptors, induction of fusion between viral and cellular membranes, induction of neutralizing antibodies and role in cell-mediated immunity for S; little is known on the function of non-structural proteins. It has mainly been shown that they are not essential for virus replication in vitro but likely help the virus replicate in vivo [13, 14]. The proteins 3a, 3b, 4b, 5a, and N were of the same size. Sizes of other proteins varied, but within the range observed previously between different TCoV strains. Interestingly, GfCoV/FR/2011 harboured a shorter small envelop protein than its TCoV counterparts (Table 1). Further studies are warranted to understand the impact of avian CoVs protein sizes in the biology of the viruses.

Table 1.

Genes and coding regions for GfCoV/FR/2011

ORF Location (GfCoV) Size in nt (GfCoV) Size in aa (GfCoV) Size in aa (TCoV/VA-74/03)1 Size in aa (TCoV/TX-GL/01)1 Size in aa (TCoV/TX-1038/98)1 Size in aa (TCoV/IN-517/94)1 Size in aa (TCoV/ATCC)2 Size in aa (TCoV/540)2 Size in aa (TCoV/MG10)3
5′ UTR* 1–463 >463
1a 464–12,307 11,844 3948 3947 3949 3950 3952 3957 3945 3951
1ab (/1b) 464–12,280 and 12,28020,346

19884

8067

6628

2689

6596 6602 6602 6605 2654 2652 6601
S 20,294–23,914 3621 1207 1226 1225 1224 1226 1203 1203 1226
3a 23,917–24,087 171 57 57 57 57 57 57 57 57
3b 24,090–24,281 192 64 64 64 64 64 64 64 64
E (3c) 24,265–24,540 276 92 99 109 99 99 103 99 99
M 24,543–25,214 672 224 223 225 223 223 223 222 223
4b 25,218–25,499 282 94 94 94 94 94 94 94 94
4c 25,423–25,533 111 37 52 56
5a 25,578–25,772 195 65 65 65 65 65 65 65 65
5b 25,772–26,011 240 80 82 82 92 82 82 80 82
N 25,957–27,183 1227 409 409 409 409 409 409 409 409
6b 27,191–27,445 255 85 74 73
3′ UTR* 27,447 27,471

* Incomplete sequences, nt: nucleotides, aa: amino acids

1as described in [10]

2as described in [3] with 1b described rather than 1ab (size in aa in italic font)

3as described in [8]

Phylogenetic analysis on the full genome of GfCoV/FR/2011 showed it clearly clustered with North American TCoV strains (Fig. 1a, supported by a high bootstrap value of 100), as it was observed previously for the S gene [5]. The genetic distance between GfCoV/FR/2011 and TCoV ranged between 10.7 and 11.4 %, while genetic distances between GfCoV/FR/2011 and representative IBV strains were larger and varied between 13.5 and 15.0 % (Supplementary Table). A simplot analysis comparing the GfCoV/FR/2011 full genome to its closest TCoV and IBV Blast hits showed that the three genomes are highly similar throughout the genome (74–100 % similarity, with no significantly higher identity of GfCoV/FR/2011 with TCoV or IBV genomes), except for the S gene (Fig. 1b). GfCoV S gene was indeed more closely related to TCoV S than to IBV S genes but also more distinct to both viruses on the S gene than on the rest of its genome (<50 % identity for IBV and 65–90 % identity with TCoV S genes, Fig. 1b), suggesting a recombination event as was hypothesized for the origin of TCoV [15]. A parallel evolution from a common ancestor with a much higher substitution rate on the S gene than on the rest of the genome can however not be ruled out at this stage.

Fig. 1.

Fig. 1

Molecular comparison of the full genome of GfCoV/FR/2011 and avian gammacoronaviruses. a Phylogenetic analysis of the complete genomes of GfCoV/FR/2011 (in bold font) in relation to all available full genomes of turkey coronaviruses (TCoV) and full genomes of representative infectious bronchitis viruses (IBV) at the nucleotide level. The tree was generated using MEGA 5.05 and the maximum likelihood method. Bootstrap values (500 replicates) >75 are indicated on the nodes. b Simplot analysis of full genomic sequence for GfCoV/FR/2011 (query) and its closest TCoV (in blue) and IBV (in red) blast hits. The spike gene area is indicated on the plot (Color figure online)

The present study showed that GfCoV/FR/2011 harbours a genome organization very similar to that of TCoV strains. In addition, and again like TCoV, GfCoV/FR/2011 likely originated from a recombination event between an IBV-like (or TCoV-like) virus that would have given most of its genome and a so far unknown CoV that would have contributed by its spike gene. Despite the similarity of their genomes and their enteric tropism, TCoVs often cause mild clinical signs while GfCoVs are usually associated with extremely high mortalities in their host, suggesting strikingly different host–virus interactions. Further studies are ongoing to understand the host range of GfCoV/FR/2011 and its determinants of pathogenicity.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Acknowledgments

This work was supported by the ‘EPICOREM’ grant of the Agence Nationale de la Recherche (ANR), by the French Comité Interprofessionnel de la Pintade (CIP) and by the French Ministry of Agriculture. Etienne Liais is supported by a scholarship from French Ministry of Agriculture and Region Midi-Pyrénées. We would like to thank the Plateforme bioinformatique Toulouse Midi-Pyrénées, UBIA, INRA, 31326, Castanet-Tolosan, France; the GeT-PlaGe, INRA ENVT UMR444, Laboratoire de Génétique Cellulaire 31326, Castanet-Tolosan, France; and the Plateau de Génomique GeT-Purpan, UDEAR UMR 5165 CNRS/UPS, CHU PURPAN, Toulouse, France, for technical assistance with sequencing and sequence analysis.

References

  • 1.D. Gavier-widen, N. Decaro, C. Buonavoglia. in Infectious Diseases of Wild Mammals and Birds in Europe, ed. by D.J. Gavier-Widen, A. Meredith (Blackwell, 2012), pp. 234–240
  • 2.Chan JF, Li KS, To KK, Cheng VC, Chen H, Yuen KY. J. Infect. 2012;65:477–489. doi: 10.1016/j.jinf.2012.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Balboni A, Battilani M, Prosperi S. New Microbiol. 2012;35:1–16. [PubMed] [Google Scholar]
  • 4.Chen H, Guan Y, Fan X. Hong Kong Med. J. 2011;17(Suppl 6):36–40. [PubMed] [Google Scholar]
  • 5.Liais E, Croville G, Mariette J, Delverdier M, Lucas MN, Klopp C, Lluch J, Donnadieu C, Guy JS, Corrand L, Ducatez MF, Guerin JL. Emerg. Infect. Dis. 2014;20:105–108. doi: 10.3201/eid2001.130774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Victoria JG, Kapoor A, Li L, Blinkova O, Slikas B, Wang C, Naeem A, Zaidi S, Delwart E. J. Virol. 2009;83:4642–4651. doi: 10.1128/JVI.02301-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Angly FE, Willner D, Prieto-Davo A, Edwards RA, Schmieder R, Vega-Thurber R, Antonopoulos DA, Barott K, Cottrell MT, Desnues C, Dinsdale EA, Furlan M, Haynes M, Henn MR, Hu Y, Kirchman DL, McDole T, McPherson JD, Meyer F, Miller RM, Mundt E, Naviaux RK, Rodriguez-Mueller B, Stevens R, Wegley L, Zhang L, Zhu B, Rohwer F. PLoS Comput. Biol. 2009;5:e1000593. doi: 10.1371/journal.pcbi.1000593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Thorvaldsdottir H, Robinson JT, Mesirov JP. Brief. Bioinform. 2013;14:178–192. doi: 10.1093/bib/bbs017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hall T. Nucleic Acids Symp. 1999;41:95–98. [Google Scholar]
  • 10.Edgar RC. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. Mol. Biol. Evol. 2011;28:2731–2739. doi: 10.1093/molbev/msr121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Gomaa MH, Barta JR, Ojkic D, Yoo D. Virus Res. 2008;135:237–246. doi: 10.1016/j.virusres.2008.03.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.D. Cavanagh, J. Gelb Jr., in Diseases of Poultry, vol. 12, ed. by Y.M. Saif (2008)
  • 14.M.M.C. Lai, S. Perlman, L.J. Anderson, in Fields Virology, 5th edn., vol. 1, ed. by D.M. Knipe, P.M. Howley (2007)
  • 15.Jackwood MW, Boynton TO, Hilt DA, McKinley ET, Kissinger JC, Paterson AH, Robertson J, Lemke C, McCall AW, Williams SM, Jackwood JW, Byrd LA. Virology. 2010;398:98–108. doi: 10.1016/j.virol.2009.11.044. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Virus Genes are provided here courtesy of Nature Publishing Group

RESOURCES