Abstract
Corynebacterium pseudotuberculosis is of major veterinary importance because it affects many animal species, causing economically significant livestock diseases and losses. Therefore, the genomic sequencing of various lines of this organism, isolated from different hosts, will aid in the development of diagnostic methods and new prevention and treatment strategies and improve our knowledge of the biology of this microorganism. In this study, we present the genome of C. pseudotuberculosis Cp31, isolated from a buffalo in Egypt.
GENOME ANNOUNCEMENT
Corynebacterium pseudotuberculosis is a member of the CMNR group, which is composed of the Corynebacterium, Mycobacterium, Nocardia, and Rhodococcus genera (6). This microbe is a facultative intracellular Gram-positive pathogen that causes edematous skin disease (OSD). OSD is an endemic disease of buffalo in Egypt (6).
C. pseudotuberculosis Cp31 was isolated from a large brisket swelling of a buffalo in Egypt. The genome of C. pseudotuberculosis Cp31 was sequenced using a fragment library obtained according to the SOLiD version 3 protocol, and 80,530,223 50-bp reads were produced. These reads were filtered for quality control to remove reads with an average Phred quality score of less than 20, resulting in 51,276,112 reads for a total of 2,563,805,600 bp. This number of reads is equivalent to 1,114× coverage based on the 2.3-Mb genome of C. pseudotuberculosis FRC41 (GenBank accession number NC_014329), which was used as a reference.
The genome assembly was performed using a hybrid method (1) in which the reads were first assembled using the C. pseudotuberculosis FRC41 genome as a reference to obtain scaffolds, as studies have shown that C. pseudotuberculosis strains are highly clonal (2). The unmapped reads were then subjected to de novo assembly using the CLC Genomics Workbench software package, generating a set of contigs that contained gene products that were similar to those of Corynebacterium diphtheria. These contigs were inserted into the genome by manual curation using the same software used to produce the draft genome. The gaps were closed using three Ion Torrent sequencing runs totaling 228,643 reads, with an average size of 120 bp, which were mapped against the draft genome.
After open reading frame prediction, which was performed using the FgenesB program (http://linux1.softberry.com/), frameshifts were detected by aligning the Ion Torrent reads against the draft genome sequence by manual curation using the Artemis software (5). The prediction of rRNAs was performed using the RNAmmer program (3), which uses hidden Markov models, and tRNAs were predicted with tRNAscan-SE (4). InterProScan (8) was used to predict protein domains, motifs, and families. Noncoding RNA (ncRNA) prediction was performed using the Rfam database (http://rfam.sanger.ac.uk/), which predicted 11 ncRNAs, including a representative of the 6C class of ncRNAs that is absent in the C. pseudotuberculosis strain 1002 genome. Previous studies have indicated that this small RNA is conserved in Actinobacteria because its function is related to general dormancy or metabolic slow-down (7).
The annotation of coding sequences (CDSs) was performed in Artemis with the help of the NCBI nonredundant database. The sequence has a GC content of 52.2% and includes 2,310,587 bp, 2,171 CDSs, three rRNA operons, 49 tRNAs, and 47 pseudogenes.
Nucleotide sequence accession number.
The genome sequence obtained in this study has been deposited in the GenBank database under accession number CP003421.
ACKNOWLEDGMENTS
A.S., A.R.C., A.C.P., S.C.S., A.R.S., S.S.A., L.C.G., L.T.C., F.F.A., E.G.V.B., M.P.C.S., and V.A. were supported by the National Council for Scientific and Technological Development (Conselho Nacional de Desenvolvimento Científico e Tecnológico [CNPq]). R.T.J.R. and M.S.B. acknowledge support from the Brazilian Federal Agency for the Support and Evaluation of Graduate Education (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior [CAPES]). This work was performed as a part of the Genomics and Proteomics Network of the state of Para (Rede Paraense de Genômica e Proteômica [RPGP]) and was supported by the Para Research Foundation/Fundação de Amparo à Pesquisa do Estado do Pará (FAPESPA), the Amazon Center of Excellence in Microorganism Genomics (Núcleo Amazônico de Excelência em Genômica de Microorganismos), and the Support Program for Centers of Excellence (Programa de Apóio a Núcleos de Excelência [Pronex]), CNPq/FAPESPA, the National Program for Academic Cooperation (Programa Nacional de Cooperação Acadêmica [PROCAD]), CAPES, the Studies and Projects Funding Body (Financiadora de Estudos e Projetos [FINEP]), and the Minas Gerais Research Foundation (Fundação de Apóio à Pesquisa do Estado de Minas Gerais [FAPEMIG]).
REFERENCES
- 1. Cerdeira LT, et al. 2011. Rapid hybrid de novo assembly of a microbial genome using only short reads: Corynebacterium pseudotuberculosis I19 as a case study. J. Microbiol. Methods 86:218–223 [DOI] [PubMed] [Google Scholar]
- 2. Connor KM, Fontaine MC, Rudge K, Baird GJ, Donachie W. 2007. Molecular genotyping of multinational ovine and caprine Corynebacterium pseudotuberculosis isolates using pulsed-field gel electrophoresis. Vet. Res. 38:613–623 [DOI] [PubMed] [Google Scholar]
- 3. Lagesen K, et al. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Rutherford K, et al. 2000. Artemis: sequence visualization and annotation. Bioinformatics 16:944–945 [DOI] [PubMed] [Google Scholar]
- 6. Selim SA. 2001. Oedematous skin disease of buffalo in Egypt. J. Vet. Med. B Infect. Dis. Vet. Public Health 48:241–258 [DOI] [PubMed] [Google Scholar]
- 7. Swiercz JP, et al. 2008. Small non-coding RNAs in Streptomyces coelicolor. Nucleic Acids Res. 36:7240–7251 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Zdobnov EM, Apweiler R. 2001. InterProScan—an integration platform for the signature-recognition methods in InterPro. Bioinformatics 17:847–848 [DOI] [PubMed] [Google Scholar]
