Skip to main content
Journal of Bacteriology logoLink to Journal of Bacteriology
. 2007 Jan 5;189(7):2949–2951. doi: 10.1128/JB.00913-06

A Possible Heterodimeric Prophage-Like Element in the Genome of the Insect Endosymbiont Sodalis glossinidius

Alvin J Clark 1,*, Mauricio Pontes 2, Tait Jones 2, Colin Dale 2
PMCID: PMC1855819  PMID: 17209029

Abstract

Extrachromosomal element pSOG3 (52,162 nucleotides) in the genome of Sodalis glossinidius contains redundant phage-related gene pairs, indicating that it may have been formed by the fusion of two ancestral phage genomes followed by gene degradation. We suggest that pSOG3 is a prophage that has undergone genome degeneration accompanying host adaptation to symbiosis.


Bacterial symbionts that live obligately in animal hosts are not expected to provide a significant reservoir for bacteriophages because they have little or no contact with bacteria outside of their host. This explains why the genomes of ancient primary insect symbionts, such as Buchnera aphidicola, Wigglesworthia glossinidia, and Blochmannia spp., are devoid of prophage sequences (1, 5, 11). However, studies have shown that prophage sequences are present in the genomes of recently derived facultative insect symbionts, such as Wolbachia spp., Sodalis glossinidius, and “Candidatus Hamiltonella defensa” (7, 9, 10, 12, 14). In the present study, we report on phage-related sequences found in an extrachromosomally replicating element (pSOG3) in S. glossinidius, a facultative symbiont of the tsetse fly.

The sequence of pSOG3 was obtained by the shotgun method from extrachromosomal DNA isolated by alkaline lysis of a laboratory culture of S. glossinidius from Glossina morsitans morsitans. The pSOG3 sequence was assembled at an average of 11-fold coverage from shotgun reads using the SeqMan assembler in the Lasergene package (DNASTAR, Madison, WI). Open reading frames (ORFs) were predicted using GeneMark (6) and by translating BLAST searches. The sequence of pSOG3 encompasses 52,162 bp comprising 50 putative coding sequences and five pseudogenes (Fig. 1). Based on our analysis, we hypothesize that pSOG3 is an extrachromosomal element that arose from the integration of two distinct ancestral phage genomes and subsequently degenerated in the adaptation towards symbiosis. Another sequence containing phage-related genes (pSG3) has been obtained from an S. glossinidius strain isolated from Glossina palpalis palpalis (4). It comprises only about 40% of the pSOG3 genome (Fig. 1).

FIG. 1.

FIG. 1.

Representation of the genomes of prophage extrachromosomal elements pSOG3 of S. glossinidius from Glossina morsitans morsitans and pSG3 of S. glossinidius from Glossina palpalis palpalis (4). pSOG3 is drawn to emphasize its composition as two sets of phage genes. The first set extends from the HK620-like virion structural genes at the extreme left to the int-like gene at nucleotide position 30000; the second set extends from the epsilon15-like structural genes at position 33000 to the int-like gene at the extreme right. ORFs are shown as open arrows pointing in the direction of transcription. Filled arrows represent putative pseudogenes. The names given to some of the genes or their putative proteins are indicated above the arrows representing them. Two regions of protein amino acid sequence homology are shown as open rectangles with dashed lines embracing the homologous ORFs. The region of HK620 homology is also homologous to Sf6 and APSE1 (2). A nucleotide scale is indicated at the bottom of the figure.

Phage genomes are described as mosaic because they consist of DNA sequences or modules inherited from distinct ancestral phages (6). Typically, each mosaic phage has a single copy of each of the genes necessary for the phage lifestyle. Thus, pSOG3 is unusual because it has a striking heterodimeric structure composed of two contiguous regions harboring genes derived from two ancestral phages related to extant phages HK620, a relative of P22 (3), and epsilon15 (Fig. 1). The most parsimonious explanation for the formation of pSOG3 is the coalescence or heterodimerization of two ancestral phage genomes, resulting from homologous recombination between orthologous sequences from distinct ancestral prophages. In these circumstances, natural selection would be expected to favor the inactivation of genes that are functionally redundant in the resulting heterodimer. In the case of pSOG3, the region derived from a P22-like ancestor has a complete set of virion structural genes and no replication genes, while the region derived from an epsilon15-like ancestor has intact genes encoding proteins presumed to be involved in replication alongside inactivated structural genes (Fig. 1). Both regions include ORFs encoding putative proteins homologous to RecT, RecE, HU, Int, and the lysis triad holin, lysozyme, and Rz (Fig. 1). One copy of each of the recE and holin genes also appears to have been inactivated.

On the left of the pSOG3 map (Fig. 1) is a segment of 15.2 kb containing 14 ORFs whose putative proteins share substantial identity (50 to 80%) with virion structural proteins of temperate phages HK620 (Escherichia coli), Sf6 (Shigella flexneri), and APSE-1 (“Candidatus Hamiltonella defensa”). Virions of these phages are morphologically similar to those of phage P22, and their virion structural proteins show substantial identity to those of Salmonella enterica phage P22 (2, 3, 15). By comparison, in the middle of the map (Fig. 1) is a region of 7.3 kb that shares substantial sequence identity with a region of phage epsilon15, which is another temperate phage of Salmonella enterica that is distinct from P22 (M. R. McConnell, personal communication; GenBank accession no. NC004775). The putative structural genes in the epsilon15-like region of pSOG3 are clearly pseudogenes, because they have coding sequences that are truncated by frameshifts and/or deletions.

Several lines of evidence indicate that pSOG3 replicates extrachromosomally in S. glossinidius. First, the circular assembly of pSOG3 contains two sequences hypothesized to be involved in replication. One of these is an ORF (repA) whose putative protein product is 27% identical to the RepA protein of plasmid pEA29 from Erwinia amylovora. Upstream of the pSOG3 repA are four imperfect repeats (ACCGCTTTTTGGTCGTC) sharing similarity with the putative replicator designated for pEA29 (8). Downstream of repA is an ORF whose translated protein is 30% identical to the plasmid partitioning protein, ParA, of Corynebacterium glutamicum plasmid pGA2 (13). Southern hybridization experiments also indicate that pSOG3 is not integrated into the chromosome of S. glossinidius (Fig. 2), consistent with the complete sequence of the S. glossinidius chromosome, which does not include the prophage and is derived from the same strain (14). The copy number of pSOG3, estimated by TaqMan real-time PCR, closely matches that of the S. glossinidius chromosome (Fig. 2).

FIG. 2.

FIG. 2.

Evidence that pSOG3 replicates as a plasmid.

(A) Left panel, pulsed-field gel of S. glossinidius total DNA (lane 1) and plasmid DNA (lane 2) digested with XmaI and S. glossinidius uncut plasmid DNA (lane 3). Right panel, corresponding Southern hybridization of the gel on the left with a radiolabeled probe specific for pSOG3 gn53. Hybridization reveals the presence of a single band in XmaI digests of S. glossinidius total DNA and a band of the same mobility in the plasmid DNA (indicated by arrowheads), indicating that pSOG3 replicates as a plasmid and is not integrated into the chromosome of S. glossinidius. (B) Relative copy numbers of single-copy chromosomal genes and pSOG3 gn52 determined by TaqMan quantitative PCR assays conducted in triplicate with S. glossinidius total DNA were used to determine the copy number of pSOG3. The numbers of pSOG3 gn52 relative to single-copy chromosomal genes fusA (gp52:fusA) and rplB (gp52:rplB) are close to 1, indicating that the pSOG3 copy number is close to that of the chromosome.

Since phage virions have been identified sporadically in cultures of S. glossinidius isolated from G. m. morsitans (C. Dale, unpublished observations), we hypothesize that pSOG3 is the prophage responsible for those virions. It is notable that pSOG3 retains a full complement of ORFs required for virion assembly (Fig. 1), while no other complete set of such ORFs is present in the chromosomal sequence (14). Molecular evolutionary analyses of three structural-protein-encoding virion genes reveal that these proteins have evolved under strong purifying selection, i.e., selection that eliminates variants because they lack activity or selective advantage. Purifying selection is evidenced by low ratios of nonsilent (amino-acid-changing) substitutions (dN) to silent substitutions (dS) in pairwise comparisons of homologous sequences among pSOG3, HK620, Sf6, and APSE-1 (Table 1). This is consistent with the preservation of the functionality of structural virion proteins in pSOG3 by natural selection, suggesting that lysis and virion production do play important roles in vivo. However, the conditions required to induce pSOG3 lysis have not yet been defined. Notably, the pSOG3 prophage does not carry any ORFs homologous to the typical regulatory genes (i.e., cI, cro) found in lambdoid phages, although such sequences are found in the S. glossinidius chromosome (14). However, the prophage does maintain three putative coding sequences (gn18, gn26, and gn43) that could serve as transcriptional regulators.

TABLE 1.

dN/dS ratios for three structural-protein-encoding virion genesa

Protein dN/dS ratio
Sf6 APSE-1 pSOG3
Capsid
    HK620 0.063 0.049 0.063
    Sf6 0.064 0.063
    APSE-1 0.049
Portal
    HK620 0.088 0.117 0.078
    Sf6 0.115 0.075
    APSE-1 0.135
Gp10
    HK620 0.035 0.170 0.178
    Sf6 ND 0.175
    APSE-1 0.173
a

The dN/dS ratios were calculated using the pairwise Kumar method in MEGA 3.1 (available at http://www.megasoftware.net/). The nucleotide sequences for the following genes encoding the proteins listed were used for the comparison: for HK620, hkbS, hkbQ, and hkbV (3); for Sf6, genes 5, 3, and 8 (2); for APSE-1, ORFs 24, 19, and 28 (15); and for pSOG3, gn05, gn03 and gn07 (this study). Gp10 is the name of a P22 protein that participates in virion tail formation and thus helps stabilize DNA in the phage head. For reference, dN/dS ratios of <1 are consistent with purifying selection; the strength of purifying selection increases as the dN/dS ratio approaches zero. The dN/dS ratio was not computed for the Gp10 homologues shared by Sf6 and APSE-1 because the sequences were too divergent (indicated by ND [not determined]).

Nucleotide sequence accession number.

The sequence of pSOG3 has been deposited in GenBank under accession no. DQ785801.

Acknowledgments

We gratefully acknowledge Roger Hendrix and Michael Ford, who provided us with an initial library of shotgun DNA fragments.

This work was supported in part by a personal grant to the University of Arizona and in part by NSF award EF-58501127 (to C.D.).

Footnotes

Published ahead of print on 5 January 2007.

REFERENCES

  • 1.Akman, L., A. Yamashita, H. Watanabe, K. Oshima, T. Shiba, M. Hattori, and S. Aksoy. 2002. Genome sequence of the endocellular obligate symbiont of tsetse flies, Wigglesworthia glossinidia. Nat. Genet. 32:402-407. [DOI] [PubMed] [Google Scholar]
  • 2.Casjens, S., D. A. Winn-Stapley, E. B. Gilcrease, R. Morona, C. Kuhlewein, J. E. Chua, P. A. Manning, W. Inwood, and A. J. Clark. 2004. The chromosome of Shigella flexneri bacteriophage Sf6: complete nucleotide sequence, genetic mosaicism, and DNA packaging. J. Mol. Biol. 339:379-394. [DOI] [PubMed] [Google Scholar]
  • 3.Clark, A. J., W. Inwood, T. Cloutier, and T. S. Dhillon. 2001. Nucleotide sequence of coliphage HK620 and the evolution of lambdoid phages. J. Mol. Biol. 311:657-679. [DOI] [PubMed] [Google Scholar]
  • 4.Darby, A. C., J. Lagnel, C. Z. Matthew, K. Bourtzis, I. Maudlin, and S. C. Welburn. 2005. Extrachromosomal DNA of the symbiont Sodalis glossinidius. J. Bacteriol. 187:5003-5007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gil, R., F. J. Silva, E. Zientz, F. Delmotte, F. Gonzalez-Candelas, A. Latorre, C. Rausell, J. Kamerbeek, J. Gadau, B. Holldobler, R. C. van Ham, R. Gross, and A. Moya. 2003. The genome sequence of Blochmannia floridanus: comparative analysis of reduced genomes. Proc. Natl. Acad. Sci. USA 100:9388-9393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lukashin, A. V., and M. Borodovsky. 1998. GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res. 26:1107-1115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Masui, S., S. Kamoda, T. Sasaki, and H. Ishikawa. 2000. Distribution and evolution of bacteriophage WO in Wolbachia, the endosymbiont causing sexual alterations in arthropods. J. Mol. Evol. 51:491-497. [DOI] [PubMed] [Google Scholar]
  • 8.McGhee, G. C., and A. L. Jones. 2000. Complete nucleotide sequence of ubiquitous plasmid pEA29 from Erwinia amylovora strain Ea88: gene organization and intraspecies variation. Appl. Environ. Microbiol. 66:4897-4907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Moran, N. A., P. H. Degnan, S. R. Santos, H. E. Dunbar, and H. Ochman. 2005. The players in a mutualistic symbiosis: insects, bacteria, viruses, and virulence genes. Proc. Natl. Acad. Sci. USA 102:16919-16926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Moran, N. A., J. A. Russell, R. Koga, and T. Fukatsu. 2005. Evolutionary relationships of three new species of Enterobacteriaceae living as symbionts of aphids and other insects. Appl. Environ. Microbiol. 71:3302-3310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Shigenobu, S., H. Watanabe, M. Hattori, Y. Sakaki, and H. Ishikawa. 2000. Genome sequence of the endocellular bacterial symbiont of aphids Buchnera sp. APS. Nature 407:81-86. [DOI] [PubMed] [Google Scholar]
  • 12.Sinkins, S. P., T. Walker, A. R. Lynd, A. R. Steven, B. L. Makepeace, H. C. Godfray, and J. Parkhill. 2005. Wolbachia variability and host effects on crossing type in Culex mosquitoes. Nature 436:257-260. [DOI] [PubMed] [Google Scholar]
  • 13.Tauch, A., A. Puhler, J. Kalinowski, and G. Thierbach. 2003. Plasmids in Corynebacterium glutamicum and their molecular classification by comparative genomics. J. Biotechnol. 104:27-40. [DOI] [PubMed] [Google Scholar]
  • 14.Toh, H., B. L. Weiss, S. A. Perkin, A. Yamashita, K. Oshima, M. Hattori, and S. Aksoy. 2006. Massive genome erosion and functional adaptations provide insights into the symbiotic lifestyle of Sodalis glossinidius in the tsetse host. Genome Res. 16:149-156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.van der Wilk, F., A. M. Dullemans, M. Verbeek, and J. F. van den Heuvel. 1999. Isolation and characterization of APSE-1, a bacteriophage infecting the secondary endosymbiont of Acyrthosiphon pisum. Virology 262:104-113. [DOI] [PubMed] [Google Scholar]

Articles from Journal of Bacteriology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES