Skip to main content
Current Genomics logoLink to Current Genomics
. 2012 Apr;13(2):153–162. doi: 10.2174/138920212799860698

Microbial Lifestyle and Genome Signatures

Chitra Dutta 1,*, Sandip Paul 1,$
PMCID: PMC3308326  PMID: 23024607

Abstract

Microbes are known for their unique ability to adapt to varying lifestyle and environment, even to the extreme or adverse ones. The genomic architecture of a microbe may bear the signatures not only of its phylogenetic position, but also of the kind of lifestyle to which it is adapted. The present review aims to provide an account of the specific genome signatures observed in microbes acclimatized to distinct lifestyles or ecological niches. Niche-specific signatures identified at different levels of microbial genome organization like base composition, GC-skew, purine-pyrimidine ratio, dinucleotide abundance, codon bias, oligonucleotide composition etc. have been discussed. Among the specific cases highlighted in the review are the phenomena of genome shrinkage in obligatory host-restricted microbes, genome expansion in strictly intra-amoebal pathogens, strand-specific codon usage in intracellular species, acquisition of genome islands in pathogenic or symbiotic organisms, discriminatory genomic traits of marine microbes with distinct trophic strategies, and conspicuous sequence features of certain extremophiles like those adapted to high temperature or high salinity.

Keywords: Reductive genome evolution, Intra-amoeba pathogens, Strand-specific codon bias, Genome islands, Trophic strategies, Thermophiles, Halophiles.

INTRODUCTION

Micorbes are the most ancient and tiny, yet the most diverse and versatile life forms of our planet! For nearly four billion years, they have evolved to adapt themselves to every lifestyle imaginable and every environment conceivable, including the most extreme and inhospitable ones. The genome architectures of microorganisms often bear the telltale signs of this long journey of adaptive evolution. Microbes from distant lineages but of similar lifestyle may exhibit similar genomic/proteomic traits, telling the tale of tailor-made convergence. On contrary, closely related bacterial species, even strains of the same species, when acclimatize to distinct ecology, may display substantial genomic diversity, narrating the history of niche-driven divergence. The lifestyle of a microbe, therefore, not only can contribute significantly in sculpting its genome, but also may inscribe own signature in its genome fabrics. The present review aims to provide an account of such niche-specific genome signatures in microorganisms adapted to specialized lifestyle and/or environment.

The term “genome signature”, coined by Karlin & Burge [1], has been used by various investigators to refer to similar concepts, but to different genomic properties. Typically, a genome signature refers to any sequence feature that enables characterization of the source organism from mere knowledge of its nucleotide sequence (complete or even partial genome sequence of sufficient length). The major advantage of the concept of genome signatures over the traditional approaches of rRNA-based phylogeny is that it does not depend on sequence alignment [2-4]. An ideal genome signature should satisfy two major criteria – i) it should be species-specific, i.e., the signature should be different for different genomes and ii) it should be pervasive, i.e., the imprint of the global signature should persist locally at smaller scales throughout the genome. Signatures of closely related species are expected to be more similar to one another than the signatures of distant ones. Usually “closely related species” refers to close phylogenetic lineages. However, the present review intends to focus on the signatures of “ecological kinship” rather than those of the “taxonomical cliques”.

SEQUENCE FEATURES OF MICROBIAL GENOMES INFLUENCED BY LIFESTYLE

a). G+C-Content

The simplest compositional parameter that might be influenced by environment or lifestyle of a microbial species is the G+C-content of its genome [5-7], which remains fairly constant within a microbial species, but varies widely across microbial species. The genomic G+C-content of a microbe, reflecting optimization between the directional mutational bias [8, 9], natural selection and genetic drift [10, 11], is often influenced by factors like temperature [12], niche complexity [5], cost and availability of nucleotides [13, 14], aerobiosis [15], nitrogen utilization [16] etc. There is a general tendency of large genomes to be G+C rich and small genomes to be G+C poor [17-19]. The obligatory intracellular pathogens/symbionts and microorganisms surviving in nutrient-limiting environments are, in most cases, characterized by relatively small genomes of low G+C-content, apparently in attempt to reduce replication expenses [13, 14], while free-living organisms, especially of the ones surviving in the soil [6], usually possess much larger genomes of higher G+C-content.

b). Oligonucleotide Composition

Oligonucleotide frequencies capture species-specific characteristics of nucleotide composition more effectively than simple G+C-content [20]. Comparison of di-, tri- or higher order oligonucleotide frequencies in DNA sequences has long been used as a method of sequence characterization, particularly because this approach did not require sequence alignment [2-4]. The concept of a non-alignment, genomic signature approach to genome analysis was introduced for the first time by Karlin & Burge [1], when they defined “Dinucleotide relative abundances”, the deviation of observed dinucleotide frequencies from those expected from the mononucleotide frequencies. A series of work, conducted by Karlin & Burge as well as other groups of investigators [1, 21-23], have established the set of dinucleotide relative abundances as a robust genome signature that can discriminate between sequences from different organisms. Dinucleotide relative abundance values appear to reflect the chemistry of dinucleotide stacking energies and base-step conformational preferences, as well as the species-specific properties of DNA modification, replication and repair mechanisms [1, 22].

Karlin & Burge [1] speculated that dinucleotide genome signatures might be influenced by environmental influences such as pH, temperature and salinity. In a study of seven complete and several partial microbial genomes, Karlin et al. [22] noted that the dinucleotide TA, though broadly underrepresented in prokaryotes and eukaryotes, occurs with normal frequencies in two archaeal extremophiles - Sulfolobus and Pyrobaculum aerophilum and the dinucleotide CG is underrepresented in three thermophilic archaea, namely Methanococcus jannaschii, Sulfolobus sp., and M. thermoautotrophicum, but overrepresented in halobacteria. It has later been reported [24] that the halophilic microbes, characterized by overrepresentation of the dinucleotides GA/TC, CG and AC/GT, can be differentiated from non-halophiles on the basis of their dinucleotide abundance values.

Parallel to the applications of dinucleotide relative abundance, oligonucleotides of varying length ranging from dinucleotides to octanucleotides have also been employed in combination with various metrics, clustering algorithms, or supervised machine learning methods to detect species-specific patterns in genome sequences from all kingdoms of life [20, 22, 25-34]. These signatures could not only detect taxonomic relationships, but also showed potential in delineating niche-specific patterns. Karlin et al. [22] reported that the tetranucleotide CTAG is extremely underrepresented and distributed in an anomalous fashion along the genome of the thermophilic microbe M. jannaschi. Applying classification and regression tree (CART) analysis to genome-wide tetranucleotide frequencies of 195 archaea and bacteria, Dyer et al. [35] reported the discriminating tetramers, the frequencies of which could differentiate between three temperature ranges, hyperthermophily, thermophily and mesophily.

Analysis of dinucleotide composition of bacteriophage genomes revealed that the phage genomes often display distinct genomic signatures depending on their replication and repair mechanisms [36]. The signatures of temperate phages, whose replication and repair depends on the host machinery, converge toward the signatures of their respective hosts, whereas autonomously replicating phages like T4 or T7 display their own characteristic signatures. Recently, use of tetranucleotide-based genome signatures enabled differentiation of the phages infecting E. coli, S. aureus, M. smegmatis and P. aeruginosa, where most of the temperate phages exhibited a shorter genomic signature distance between their genomes and that of their hosts than that of the lytic phages [37]. These observations advocate for the hypothesis that the intrinsic replication and repair mechanisms contribute significantly to the species-specific nature of dinucleotide relative abundances [38].

Free-living bacteria, in general, display stronger bias in oligonucleotide usage than host-associated bacteria, as observed in a hierarchical clustering based on hexanucleotide–based genome signatures of 867 prokaryotic genomes [31]. Recently, comparative analysis of tetranucleotide composition in a set of 774 sequenced microbial genomes revealed convergence of compositional patterns among genomes with similar habitats [34], displaying distinct clusters of obligate intracellular organisms (both pathogen and endosymbiont) and grouping of the halophilic bacterium Salinibacter rubber, not with its fellow Bacteroidetes, but with halophilic and methanogenic Archaea [34].

c). Codon Usage

Trends in codon usage in microorganisms often carry the signals of their lifestyle or environment. Synonymous codon usage patterns in unicellular organisms, in general, follow species-specific biases that reflect an optimization between mutational biases and selective forces [39-42]. Among the major selection forces, translational selection is operative on large number of organisms, in which the highly expressed genes prefer to use a subset of synonymous codons [42]. The study conducted by Rocha [43] suggested that the fast-growing bacteria have higher codon usage bias in highly expressed genes due to the presence of fewer anticodons and hence, fewer subsets of distinct tRNAs. Lynn et al. [44] reported the presence of a characteristic pattern of codon usage among the thermophiles, which has later been reconfirmed by several investigators [45-48]. Distinct niche-specific trends in synonymous codon usage have also been observed in microbes thriving at high salinity [24]. An analysis of synonymous codon usage patterns in bacterial and fungal genomes by Willenbrok et al. [49] demonstrated that differences in codon preferences of translational codon adaptation and dominant codon adaptation provide an environmental signature that can segregate bacteria according to their lifestyle, for instance soil bacteria and soil symbionts, spore formers, enteric bacteria, aquatic bacteria, and small intercellular and extracellular pathogens.

Codon usage bias in viral genomes often reflects imprints of adaptation to specific host environment. The dinucleotide CpG and the CpG-containing codons are often significantly underrepresented in ORFs of small vertebrate DNA viruses [50], such as poliovirus genomes, especially in vaccine-derived poliviruses and the attenuated virus of polioviruses genotype 1 [51]. This might be due to the fact that the unmethylated CpGs are recognized by the host's innate immune system (Toll-like receptor 9) as a pathogen signature [52], while methylated CpGs in a small vertebrate DNA or RNA virus would face a high chance of mutation that would result in a reduction of this dinucleotide [50, 53]. A lower frequency of CpG might also help the vaccine derived polio virus out of the host immunity [51]. Multivariate analysis of codon usage patterns in the genes from segment 1 to segment 6 of avian and human influenza viruses, including pandemic H1N1, showed that the codon preferences of seasonal human influenza viruses were distinct among their subtypes and different from those of avian viruses [54] and a plausible explanation could be that the replication of the influenza virus depends on its host's machinery, and hence, the codon usage of the viral genes might be subject to host selection pressures, especially after interspecies transmission.

Apart from these factors, there are various other sequence attributes like purine-loading [55], GC-skew [56], genomic islands [57] etc, that are often employed to reveal specific evolutionary traits, which will be discussed later in relevant sections. Properties like conserved sequence repeats [58], “periodicity signatures” – the formal representation of periodic sequence patterns related to DNA curvature [59] and compositional spectra based on imperfect occurrences of long olignucleotide words [60, 61] are also potentially characteristic of different ecological groups of microbes. For instance, the archaea of the order Halobacteriaceae displayed the “periodicity signatures” distinct from other archaeal species, which might be due to their early divergence from other archaeal lineages, extensive lateral gene transfer or adaptation to high salt environments [59]. Clustering of genomes of 39 species of Eukarya, Eubacteria, and Archaea using the compositional spectra [61] could classify the organisms on the basis of two ecological parameters, temperature and oxygen.

SPECIALIZED LIFESTYLES OF MICROBIAL COMMUNITY & THEIR GENOME SIGNATURES

a). Obligatory Intracellular Lifestyle, Characterized by Genome Reduction

Bacteria often trade their free-living lifestyle for an obligatory symbiotic or parasitic relationship with eukaryotic hosts. Examples include endocellular symbionts like Wigglesworthia glossinidia or Buchnera aphidicola, as well as pathogenic bacteria like Mycobacterium leprae, the causative agents of leprosy, Borrelia burgdorferi, the agent of Lyme disease and many other parasitic bacteria. Such host-restricted bacteria usually display some specific genome features, not observed in their close relatives retaining free-living stages [18, 62-65]. These include much smaller genome sizes; significant reduction in gene repertoire; accumulation of pseudogenes; accelerated sequence evolution; appreciable enrichment in A+T-content; and significant increase in the frequency of mobile elements in the genomes, in some cases [62, 63, 66]. All these features together represent a general syndrome of reductive genome evolution, which has been observed repeatedly in obligatory intracellular microbes from diverse lineages and of distinct host environment.

There could be various factors driving reductive evolution in host-restricted bacteria. Within the protected and predictable environment inside the host, many genes are rendered redundant or superfluous, and previously deleterious mutations become neutral in effect, due to relaxed selection. Furthermore, an obligatory association with host may result in the drastic reduction in effective population size of a lineage owing to the strict vertical inheritance modes, thereby leading to recurrent bottlenecks [66-70] and these may lead to an increase in the fixation rates for slightly deleterious mutations [71], as postulated by the so-called Muller’s ratchet [72]. Limited opportunities for the horizontal transmission of genetic elements in the secluded lifestyle of endosymbionts are also likely to contribute to smaller genome sizes. One could argue that genome shrinkage represents a selective process of genome streamlining, by which the organisms judiciously get rid off extraneous DNA. But the retention of nonfunctional DNA in the form of pseudogenes or increase in spacer regions in the reduced genomes, as observed in Rickettsia or Buchnera [62], advocates for the notion of the genetic drift, rather that of the selection driven genome shrinkage.

Intracellular microbes, in general, follow some common trends for gene retention/disposal. Small genomes, in general, retain fewer tRNAs and fewer DNA repair/recombination enzymes [73]. Relatively A+T–rich genomes of host-restricted organisms may be attributed, at least partially, to the elimination or decreased efficiency of genes encoding DNA repair enzymes [74]. For instance, in U. urealyticum, the mutation pressure leading to A+T enrichment may be attributed to the decreased ability to remove uracil from DNA due to the absence/inefficiency of the DNA repair enzymes dUTPase, which prevents dUTP from being integrated into DNA, and uracil-DNA glycosylase, which removes uracil from DNA. Spontaneous deamination of deoxycytidine and mis-incorporation of dUTP by DNA polymerase are the sources of uracil in DNA, and simultaneous repair of GU mismatches by DNA polymerase leads to an A/T enrichment [74]. Genes involved in redundant/unneeded pathways like biosynthesis pathways, transcriptional regulatory mechanisms or regulatory elements like sigma factors [65, 73] are usually eliminated, while the genes involved in essential functions like DNA replication, transcription and translation, chaperone systems and the protein translocation machinery are likely to be retained. As revealed in Buchnera [75], reductive genome evolution may lead to a shrinkage in the modular structure of their protein interaction networks in a way to maintain the essential characteristics of the networks. Symbiont lineages often retain distinctive gene sets, depending on their provisioning roles in hosts, as observed in Buchnera, Wigglesworthia and Blochmannia [76].

Newly host-dependent bacteria, which are still in the process of transition from free-living lifestyle to obligatory host association may be distinguished from anciently host-restricted ones by two genomic attributes: expansion of insertion sequence (IS) elements and abundance of pseudogenes. Recently evolved endosymbionts/pathogens that are still in transition usually possess much higher numbers of IS elements as compared to their free-living relatives. At initial stages of host restriction, IS elements can promote genome degradation by inactivation of genes [77] and regulatory elements [78], and also by serving as repetitive sequences that induce large deletions through homologous recombination [79]. The anciently host-restricted genomes might also have passed through such stages of IS spread, but traces of these mobile elements have now been deleted or mutated beyond recognition, as exemplified by the extreme genomic stasis of Buchnera having no sign of chromosome rearrangements or gene acquisitions in the past 50 to 70 million years [80]. One striking exception is Wolbachia, which, despite carrying typical features of anciently host-restricted bacteria, retains very large numbers of mobile elements [81]. It has been suggested that abundance of mobile elements might enable this arthropod-associated endosymbiont to coinfect individual insect hosts and undergo lineage-specific gene rearrangements.

Another distinctive signs of ongoing gene inactivation in genomes of host-restricted bacteria is the presence of numerous pseudogenes, observed in Rickettsia prowazekii [82], M. leprae [83] and many other microbes. In a genome under the process of shrinkage, inactivation of individual genes results in pseudogenes that slowly dwindle through deletions and as a result, ancestral genes are present in varying stages of decline [62].

It has recently been suggested that the phenomenon of reductive evolution could be a distinct characteristics of the bacteria specifically associated with human communities, agriculture and animal domestication - three features clearly linked to the Neolithic revolution [84]. It is hypothesized that after the first Neolithic settlements, bacteria specialized in human-associated niches underwent the reductive evolution, which did not occur in related species that are not specialized in humans. Recently, a comparative study of genome evolution in Lactobacillus reuteri populations associated with rodent and human hosts revealed that the rodent-restricted strains possess a large and adaptable pan-genome while its human-restricted relatives are subjected to a process of reductive evolution [85].

b). Non-Specialized Intra-Amoebal Lifestyle, Characterized by Genome Expansion

The notion of post-neolithic genome reduction in human-associated intracellular bacteria has also been supported by the observation that intra-amoebal pathogens exhibit, in general, increased genome size compared to their human-specialized relatives [86]. Free-living amoebae feed on several bacteria, fungi, and algae that they encounter. Some microorganisms, which have evolved to resist these phagocytic protests, survive and replicate within their amoebal host. These amoeba-resistant microorganisms include many established pathogens like Legionella spp., Chlamydophila pneumoniae, Mycobacterium avium, Listeria monocytogenes, Pseudomonas aeruginosa etc. Interestingly enough, many of these amoeba-resistant pathogens, despite their strict intracellular lifestyle, are reported to have larger genomes as compared to their human-infecting relatives [86]. It has been proposed that these nonspecialized microorganisms live in community within their hosts, promoting horizontal gene exchanges between different sympatric intra-amoebal parasites, as well as between the parasites and amoeba, which increases their genome sizes [86, 87].

c). Strand-Specific Codon Bias, Frequently Observed in Intracellular Microorganisms

Another distinctive genomic feature, observed in a number of obligate intracellular bacteria is significantly different synonymous and/or non-synonymous codon usage patterns in genes transcribed on the leading and lagging strands of replication [88, 89]. Bacterial genomes are, in general, characterized by polarized nucleotide composition in the two strands of DNA replication [90, 91], where the leading and lagging strands tend to be richer in keto (G and T) bases and the amino bases (C and A), respectively [92, 93]. In genomes of most of the free-living bacteria, this strand-specific nucleotide composition, as measured by their GC-skew values [93] could not impart any significant influence in the codon and/or amino acid preferences in the genes/gene-products encoded by two strands of replication. The strand-specific codon bias was observed for the first time in the intracellular pathogen Borrelia burgdorferi [88] and since then, more than ten bacterial and viral genomes were reported to have significant strand-specific codon bias. Strikingly enough, most of these species are obligate intracellular [94, 95]. Examples include B. burgdorferi, T. pallidum [88], Chlamydia trachomatis [89], Buchnera aphidicola [96], Bartonella [97], Tropheryma whipplei [98], Chlamydia muridarum [99], Lawsonia intracellularis [100], Ehrlichia canis [101], adenovirus [102] etc.

Numerous hypotheses were put forward attributing the strand-specific compositional bias to the replication-induced and/or to the transcription/translation-coupled mutation/ repair asymmetry [94, 103-105]. For either kind of hypothesis, cytosine deamination of single-stranded DNA is thought to play a vital role [92, 104]. Due to the inherent asymmetry of the mechanism of DNA replication, the leading strand is exposed in the single-stranded state for a longer time than the lagging strand and hence, is more prone to cytosine deamination. During transcription, the coding strand remains in the single-stranded state for a longer time and hence, experiences more C -> T mutations. The transcription-associated asymmetries can either increase or decrease replication-associated strand asymmetries, depending on the transcription direction and the position of the gene relative to the origin of replication [105]. In most of the intracellular species displaying strong strand-specific biases, replicational and transcriptional selection are coupled together - replicational selection is responsible for the higher number of genes on the leading strand and transcriptional selection for the enrichment of highly expressed and/or essential genes on the same strand [106, 107]. Genomes of free-living bacteria have, in general, much higher plasticity, and frequent chromosomal rearrangements in these species might weaken the inter-strand compositional skews. But in intracellular microbes with reduced genomes and a protected lifestyle within the host, frequencies of chromosomal rearrangement might be too low to upset strand-specific codon biases [94]. Loss of genes for replication restart pathways in reduced genomes of intracellular microbes might also contribute to their pronounced strand-specific compositional asymmetry [108]. There are, however, some intracellular bacteria showing little or no inter-strand differences in codon bias, where genomic rearrangements are likely to occur at a higher rate despite their obligate intracellular lifestyle, as observed in Rickettsia [109, 110] or Wolbachia [111].

d). Acquisition of Genome Islands in Organisms Having Pathogenic or Symbiotic Lifestyle

A pathogenic microbe is often distinguished from the non-pathogenic variants of the same or related species by the presence of the pathogenicity island – a flexible gene pool encoding virulence factors like toxins, adherence factors, invasion factors, secretion systems etc., clustered in a specific genomic region, the G+C-composition of which usually differ significantly from that of its core genome [57, 112]. PAIs were first described in the genomes of human pathogens of the species Escherichia coli [113], but with accumulation of more bacterial genome sequences, it became apparent that they represent a subclass of a more diverse group of genetic elements, designated as genomic islands (GI), found in abundance in bacterial genomes [57, 114, 115]. A GI refers to a part of a genome - usually between 10 to 200 kb in length - harbouring a number of accessory genes that might be beneficial for the host bacterium under specific environmental conditions. GIs usually differ in compositional statistics like G+C-content, cumulative GC skew, tetranucleotide frequencies, codon usage etc. from the rest of the chromosome. They are often inserted at tRNA gene loci and flanked by 16–20 bp perfect or almost perfect direct repeats [116]. They may also carry insertion elements or transposons and the same GI can occur in distantly related species. All these strongly argue in favor of horizontal acquisition of GIs by their host genomes [112, 116]. It has been suggested that GIs enable a large number of genes to be transferred and incorporated into the recipient genome that may lead to dramatic changes in the behavior of the organism resulting in “evolution in quantum leaps” [117, 118]. Evolutionary forces shaping the codon and amino acid usage in genes/gene-products of a genomic island may differ from those influencing the composition of the core genes/gene-products of its host, as demonstrated in case of the symbiotic island of the Bradorizobium japonicum [119].

Since acquisition of GIs often enhances the fitness of the recipient microbes facilitating microbial transmission, survival or colonization within a niche, they are also known as 'fitness islands' [120]. Fitness islands may be associated with diverse adaptive functions that contribute to different microbes' unique lifestyles. For instance, nitrogen fixation genes in Rhizobiaceae species are encoded by “symbiosis islands” [121], genes for phenolic compound degradation in Pseudomonas putida are harbored by “metabolic islands” [122], the iron-uptake ability of many pathogens are conveyed by “adaptive islands” [115] and the mecA-region of staphylococci that enhances survival of the carrier strains in presence of antibiotic-producing microbes in soil [123] may be termed resistance island. The same or similar GIs may exhibit distinct functionality under diverse ecological conditions or lifestyles of its host microbe. GIs in E. coli strains of the human gut microbiome encoding the adherence factors like P-, S-, and F1C-fimbriae [124] usually function as a saprophytic island, facilitating colonization of the gut. But under special circumstances, P-, S- or F1C-positive E. coli may reach the urinary tract, when the same island serves as a true pathogenicity island, helping its host microbe to emerge as a virulent strain causing infections of the bladder/kidney [125]. Similarly, GIs encoding secretion systems of type III in the virulent strains of Salmonella [126], Shigella [127], and Yersinia-groups [128] or type IV in Legionella pneumophila [129] strains and Helicobacter pylori [130] are involved in the infectious process of the respective bacteria and hence, are called pathogenicity islands. But similar GIs carrying the type III system of rhizobia, or the type IV system of F plasmids act as symbiotic or ecological islands that enhance the fitness of its host microbes in their natural niche [120]. Therefore, categorization of GIs not only depends on the genetic composition of the island itself, but also on the genetic background and lifestyle of its bacterial host.

e). Discriminatory Genome Features of Trophic Life Strategies of Marine Microbes

Marine bacteria often specialize to survive in distinct trophic habitats in the oceans - some have evolved to colonize low-nutrient (oligotrophic) environments, while others prefer to thrive in nutrient-rich (copiotrophic) sites. Comparison of genome sequences of two marine microbes, the copiotroph Photobacterium angustum S14 and the oligotroph Sphingopyxis alaskensis RB2256 as model representatives of two major classes of heterotrophic marine lifestyles, along with sequence information for 32 related microbes with well-characterized trophic lifestyles, has led to the identification of 43 genomic markers related to trophism [131]. Oligotrophs are typified by shorter genomes, fewer rRNA operons, fewer prophages, higher number of cytoplasmic and lower number of periplasmic proteins and distinct Cluster of Orthologous Groups (COG) of proteins distribution patterns. For instance, the COG categories representing defense mechanisms, cell motility, transcriptional regulators or signal transduction pathways are significantly higher in copiotrophs, while the COGs involved in transport or metabolism of lipids and secondary metabolites are typically over represented in oligotrophs. Copiotroph genomes also contain more repeats within clustered regularly interspaced short palindromic repeats (CRISPRs). There are, however, some microbes exhibiting mixed genomic traits, as exemplified by the Planctomycetes, possessing large genomes (a feature of copiotrophs), but having a single copy of the rRNA operon (a feature of oligotrophs). By creating self-organizing maps that integrated these genomic markers, Lauro et al. [131] could effectively distinguish microbial trophic strategies from mere knowledge of their genomic sequences.

The marine cyanobacterium Prochlorococcus marinus has been the first documented example of genome shrinkage along with A+T enrichment in a free-living organism [132] – an event ascribed to various factors related to their growth in oligotrophic waters [133-135], selection for metabolic economy [132, 133, 136], loss of low fitness genes [137], and smaller cell sizes [132]. Interestingly enough, it is also the first documented example of significant differences in dinucleotide abundance genome signatures across different strains of the same species [138]. On the basis of vertical niche partitioning, strains of P. marinus may be broadly divided into two major ecotypes: high-light-adapted (HL) ecotypes dominating the surface waters ecotype and the low-light adapted (LL) ecotype growing preferentially at depths between 80 and 200 meters. Analysis of complete genome sequences of 6 HL and 6 LL strains of P. marinus revealed existence of distinct dinucleotide signatures not only across the HL and LL strains, but also within LL strains of varying genome sizes and G+C-content. Analysis of synonymous codon usage profiles indicated the presence of pronounced strand-specific asymmetry in LL strains. The study [138] also delineated definite trends in amino acid usage as well as physicochemical and structural features in core proteome of different ecotypes of Prochlorococcus strains, which are not solely governed by their genomic G+C-bias.

f). Distinct Genome Features in Microbes Adapted to Extreme Environments

i). Microorganisms Thriving at High Temperatures

Thermophiles and hyperthermphiles, i.e., the organisms thriving at temperatures greater than 50° and 80° Celsius, require special adaptation strategies at genome and proteome levels to withstand extremely high temperatures. Several studies indicated a possible relationship between the optimal growth temperature (OGT) of microorganisms and their genomic base composition [46, 139, 140]. In both thermophiles and hyperthermophiles, the G+C contents of tRNA/rRNA genes exhibit strong positive correlations with their optimal growth temperature, probably to facilitate the intramolecular stabilization of RNA secondary structure at elevated temperature [47, 141, 142], but no such simple correlation exists for the genomic DNA [143]. Regression analyses of the sequence data for thermophilic, mesophilic (OGT = 20-50°C) and psychrophilic (OGT <20°C) bacteria revealed linear relationships between OGT and a combination of purine and pyrimidine dimer compositions, RR +YY – RY – YR, where R= A/G, Y = C/T), the correlation coefficient being 0.66 [144, 145]. Evaluation of the physicochemical parameters of dinucleotides suggested that such linear relationship may be attributed to distinct levels of supercoiling of DNA relevant to its thermostability [145]. It has also been demonstrated that in microbes adapted to high temperature, the purine-pyrimidine skew (R-Y)/(R+Y) correlates strongly with the location of the ORFs in two strands, so that the ORFs residing in both the direct and complementary strands, in general, tend to be purine-rich [47, 139]. While the predicted ORFs of thermophiles and hyperthermophiles are characterized by overre-presentation of purine content, the structural RNA genes of hyperthermophiles, in general, exhibit much higher G+C-content than those of the mesophiles [47].

Differences in the codon usage between thermophilic and mesophilic organisms have been described by several investigators [44-48]. Montanucci et al. [146] formulated a codon frequency index that could highlight robust determinants of thermostability capable of discriminating thermophilic from mesophilic genomes.

However, it is not clear yet whether the selection in favour of purine-rich coding sequences in thermophiles has its root at the nucleic acid levels or protein levels. Lobry & Chassel [45] argued that the trend for the amino-acid composition of thermophilic proteins could be under the control of a pressure at the nucleic acid level, not a selection at the protein level. It was also proposed that the selection for purine-rich mRNA sequences in thermophilic organisms may minimize unnecessary RNA-RNA interactions and prevent double-strand RNA formation within the molecule [147]. On the contrary, an analysis of 204 complete proteomes of archaea and bacteria spanning the temperature range from −10 °C to 110 °C [148] indicated that the specific amino acid composition adaptation at distinct thermal environment might be a primary factor, while the signatures at the nucleotide level, such as purine loading index, may largely be the consequences of the amino acid adaptation requirement.

ii). Microbes Thriving at High Salinity

Microbes thriving in hypersaline environments like the Great Salt Lake in Utah, Owens Lake in California, the Dead Sea etc. are known as halophilic (salt-loving) organisms. In order to prevent desiccation through osmotic movement of water out of their cytoplasm, halophiles employ two different strategies: the “compatible-solute” strategy, involving accumulation of osmoprotecting solutes in the cytoplasm or the salt-in strategy that relies on selective influx of inorganic ions, especially K+ and Cl, into the cytoplasm. The compatible solute strategy does not require any significant structural adjustment in intracellular macromolecules. But in the salt-in haloadaptation, the entire intracellular protein machinery must adapt themselves to high salt level and as a consequence, taxonomically divergent halophiles show similar amino acid and other compositional biases, irrespective of their genomic GC-bias [24].

The obligatory halophiles generally contain G+C-rich genomes (well above 60%), presumably to avoid UV induced thymidine dimer formation and possible accumulation of mutations in their specialized habitat (shallow coastal lagoons), characterized by high levels of UV irradiation [149, 150]. The extreme halophilic archaeon Haloquadratum walsbyi is so far the only exception, with a remarkably low genomic G+C-content of 47.9% [151]. In H. walsbyi, the disadvantage of a low G+C-genome is thought to be partly compensated for by the presence of a relatively higher number (four copies) of photolyases [151].

All obligatory halophiles show specific dinucleotide abundance signatures, characterized by overrepresentation of CG, GA/TC and AC/GT, which may be regarded as specific genomic siganture of haloadaptation. The abundance of GA, AC and GT dinucleotides may partly be coupled with the specific amino acid requirements at the protein level, while the abundance of CG increases the propensity to transition from B-DNA to a Z-DNA conformation that is stabilized at high salt concentrations [152]. The requirement for Asp, Glu, Thr and Val residues in halophilic protein sequences increases frequencies of GA, AC and GT dinucleotides at the first and second codon positions of the genomic DNA. The pattern of synonymous codon usage in halophiles has also shown to be significantly different from that in non-halophiles.

CONCLUDING REMARKS

The genomic architecture of a microbe often bears the signatures not only of its phylogenetic position, but also of the kind of lifestyle to which it is adapted. Two taxonomically, spatially and temporally distant microbial populations may acquire similar genomic traits, if they intend to flourish at similar ecological niches; while two closely related microbes, when acclimatized to differential environmental conditions, may undergo adaptive radiations through selection of conspicuous genomic traits. Niche-specific genome signatures may include but are not restricted to the sequence features like base composition, GC-skew, purine-pyrimidine ratio, dinucleotide abundance, codon bias, and oligonucleotide composition, presence of specific gene-families, horizontal acquisition of genome islands and the processes of genome shrinkage/expansion. The remarkable diversity in the niche-specific signature features, characterized so far, underscores the evolutionary plasticity of the microbial genomes and there are reasons to believe that this represents only the tip of an iceberg - with most of the signature features remained to be explored and unveiled yet. With ever-increasing number of microbial genome sequences in the public domain, one would expect revelation of many more novel niche-specific genome signatures in microbes adapted to different specialized lifestyles or extreme ecological niches. Such signatures, if properly interpreted, may not only offer insight into the molecular strategies of niche specialization in microorganisms, but may also have far-reaching implications of metagenomic and biotechnological perspective.

ACKNOWLEDGEMENT

The authors acknowledge the infrastructural support received from Bioinformatics Centre, CSIR-Indian Institute of Chemical Biology. SP was supported by Senior Research Fellowships from Council of Scientific and Industrial Research, India.

REFERENCES

  • 1.Karlin S, Burge C. Dinucleotide relative abundance extremes a genomic signature. Trends Genet . 1995;11:283–290. doi: 10.1016/s0168-9525(00)89076-9. [DOI] [PubMed] [Google Scholar]
  • 2.Blaisdell BE. A measure of the similarity of sets of sequences not requiring sequence alignment. Proc Natl Acad Sci USA. 1986;83:5155–5159. doi: 10.1073/pnas.83.14.5155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Brendel V, Beckmann JS, Trifonov EN. Linguistics of nucleotide sequences: morphology and comparison of vocabularies. J Biomol Struct Dyn . 1986;4:11–21. doi: 10.1080/07391102.1986.10507643. [DOI] [PubMed] [Google Scholar]
  • 4.Pietrokovski S, Hirshon J, Trifonov EN. Linguistic measure of taxonomic and functional relatedness of nucleotide sequences. J Biomol Struct Dyn. 1990;7:1251–1268. doi: 10.1080/07391102.1990.10508563. [DOI] [PubMed] [Google Scholar]
  • 5.Foerstner KU, von Mering C, Hooper SD, Bork P. Environments shape the nucleotide composition of genomes. EMBO Rep . 2005;6:1208–1213. doi: 10.1038/sj.embor.7400538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Rocha EPC, Feil EJ. Mutational patterns cannot explain genome composition are there any neutral sites in the genomes of bacteria? Plos Genet . 2010;6:e1001104. doi: 10.1371/journal.pgen.1001104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hildebrand F, Meyer A, Eyre-Walker A. Evidence of selection upon genomic GC-content in bacteria. PLoS Genet . 2010;6:e1001107. doi: 10.1371/journal.pgen.1001107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sueoka N. Directional mutation pressure and neutral molecular evolution. Proc Natl Acad Sci USA. 1988;85:2653–2657. doi: 10.1073/pnas.85.8.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lynch M. The origins of genome architecture. Sunderland, MA:: Sinauer Associates, Inc; 2007. [Google Scholar]
  • 10.Kimura M. Evolutionary rate at the molecular level. Nature. 1968;217:624–626. doi: 10.1038/217624a0. [DOI] [PubMed] [Google Scholar]
  • 11.Suzuki DT, Griffiths AJF, Miller JH, Lewontin RC. in An Introduction to Genetic Analysis. 4th ed. W.H. Freeman; 1989. p. 704. [Google Scholar]
  • 12.Musto H, Naya H, Zavala A, Romero H, Alvarez-Valin F, Bernardi G. Genomic GC level optimal growth temperature and genome size in prokaryotes. Biochem Biophys Res Commun . 2006;347:1–3. doi: 10.1016/j.bbrc.2006.06.054. [DOI] [PubMed] [Google Scholar]
  • 13.Rocha EPC. Base composition might result from competition for metabolic resources. Trends Genet . 2002;18:291–294. doi: 10.1016/S0168-9525(02)02690-2. [DOI] [PubMed] [Google Scholar]
  • 14.Mann S, Chen YP. Bacterial genomic G + C composition-eliciting environmental adaptation. Genomics . 2010;95:7–1. doi: 10.1016/j.ygeno.2009.09.002. [DOI] [PubMed] [Google Scholar]
  • 15.Naya H, Romero H, Zavala A, Alvarez B, Musto H. Aerobiosis increases the genomic guanine plus cytosine content (GC%) in prokaryotes. J Mol Evol. 2002;55:260–264. doi: 10.1007/s00239-002-2323-3. [DOI] [PubMed] [Google Scholar]
  • 16.McEwan CE, Gatherer D, McEwan NR. Nitrogen-fixing aerobic bacteria have higher genomic GC content than non-fixing species within the same genus. Hereditas. 1998;128:173–178. doi: 10.1111/j.1601-5223.1998.00173.x. [DOI] [PubMed] [Google Scholar]
  • 17.Heddi A, Charles H, Khatchadourian C, Bonnot G, Nardon P. Molecular characterization of the principal symbiotic bacteria of the weevil Sitophilus oryzae a peculiar G+C content of an endocytobiotic. DNA J Mol Evol . 1998;47: 52–61. doi: 10.1007/pl00006362. [DOI] [PubMed] [Google Scholar]
  • 18.Moran NA. Microbial minimalism genome reduction in bacterial pathogens. Cell . 2002;108:583–586. doi: 10.1016/s0092-8674(02)00665-7. [DOI] [PubMed] [Google Scholar]
  • 19.Rocha EPC, Danchin A. Base composition bias might result from competition for metabolic resources. Trends Genet . 2002;18:291–294. doi: 10.1016/S0168-9525(02)02690-2. [DOI] [PubMed] [Google Scholar]
  • 20.Sandberg R, Bränden C, Ernberg I, Cöster J. Quantifying the species-specificity in genomic signatures synonymous codon choice amino acid usage and G + C content. Gene. 2003;311:35–42. doi: 10.1016/s0378-1119(03)00581-x. [DOI] [PubMed] [Google Scholar]
  • 21.Gelfand MS, Koonin EV. Avoidance of palindromic words in bacterial and archaeal genomes A close connection with restriction enzymes. Nucl Acids Res . 1997;25:2430–2439. doi: 10.1093/nar/25.12.2430. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Karlin S, Mrázek J, Campbell AM. Compositional biases of bacterial genomes and evolutionary implications. J Bacteriol . 1997;179:3899–3913. doi: 10.1128/jb.179.12.3899-3913.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Mark WJ, van Passel Kuramae EE, Luyf ACM, Bart A, Boekhout T. The reach of the genome signature in prokaryotes. BMC Evol Biol . 2006;6:84. doi: 10.1186/1471-2148-6-84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Paul S, Bag SK, Das S, Harvill ET, Dutta C. Molecular signature of hypersaline adaptation insights from genome and proteome composition of halophilic prokaryotes. Genome Biol . 2008;9:R70. doi: 10.1186/gb-2008-9-4-r70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Burge C, Campbell AM, Karlin S. Over- and under-representation of short oligonucleotides in DNA sequences. Proc Natl Acad Sci. 1992;89:1358–1362. doi: 10.1073/pnas.89.4.1358. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Nakashima H, Ota M, Nishikawa K, Ooi T. Genes from nine genomes are separated into their organisms in the dinucleotide composition space. DNA Res . 1998;5:251–259. doi: 10.1093/dnares/5.5.251. [DOI] [PubMed] [Google Scholar]
  • 27.Pride D T, Meinersmann R J, Wassenaar T M, Blaser M J. Evolutionary implications of microbial genome tetranucleotide frequency biases. Genome Res . 2003;13:145–158. doi: 10.1101/gr.335003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Teeling H, Meyerdierks A, Bauer M, Amann R, Glockner FO. Application of tetranucleotide frequencies for the assignment of genomic fragments. Environ Microbiol . 2004;6:938–947. doi: 10.1111/j.1462-2920.2004.00624.x. [DOI] [PubMed] [Google Scholar]
  • 29.Abe T, Kanaya S, Kinouchi M, Ichiba Y, Kozuki T, Ikemura T. Informatics for unveiling hidden genome signatures. Genome Res. 2003;13:693–702. doi: 10.1101/gr.634603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Fertil B, Massin M, Lespinats S, Devic C, Dumee P, Giron A. GENSTYLE exploration and analysis of DNA sequences with genomic signature. Nucleic Acids Res . 2005;33:W512–W515. doi: 10.1093/nar/gki489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Bohlin J, Skjerve E, Ussery DW. Investigations of oligonucleotide usage variance within and between prokaryotes. PLoS Comput. Biol. 2008;4:e1000057. doi: 10.1371/journal.pcbi.1000057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Dick GJ, Andersson AF, Baker BJ, Simmons SL, Thomas BC, Yelton AP, Banfield JF. Community-wide analysis of microbial genome sequence signatures. Genome Biol . 2009;10:R85. doi: 10.1186/gb-2009-10-8-r85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Tyagi A, Bag SK, Shukla V, Roy S, Tuli R. Oligonucleotide frequencies of barcoding loci can discriminate species across kingdoms. Plos One . 2010;5:e12330. doi: 10.1371/journal.pone.0012330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Perry SC, Beiko RG. Distinguishing Microbial Genome Fragments Based on Their Composition Evolutionary and Comparative Genomic Perspectives. Genome Biol Evol . 2010;2:117–131. doi: 10.1093/gbe/evq004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Dyer BD, Kahn MJ, Leblanc MD. Classification and regression tree (CART) analyses of genomic signatures reveal sets of tetramers that discriminate temperature optima of archaea and bacteria. Archaea. 2008;2:159–167. doi: 10.1155/2008/829730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Blaisdell BE, Campbell AM, Karlin S. Similarities and dissimilarities of phage genomes. Proc Natl Acad Sci USA. 1996;93:5854–5859. doi: 10.1073/pnas.93.12.5854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Deschavanne P, DuBow M S, Regeard C. The use of genomic signature distance between bacteriophages and their hosts displays evolutionary relationships and phage growth cycle determination. Virol J. 2010;7:163. doi: 10.1186/1743-422X-7-163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Karlin S. Global dinucleotide signatures and analysis of genomic heterogeneity. Curr Opin Microbiol . 1998;1:598–610. doi: 10.1016/s1369-5274(98)80095-7. [DOI] [PubMed] [Google Scholar]
  • 39.Grantham R, Gautier C, Gouy M, Mercier R, Pavé A. Codon catalog usage and the genome hypothesis. Nucleic Acids Res . 1980;8:r49–r62. doi: 10.1093/nar/8.1.197-c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Andersson SGE, Kurland CG. Codon preferences in free-living microorganisms. Microbiol Rev . 1990;54:198–210. doi: 10.1128/mr.54.2.198-210.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Bulmer M. The selection-mutation-drift theory of synonymous codon usage. Genetics. 1991;129:897–907. doi: 10.1093/genetics/129.3.897. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Sharp PM, Stenico M, Peden JF, Lloyd AT. Codon usage: Mutational bias, translational selection, or both? Biochem Soc Trans. 1993;21:835–841. doi: 10.1042/bst0210835. [DOI] [PubMed] [Google Scholar]
  • 43.Rocha EP. Codon usage bias from tRNA’s point of view redundancy specialization and efficient decoding for translation optimization. Genome Res . 2004;14:2279–2286. doi: 10.1101/gr.2896904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lynn DJ, Singer GAC, Hickey DA. Synonymous codon usage is subject to selection in thermophilic bacteria. Nucl Acids Res . 2002;30:4272–4277. doi: 10.1093/nar/gkf546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Lobry JR, Chessel D. Internal correspondence analysis of codon and amino-acid usage in thermophilic bacteria. J Appl Genet . 2003;44:235–261. [PubMed] [Google Scholar]
  • 46.Singer GA, Hickey DA. Thermophilic prokaryotes have characteristic patterns of codon usage amino acid composition and nucleotide content. Gene. 2003;317:39–47. doi: 10.1016/s0378-1119(03)00660-7. [DOI] [PubMed] [Google Scholar]
  • 47.Das S, Paul S, Bag SK, Dutta C. Analysis of Nanoarchaeum equitans genome and proteome composition Indications for hyperthermophilic and parasitic adaptation. BMC Genomics. 2006;7:186. doi: 10.1186/1471-2164-7-186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Lobry JR, Necsulea A. Synonymous codon usage and its potential link with optimal growth temperature in prokaryotes. Gene. 2006;385:128–136. doi: 10.1016/j.gene.2006.05.033. [DOI] [PubMed] [Google Scholar]
  • 49.Willenbrock H, Friis C, Juncker AS, Ussery DW. An environmental signature for 323 microbial genomes based on codon adaptation indices. Genome Biol . 2006;7:R114. doi: 10.1186/gb-2006-7-12-r114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Shackelton LA, Parrish CR, Holmes EC. Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses. J Mol Evol . 2006;62:551–563. doi: 10.1007/s00239-005-0221-1. [DOI] [PubMed] [Google Scholar]
  • 51.Zhang J, Wang M, Liu W, Zhou J, Chen H, Ma L, Ding Y, Gu Y, Liu Y. Analysis of codon usage and nucleotide composition bias in polioviruses. Virol J . 2011;8:146. doi: 10.1186/1743-422X-8-146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Kri0eg A M. CpG DNA trigger of sepsis mediator of protection or both? Scand J Infect Dis. 2003;35:653–659. doi: 10.1080/00365540310015999. [DOI] [PubMed] [Google Scholar]
  • 53.Ambinder RF, Robertson KD, Tao Q. DNA methylation and the Epstein-Barr virus. Semin. Cancer Biol . 1999;9:369–375. doi: 10.1006/scbi.1999.0137. [DOI] [PubMed] [Google Scholar]
  • 54.Wong EH, Smith DK, Rabadan R, Peiris M, Poon LL. Codon usage bias and the evolution of influenza A viruses Codon Usage Biases of Influenza Virus. BMC Evol Biol . 2010;10:253. doi: 10.1186/1471-2148-10-253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Forsdyke D R, Bell S J. Purine loading stem-loops and Chargaff's second parity rule a discussion of the application of elementary principles to early chemical observations. Appl Bioinformatics. 2004;3:3–8. doi: 10.2165/00822942-200403010-00002. [DOI] [PubMed] [Google Scholar]
  • 56.Lobry J R. Asymmetric substitution patterns in the two DNA strands of bacteria. Mol Biol Evol . 1996;13:660–665. doi: 10.1093/oxfordjournals.molbev.a025626. [DOI] [PubMed] [Google Scholar]
  • 57.Hacker J, Kaper J B. Pathogenicity islands and the evolution of microbes. Annu Rev Microbiol . 2000;54:641–679. doi: 10.1146/annurev.micro.54.1.641. [DOI] [PubMed] [Google Scholar]
  • 58.van Belkum A, Scherer S, van Alphen L, Verbrugh H. Short-sequence DNA repeats in prokaryotic genomes. Microbiol Mol Biol Rev. 1998;62:275–293. doi: 10.1128/mmbr.62.2.275-293.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Mrázek J. Phylogenetic signals in DNA composition limitations and prospects. Mol Biol Evol. 2009;26:1163–1169. doi: 10.1093/molbev/msp032. [DOI] [PubMed] [Google Scholar]
  • 60.Kirzhner V, Nevo E, Korol A, Bolshoy A. A large-scale comparison of genomic sequences: one promising approach. Acta Biotheor . 2003;51:73–89. doi: 10.1023/a:1024553109779. [DOI] [PubMed] [Google Scholar]
  • 61.Kirzhner V, Paz A, Volkovich Z, Nevo E, Korol A. Different clustering of genomes across life using the A-T-C-G and degenerate R-Y alphabets early and late signaling on genome evolution? J Mol Evol . 2007;64:448–456. doi: 10.1007/s00239-006-0178-8. [DOI] [PubMed] [Google Scholar]
  • 62.Moran NA, Plaque GR. Genomic changes following host restriction in bacteria. Curr Opin Genet Dev. 2004;14:627–633. doi: 10.1016/j.gde.2004.09.003. [DOI] [PubMed] [Google Scholar]
  • 63.Klasson L, Andersson SGE. Evolution of minimal-gene-sets in host-dependent bacteria. Trends Microbiol. 2004;12:37–43. doi: 10.1016/j.tim.2003.11.006. [DOI] [PubMed] [Google Scholar]
  • 64.Wernegreen JJ. For better or worse genomic consequences of intracellular mutualism and parasitism. Curr Opin Genet Dev . 2005;15:572–583. doi: 10.1016/j.gde.2005.09.013. [DOI] [PubMed] [Google Scholar]
  • 65.Moya A, Peretó J, Gil R, Latorre A. Learning how to live together: genomic insights into prokaryote-animal symbioses. Nat Rev Genet . 2008;9:218–229. doi: 10.1038/nrg2319. [DOI] [PubMed] [Google Scholar]
  • 66.Moran NA, Wernegreen JJ. Lifestyle evolution in symbiotic bacteria: insights from genomics. Trends Ecol Evol . 2000;15:321–326. doi: 10.1016/s0169-5347(00)01902-9. [DOI] [PubMed] [Google Scholar]
  • 67.Andersson SGE, Kurland CG. Reductive evolution of resident genomes. Trends Microbiol . 1998;6:263–268. doi: 10.1016/s0966-842x(98)01312-2. [DOI] [PubMed] [Google Scholar]
  • 68.Mira A, Moran NA. Estimating population size and transmission bottlenecks in maternally transmitted endosymbiotic bacteria. Microb Ecol. 2002;44:137–143. doi: 10.1007/s00248-002-0012-9. [DOI] [PubMed] [Google Scholar]
  • 69.Woolfit M, Bromham L. Increased rates of sequence evolution in endosymbiotic bacteria and fungi with small effective population sizes. Mol Biol Evol . 2003;20:1545–1555. doi: 10.1093/molbev/msg167. [DOI] [PubMed] [Google Scholar]
  • 70.O'Fallon B. Population structure levels of selection and the evolution of intracellular symbionts. Evolution . 2008;62:361–373. doi: 10.1111/j.1558-5646.2007.00289.x. [DOI] [PubMed] [Google Scholar]
  • 71.Moran NA. Accelerated evolution and Muller's rachet in endosymbiotic bacteria. Proc Natl Acad Sci USA. 1996;93:2873–2878. doi: 10.1073/pnas.93.7.2873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Muller JJ. The relation of recombination to mutational advance. Mutat Res . 1964;1:2–9. doi: 10.1016/0027-5107(64)90047-8. [DOI] [PubMed] [Google Scholar]
  • 73.Mendonça A G, Alves R J, Pereira-Leal J B. Loss of genetic redundancy in reductive genome evolution. PLoS Comput Biol . 2011;7:e1001082. doi: 10.1371/journal.pcbi.1001082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Glass J L, Lefkowitz E J, Glass J S, Hlener C R, Chen E Y, Cassell G H. The complete sequence of the mucosal pathogen Ureaplasma urealyticum. Nature. 2000;407:757–762. doi: 10.1038/35037619. [DOI] [PubMed] [Google Scholar]
  • 75.Tamames J, Moya A, Valencia A. Modular organization in the reductive evolution of protein-protein interaction networks. Genome Biol. 2007;8:R94. doi: 10.1186/gb-2007-8-5-r94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Zientz E, Dandekar T, Gross R. Metabolic interdependence of obligate intracellular bacteria and their insect hosts. Microbiol Mol Biol Rev . 2004;68:745–770. doi: 10.1128/MMBR.68.4.745-770.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Lerat E, Ochman H. Ψ-Φ: exploring the outer limits of bacterial pseudogenes. Genome Res. 2004;14:2273–2278. doi: 10.1101/gr.2925604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Mira A, Pushker R. The silencing of pseudogenes. Mol Biol Evol . 2005;22:2135–2138. doi: 10.1093/molbev/msi209. [DOI] [PubMed] [Google Scholar]
  • 79.Schneider D, Duperchy E, Coursange E, Lenski RE, Blot M. Long-term experimental evolution in Escherichia coli. IX. Characterization of insertion sequence-mediated mutations and rearrangements. Genetics. 2000;156:477–488. doi: 10.1093/genetics/156.2.477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Tamas I, Klasson L, Canbäck B, Näslund A K, Eriksson A S, Wernegreen J J, Sandström J P, Moran N A, Andersson S G. 50 million years of genomic stasis in endosymbiotic bacteria. Science. 2002;296:2376–2379. doi: 10.1126/science.1071278. [DOI] [PubMed] [Google Scholar]
  • 81.Wu M, Sun LV, Vamathevan J, Riegler M, Deboy R, Brownlie JC, McGraw EA, Martin W, Esser C, Ahmadinejad N, Wiegand C, Madupu R, Beanan MJ, Brinkac LM, Daugherty SC, Durkin AS, Kolonay JF, Nelson WC, Mohamoud Y, Lee P, Berry K, Young MB, Utterback T, Weidman J, Nierman WC, Paulsen IT, Nelson KE, Tettelin H, O'Neill SL, Eisen JA. Phylogenomics of the reproductive parasite Wolbachia pipientis wMel a streamlined genome overrun by mobile genetic elements. PLoS Biol . 2004;2:E69. doi: 10.1371/journal.pbio.0020069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Andersson SG, Zomorodipour A, Andersson JO, Sicheritz-Ponten T, Alsmark UC, Podowski RM, Naslund AK, Eriksson AS, Winkler HH, Kurland CG. The genome sequence of Rickettsia prowazekii and the origin of mitochondria. Nature. 1998;396:133–140. doi: 10.1038/24094. [DOI] [PubMed] [Google Scholar]
  • 83.Cole ST, Eiglmeier K, Parkhill J, James KD, Thomson NR, Wheeler PR, Honore N, Garnier T, Churcher C, Harris D, Mungall K, Basham D, Brown D, Chillingworth T, Connor R, Davies RM, Devlin K, Duthoy S, Feltwell T, Fraser A, Hamlin N, Holroyd S, Hornsby T, Jagels K, Lacroix C, Maclean J, Moule S, Murphy L, Oliver K, Quail MA, Rajandream MA, Rutherford KM, Rutter S, Seeger K, Simon S, Simmonds M, Skelton J, Squares R, Squares S, Stevens K, Taylor K, Whitehead S, Woodward JR, Barrell BG. Massive gene decay in the leprosy bacillus. Nature . 2001;409:1007–1011. doi: 10.1038/35059006. [DOI] [PubMed] [Google Scholar]
  • 84.Mira A, Pushker R, Rodriguez-Valera F. The Neolithic revolution of bacterial genomes. Trends Microbiol . 2006;14:200–206. doi: 10.1016/j.tim.2006.03.001. [DOI] [PubMed] [Google Scholar]
  • 85.Frese S A, Benson A K, Tannock G W, Loach D M, Kim J, Zhang M, Oh PL, Heng N C, Patil P B, Juge N, Mackenzie D A, Pearson B M, Lapidus A, Dalin E, Tice H, Goltsman E, Land M, Hauser L, Ivanova N, Kyrpides N C, Walter J. The evolution of host specialization in the vertebrate gut symbiont Lactobacillus reuteri. PLoS Genet. 2011;7:e1001314. doi: 10.1371/journal.pgen.1001314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Greub G, Raoult D. Microorganisms resistant to free-living amoebae. Clin Micro Rev . 2004;17:413–433. doi: 10.1128/CMR.17.2.413-433.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Moliner C, Fournier P E, Raoult D. Genome analysis of microorganisms living in amoebae reveals a melting pot of evolution. FEMS Microbiol Rev. 2010;34:281–294. doi: 10.1111/j.1574-6976.2010.00209.x. [DOI] [PubMed] [Google Scholar]
  • 88.Mcinerney J O. Replicational and transcriptional selection on codon usage in Borrelia burgdoreferi. Proc Natl Acad Sci USA. 1998;95:10698–10703. doi: 10.1073/pnas.95.18.10698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Romero H, Zavala A, Musto H. Codon usage in Chlamydia trachomatis is the result of strandspecific mutational biases and a complex pattern of selective forces. Nucleic Acids Res. 2000;28:2084–2090. doi: 10.1093/nar/28.10.2084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Lobry J R, Louarn J M. Polarization of prokaryotic chromosomes. Curr Opin Microbiol . 2003;6:101–108. doi: 10.1016/s1369-5274(03)00024-9. [DOI] [PubMed] [Google Scholar]
  • 91.Lobry J R, Sueoka N. Asymmetric directional mutation pressures in bacteria. Genome Biol. 2002;3 doi: 10.1186/gb-2002-3-10-research0058. research0058.1-0058.14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Francino M P, Ochman H. Deamination as the basis of strand-asymmetric evolution in transcribed Escherichia coli sequences. Mol Biol Evol . 2001;18:1147–1150. doi: 10.1093/oxfordjournals.molbev.a003888. [DOI] [PubMed] [Google Scholar]
  • 93.Lobry J R. Asymmetric substitution patterns in the two DNA strands of bacteria. Mol Biol Evol . 1996;13:660–665. doi: 10.1093/oxfordjournals.molbev.a025626. [DOI] [PubMed] [Google Scholar]
  • 94.Rocha E P. The replication-related organization of bacterial genomes. Microbiology . 2004;150:1609–1627. doi: 10.1099/mic.0.26974-0. [DOI] [PubMed] [Google Scholar]
  • 95.Wei W, Guo F B. Strong Strand Composition Bias in the Genome of Ehrlichia canis Revealed by Multiple Methods. Open Microbiol J. 2010;4:98–102. doi: 10.2174/1874285801004010098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Rispe C, Delmotte F, van Ham R C, Moya A. Mutational and selective pressures on codon and amino acid usage in Buchnera endosymbiotic bacteria of aphids. Genome Res . 2004;14:44–53. doi: 10.1101/gr.1358104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Das S, Paul S, Chatterjee S, Dutta C. Codon and amino acid usage in two major human pathogens of genus Bartonella--optimization between replicational-transcriptional selection translational control and cost minimization. DNA Res . 2005;12:91–102. doi: 10.1093/dnares/12.2.91. [DOI] [PubMed] [Google Scholar]
  • 98.Das S, Paul S, Dutta C. Evolutionary constraints on codon and amino acid usage in two strains of human pathogenic actinobacteria Tropheryma whipplei. J Mol Evol . 2006;62:645–658. doi: 10.1007/s00239-005-0164-6. [DOI] [PubMed] [Google Scholar]
  • 99.Guo F B, Yu X J. Separate base usages of genes located on the leading and lagging strands in Chlamydia muridarum revealed by the Z curve method. BMC Genomics. 2007;8:366. doi: 10.1186/1471-2164-8-366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Guo F B, Yuan J B. Codon usages of genes on chromosome and surprisingly genes in plasmid are primarily affected by strand-specific mutational biases in Lawsonia intracellularis. DNA Res . 2009;16:91–104. doi: 10.1093/dnares/dsp001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Wei W, Guo F B. Strong Strand Composition Bias in the Genome of Ehrlichia canis Revealed by Multiple Methods. Open Microbiol J. 2010;4:98–102. doi: 10.2174/1874285801004010098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Das S, Paul S, Dutta C. Synonymous codon usage in adenoviruses: influence of mutation selection and protein hydropathy. Virus Res . 2006;117:227–236. doi: 10.1016/j.virusres.2005.10.007. [DOI] [PubMed] [Google Scholar]
  • 103.Beletskii A, Bhagwat A S. Transcription-induced mutations: increase in C to T mutations in the nontranscribed strand during transcription in Escherichia coli. Proc Natl Acad Sci USA. 1996;93:13919–13924. doi: 10.1073/pnas.93.24.13919. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Frank A C, Lobry J R. Asymmetric substitution patterns a review of possible underlying mutational or selective mechanisms. Gene. 1999;238:65–77. doi: 10.1016/s0378-1119(99)00297-8. [DOI] [PubMed] [Google Scholar]
  • 105.Necsulea A, Lobry J R. A new method for assessing the effect of replication on DNA base composition asymmetry. Mol Biol Evol . 2007;24:2169–2179. doi: 10.1093/molbev/msm148. [DOI] [PubMed] [Google Scholar]
  • 106.Lopez P, Philippe H. Composition strand asymmetries in prokaryotic genomes mutational bias and biased gene orientation. C R Acad Sci III. 2001;324:201–208. doi: 10.1016/s0764-4469(00)01298-1. [DOI] [PubMed] [Google Scholar]
  • 107.Rocha E P, Danchin A. Essentiality not expressiveness drives gene-strand bias in bacteria. Nat Genet . 2003;34:377–378. doi: 10.1038/ng1209. [DOI] [PubMed] [Google Scholar]
  • 108.Klasson L, Andersson S G. Strong asymmetric mutation bias in endosymbiont genomes coincide with loss of genes for replication restart pathways. Mol Biol Evol . 2006;23:1031–1039. doi: 10.1093/molbev/msj107. [DOI] [PubMed] [Google Scholar]
  • 109.Ogata H, Audic S, Renesto-Audiffren P, Fournier PE, Barbe V, Samson D, Roux V, Cossart P, Weissenbach J, Claverie JM, Raoult D. Mechanisms of evolution in Rickettsia conorii and R prowazekii. Science. 2001;293: 2093–2098. doi: 10.1126/science.1061471. [DOI] [PubMed] [Google Scholar]
  • 110.Andersson J O, Andersson S G. Genomic rearrangements during evolution of the obligate intracellular parasite Rickettsia prowazekii as inferred from an analysis of 52015 bp nucleotide sequence. Microbiology. 1997;143:2783–2795. doi: 10.1099/00221287-143-8-2783. [DOI] [PubMed] [Google Scholar]
  • 111.Wu M, Sun L V, Vamathevan J, Riegler M, Deboy MR, Brownlie JC, McGraw EA, Martin W, Esser C, Ahmadinejad N, Wiegand C, Madupu R, Beanan MJ, Brinkac LM, Daugherty SC, Durkin AS, Kolonay JF, Nelson WC, Mohamoud Y, Lee P, Berry K, Young MB, Utterback T, Weidman J, Nierman WC, Paulsen IT, Nelson KE, Tettelin H, O'Neill SL, Eisen JA. Phylogenomics of the reproductive parasite Wolbachia pipientis wMel a streamlined genome overrun by mobile genetic elements. PLoS Biol . 2004;2:0327–0341. doi: 10.1371/journal.pbio.0020069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Hacker J, Blum-Oehler G, Mühldorfer I, Tschäpe H. Pathogenicity islands of virulent bacteria: structure function and impact on microbial evolution. Mol Microbiol . 1997;23:1089–97. doi: 10.1046/j.1365-2958.1997.3101672.x. [DOI] [PubMed] [Google Scholar]
  • 113.Blum G, Ott M, Lischewski A, Ritter A, Imrich H, Tschäpe H, Hacker J. Excision of large DNA regions termed pathogenicity islands from tRNA-specific loci in the chromosome of an Escherichia coli wild-type pathogen. Infect Immun. 1994;62:606–614. doi: 10.1128/iai.62.2.606-614.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Strauss EJ, Falkow S. Microbial pathogenesis: genomics and beyond. Science. 1997;276:707–712. doi: 10.1126/science.276.5313.707. [DOI] [PubMed] [Google Scholar]
  • 115.Dobrindt U, Hochhut B, Hentschel U, Hacker J. Genomic islands in pathogenic and environmental microorganisms. Nat Rev Microbiol . 2004;2:414–424. doi: 10.1038/nrmicro884. [DOI] [PubMed] [Google Scholar]
  • 116.Schmidt H, Hensel M. Pathogenicity islands in bacterial pathogenesis. Clin Microbiol Rev . 2004;17:14–56. doi: 10.1128/CMR.17.1.14-56.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Groisman EA, Ochman H. Pathogenicity islands bacterial evolution in quantum leaps. Cell. 1996;87:791–794. doi: 10.1016/s0092-8674(00)81985-6. [DOI] [PubMed] [Google Scholar]
  • 118.Finlay BB, Falkow S. Common themes in microbial pathogenicity revised. Microbiol Mol Biol Rev . 1997;61:136–169. doi: 10.1128/mmbr.61.2.136-169.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Das S, Pan A, Paul S, Dutta C. Comparative Analyses of Codon and Amino Acid Usage in Symbiotic Island and Core Genome in Nitrogen-Fixing Symbiotic Bacterium Bradyrhizobium japonicum. J Biomol Struc Dyn. 2005;23:231–232. doi: 10.1080/07391102.2005.10507061. [DOI] [PubMed] [Google Scholar]
  • 120.Preston GM, Hauboldt B, Rainey PB. Bacterial genomics and adaptation to life on plants: implications for the evolution of pathogenicity and symbiosis. Curr Opin Microbiol . 1998;1:589–597. doi: 10.1016/s1369-5274(98)80094-5. [DOI] [PubMed] [Google Scholar]
  • 121.Sullivan JT, Ronson CW. Evolution of rhizobia by acquisition of a 500-kb symbiosis island that integrates into a phe-tRNA gene. Proc Natl Acad Sci USA. 1998;95:5145–5149. doi: 10.1073/pnas.95.9.5145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Ravatn R, Studer S, Springael D, Zehnder AJ, van der MJR. Chromosomal integration tandem amplification and deamplification in Pseudomonas putida F1 of a 105-kilobase genetic element containing the chlorocatechol degradative genes from Pseudomonas sp Strain B13. J Bacteriol . 1998;180:4360–4369. doi: 10.1128/jb.180.17.4360-4369.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Ito T, Katayama Y, Hiramatsu K. Cloning and nucleotide sequence determination of the entire mec DNA of pre-methicillin-resistant Staphylococcus aureus N315. Antimicrob Agents Chemother. 1999;43:1449–1458. doi: 10.1128/aac.43.6.1449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Hacker J. Urinary tract infection: From basic science to clinical application. In: Emödy L, Blum G, Hacker J, Pal T, editors. Genes and Proteins Underlying Microbial Urinary Tract Virulence: Basic Aspects and Applications. Advances in Experimental Medicine and Biology. 2000. pp. 1–8. [DOI] [PubMed] [Google Scholar]
  • 125.Khan A S, Kniep B, Ölschläger AT, Van Die I, Korhonen T, Hacker J. The receptor structure for F1C fimbriae of uropathogenic Escherichia coli. Infect Immun . 2000;68:3541–3547. doi: 10.1128/iai.68.6.3541-3547.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Galán JE, Collmer A. Type III secretion machines bacterial devices for protein delivery into host cells. Science. 1999;284:1322–1328. doi: 10.1126/science.284.5418.1322. [DOI] [PubMed] [Google Scholar]
  • 127.Parsot C, Sansonetti PJ. The virulence plasmid of Shigellae an archipelago of pathogenicity islands? In: Kaper J, Hacker J, editors. Pathogenicity Islands and Other Mobile Virulence Elements. Washington, DC: ASM Press; 1999. pp. 151–65. [Google Scholar]
  • 128.Cornelis GR, Boland A, Boyd AP, Geuijen C, Iriarte M, Neyt C, Sory MP, Stainier I. The virulence plasmid of Yersinia an antihost genome. Microbiol Mol Biol Rev . 1998;62:1315–1352. doi: 10.1128/mmbr.62.4.1315-1352.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Vogel JP, Andrews HL, Wong SK, Isberg RR. Conjugative transfer by the virulence system of Legionella pneumophila. Science. 1998;279:873–876. doi: 10.1126/science.279.5352.873. [DOI] [PubMed] [Google Scholar]
  • 130.Cesini S, Lange C, Xiang Z, Crabtree JE, Ghiara P, Borodovsky M, Rappuoli R, Covacci A. cag a pathogenicity island of Helicobacter pylori encodes type-I specific and disease-associated virulence factors. Proc Natl Acad Sci USA. 1996;93:14648–14653. doi: 10.1073/pnas.93.25.14648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131.Lauro FM, McDougald D, Thomas T, Williams TJ, Egan S, Rice S, DeMaere MZ, Ting L, Ertan H, Johnson J, Ferriera S, Lapidus A, Anderson I, Kyrpides N, Munk AC, Detter C, Han CS, Brown MV, Robb FT, Kjelleberg S, Cavicchioli R. The genomic basis of trophic strategy in marine bacteria. Proc Natl Acad Sci USA. 2009;106:15527–15533. doi: 10.1073/pnas.0903507106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Dufresne A, Garczarek L, Partensky F. Accelerated evolution associated with genome reduction in a free-living prokaryote. Genome Biol. 2005;6:R14. doi: 10.1186/gb-2005-6-2-r14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 133.Garcia-Fernandez JM, de Marsac NT, Diez J. Streamlined regulation and gene loss as adaptive mechanisms in Prochlorococcus for optimized nitrogen utilization in oligotrophic environments. Microbiol Mol Biol Rev. 2004;68:630–638. doi: 10.1128/MMBR.68.4.630-638.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Martiny AC, Coleman M L, Chisholm SW. Phosphate acquisition genes in Prochlorococcus ecotypes evidence for genome-wide adaptation. Proc Natl Acad Sci USA. 2006;103:12552–12557. doi: 10.1073/pnas.0601301103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Rocap G, Larimer FW, Lamerdin J, Malfatti S, Chain P, Ahlgren NA, Arellano A, Coleman M, Hauser L, Hess WR, Johnson ZI, Land M, Lindell D, Post AF, Regala W, Shah M, Shaw SL, Steglich C, Sullivan MB, Ting CS, Tolonen A, Webb EA, Zinser ER, Chisholm SW. Genome divergence in two Prochlorococcus ecotypes reflects oceanic niche differentiation. Nature. 2003;424:1042–1047. doi: 10.1038/nature01947. [DOI] [PubMed] [Google Scholar]
  • 136.Dufresne A, Salanoubat M, Partensky F, Artiguenave F, Axmann IM, Barbe V, Duprat S, Galperin MY, Koonin EV, Le Gall F, Makarova KS, Ostrowski M, Oztas S, Robert C, Rogozin IB, Scanlan DJ, Tandeau de Marsac N, Weissenbach J, Wincker P, Wolf YI, Hess WR. Genome sequence of the cyanobacterium Prochlorococcus marinus SS120 a nearly minimal oxyphototrophic genome. Proc Natl Acad Sci USA. 2003;100:10020–10025. doi: 10.1073/pnas.1733211100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.Marais GA, Calteau A, Tenaillon O. Mutation rate and genome reduction in endosymbiotic and free-living bacteria. Genetica. 2008;134:205–210. doi: 10.1007/s10709-007-9226-6. [DOI] [PubMed] [Google Scholar]
  • 138.Paul S, Dutta A, Bag S K , Das S, Dutta C. Distinct ecotype-specific genome and proteome signatures in the marine cyanobacteria Prochlorococcus. BMC Genomics. 2010;11:103. doi: 10.1186/1471-2164-11-103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Lambros RJ, Mortimer JR, Forsdyke DR. Optimum growth temperature and the base composition of open reading frames in prokaryotes. Extremophiles. 2003;7:443–450. doi: 10.1007/s00792-003-0353-4. [DOI] [PubMed] [Google Scholar]
  • 140.Musto H, Naya H, Zavala A, Romero H, Alvarez-Valin F, Bernardi G. Correlations between genomic GC levels and optimal growth temperatures in prokaryotes. FEBS Lett . 2004;573:73–77. doi: 10.1016/j.febslet.2004.07.056. [DOI] [PubMed] [Google Scholar]
  • 141.Galtier N, Lobry JR. Relationships between genomic G+C content, RNA secondary structures and optimal growth temperature in prokaryotes. J Mol Evol . 1997;44:632–636. doi: 10.1007/pl00006186. [DOI] [PubMed] [Google Scholar]
  • 142.Galtier N, Tourasse N, Gouy M. A nonhyperthermophilic common ancestor to extant life forms. Science. 1999;283:220–221. doi: 10.1126/science.283.5399.220. [DOI] [PubMed] [Google Scholar]
  • 143.Hurst LD, Merchant AR. High guanine-cytosine content is not an adaptation to high temperature a comparative analysis amongst prokaryotes. Proc Biol Sci . 2001;268:493–497. doi: 10.1098/rspb.2000.1397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 144.Kawashima T, Amano N, Koike H, Makino S, Higuchi S, Kawashima-Ohya Y, Watanabe K, Yamazaki M, Kanehori K, Kawamoto T, Nunoshiba T, Yamamoto Y, Aramaki H, Makino K, Suzuki M. Archaeal adaptation to higher temperatures revealed by genomic sequence of Thermoplasma volcanium. Proc Natl Acad Sci USA. 2000;97:14257–14262. doi: 10.1073/pnas.97.26.14257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145.Nakashima H, Fukuchi S, Nishikawa K. Compositional changes in RNA DNA and proteins for bacterial adaptation to higher and lower temperatures. J Biochem . 2003;133:507–513. doi: 10.1093/jb/mvg067. [DOI] [PubMed] [Google Scholar]
  • 146.Montanucci L, Fariselli P, Martelli PL, Casadio R. Predicting protein thermostability changes from sequence upon multiple mutations. Bioinformatics . 2008;24:i190–195. doi: 10.1093/bioinformatics/btn166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Paz A, Mester D, Baca I, Nevo E, Korol A. Adaptive role of increased frequency of polypurine tracts in mRNA sequences of thermophilic prokaryotes. Proc Natl Acad Sci USA . 2004;101:2951–2956. doi: 10.1073/pnas.0308594100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 148.Zeldovich KB, Berezovsky IN, Shakhnovich EI. Protein and DNA sequence determinants of thermophilic adaptation. PLoS Comput Biol . 2007;3:62–72. doi: 10.1371/journal.pcbi.0030005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 149.Kennedy S P, Ng W V, Salzberg S L, Hood L, DasSarma S. Understanding the adaptation of Halobacterium species NRC-1 to its extreme environment through computational analysis of its genome sequence. Genome Res . 2001;11:1641–1650. doi: 10.1101/gr.190201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 150.Soppa J. From genomes to function haloarchaea as model organisms. Microbiology . 2006;152:585–590. doi: 10.1099/mic.0.28504-0. [DOI] [PubMed] [Google Scholar]
  • 151.Bolhuis H, Palm P, Wende A, Falb M, Rampp M, Rodriguez-Valera F, Pfeiffer F, Oesterhelt D. The genome of the square archaeon Haloquadratum walsbyi : life at the limits of water activity. BMC Genomics . 2006;7:169. doi: 10.1186/1471-2164-7-169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 152.Misra V K, Honig B. The electrostatic contribution to the B to Z transition of DNA. Biochemistry. 1996;35:1115–1124. doi: 10.1021/bi951463y. [DOI] [PubMed] [Google Scholar]

Articles from Current Genomics are provided here courtesy of Bentham Science Publishers

RESOURCES