Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2007 May 9;104(20):8385–8390. doi: 10.1073/pnas.0701652104

The neoselectionist theory of genome evolution

Giorgio Bernardi 1,*
PMCID: PMC1866311  PMID: 17494746

Abstract

The vertebrate genome is a mosaic of GC-poor and GC-rich isochores, megabase-sized DNA regions of fairly homogeneous base composition that differ in relative amount, gene density, gene expression, replication timing, and recombination frequency. At the emergence of warm-blooded vertebrates, the gene-rich, moderately GC-rich isochores of the cold-blooded ancestors underwent a GC increase. This increase was similar in mammals and birds and was maintained during the evolution of mammalian and avian orders. Neither the GC increase nor its conservation can be accounted for by the random fixation of neutral or nearly neutral single-nucleotide changes (i.e., the vast majority of nucleotide substitutions) or by a biased gene conversion process occurring at random genome locations. Both phenomena can be explained, however, by the neoselectionist theory of genome evolution that is presented here. This theory fully accepts Ohta's nearly neutral view of point mutations but proposes in addition (i) that the AT-biased mutational input present in vertebrates pushes some DNA regions below a certain GC threshold; (ii) that these lower GC levels cause regional changes in chromatin structure that lead to deleterious effects on replication and transcription; and (iii) that the carriers of these changes undergo negative (purifying) selection, the final result being a compositional conservation of the original isochore pattern in the surviving population. Negative selection may also largely explain the GC increase accompanying the emergence of warm-blooded vertebrates. In conclusion, the neoselectionist theory not only provides a solution to the neutralist/selectionist debate but also introduces an epigenomic component in genome evolution.

Keywords: base composition, isochores, nearly neutral theory, neutral theory


The most famous sentence from The Origin of Species (1), “This preservation of favourable variations and the rejection of injurious variations I call Natural Selection,” suggests a dichotomy in the fate of “variations” and is generally interpreted accordingly. This sentence was immediately followed, however, by another one that still is only exceptionally quoted: “Variations neither useful nor injurious would not be affected by natural selection.” I.e., Darwin distinguished not two, but three kinds of variations: advantageous, deleterious, and neutral. Whereas advantageous variations expand in the progeny (by positive, or Darwinian, selection), the deleterious ones tend to disappear (by negative, or purifying, selection), and the neutral ones may come out of their limbo to be fixed (like the advantageous variations), or to disappear (like the deleterious ones). Incidentally, the concept of neutral variations is absent in Wallace (2).

Neutral variations were simply obliterated by the selectionists (the neo-Darwinians) (3, 4), although not by all of them (see ref. 5). They were, however, resurrected by the neutral theory, which broke the long predominance of the selectionist theories (see ref. 6). Indeed, Kimura (7, 8) claimed that “the main cause of evolutionary change at the molecular level—changes in the genetic material itself—is random fixation of selectively neutral or nearly neutral mutants.” Darwin's “survival of the fittest” was replaced by Kimura's “survival of the luckiest,” and Darwinian and neo-Darwinian evolution were substituted by “non-Darwinian evolution” (9). Kimura's revolutionary proposal started a neutralist/selectionist debate, which is still going on (see ref. 10) and which concerns the important issue of the role of chance in evolution. A very significant modification of the neutral theory was the nearly neutral theory of Ohta (11, 12), who proposed that a substantial fraction of changes are caused by random fixation of nearly neutral changes, a class that “includes intermediates between neutral and advantageous, as well as between neutral and deleterious classes” (13). The first four schemes of Fig. 1 display a qualitative picture of the classical theories just mentioned.

Fig. 1.

Fig. 1.

Darwin postulated the existence of deleterious, advantageous, and neutral changes. The neo-Darwinians (or selectionists) neglected neutral changes. These were reintroduced and amplified by Kimura (7, 8), who developed the neutral theory of evolution (a non-Darwinian evolution, according to ref. 9). The nearly neutral theory was proposed by Ohta (12, 13, 102) to include intermediates between neutral and advantageous, as well as between neutral and deleterious changes. In the neoselectionist theory, the critical changes are responsible for the transition from point mutations to regional changes (modified from refs. 12 and 103).

All classical theories proposed that natural selection acted on the “phenotype” [called the “classical phenotype” by Bernardi and Bernardi (14), to distinguish it from the “genome phenotype” presented below]. Whether visualized through hereditary morphological traits (by Darwin), genetic characters (by the neo-Darwinians), or protein and DNA sequences (by Kimura), the classical phenotype is the result of the expression of single genes or of a small number of genes, and its evolution was seen as essentially due to single-nucleotide mutations.

In contrast, the “neoselectionist theory” arose by comparing the genome phenotypes (14), namely the “compositional patterns” or “compositional landscapes” of whole genomes. The “compositional approach” was initially made possible by a high-resolution analysis (see refs. 1518) of eukaryotic genomes, in which DNA was fractionated by ultracentrifugation in Cs2SO4 density gradients in the presence of sequence-specific ligands (Ag+ or, later, BAMD [bis(acetato-mercuri-methyl)dioxane]). Although experimentally sophisticated, this approach was conceptually simple, because it concerned the frequency of ligand-binding oligonucleotides, a property correlated with the base composition of DNA. This compositional approach was extended by computer work to sequences of genes and whole genomes as soon as they became available.

The compositional approach led to the following: (i) the discovery of compositional compartmentalization, compositional correlations, and compositional evolution; (ii) evidence that genome evolution does not only proceed by point mutations, but also by regional changes; and (iii) the development of the neoselectionist theory of genome evolution. The three points under (i) will be summarized in the sections below by using our results on the vertebrate genome.

Compositional Compartmentalization: Isochores and Chromosomal Bands, and Gene Distribution

The genomes of mammals are compositionally compartmentalized (16), in that they are “mosaics of isochores” (18), fairly homogeneous regions, originally estimated as longer than 300 kb (the upper limit that could be measured by the approach used at that time; see previous section). These regions could be assigned to five compositionally narrow families that cover a very wide GC range (34–59% GC in the human genome). This discontinuous compartmentalization was in sharp contrast with the then-predominant view of a continuous compositional change of DNA along chromosomes.

Recent results (19) have confirmed these conclusions at the sequence level by mapping isochores on human chromosomes and by providing information on the number (≈3,200), the average size (≈1 Mb), and the GC levels of isochores. The standard deviations of GC levels are ≈1% GC within isochores covering 85% of the genome, and ≈2% GC in the remaining, mostly GC-rich, isochores. If isochores are pooled in 1% GC bins, their distribution confirms (19) that they belong in the five families previously described (20) (see Fig. 2). Isochores are, in fact, the ultimate chromosomal bands because below the 100 kb level the GC profiles of chromosomes (especially of the GC-richest regions) are characterized by large fluctuations due to coding sequences, introns, interspersed repeats, etc. Interestingly, the number of isochores is close to that, 3,000, of the highest-resolution bands observed by Yunis et al. (21) in early prophase, and the average size of an isochore is equal to the average size of a “site of replication” as defined by Ma et al. (22).

Fig. 2.

Fig. 2.

DNA and gene distribution in the isochore families of the human genome. The major structural and functional properties associated with each gene space are listed (in blue for the genome desert and in red for the genome core). The top frames are modified from M. Costantini, F. Auletta, and G.B. (unpublished data). SINEs, short interspersed sequences; LINEs, long interspersed sequences.

It is generally accepted that the molecular basis of cytogenetic bands is not well understood. Isochores do allow one, however, to define the standard Giemsa and reverse bands at a 850- or 400-band resolution on purely compositional grounds (23), thus ending the debate on whether the chromosomal bands are due to DNA, as thought by Caspersson (see ref. 24), or to the associated proteins (25). In addition, the isochore map allowed us to detect a nested structure that concerns not only contiguous isochores but also contiguous high-resolution bands (23).

The assessment of gene density in compositional DNA fractions (see Fig. 2) led to the discovery (20, 26, 27) that genes are not uniformly distributed in the mammalian genome, contrary to what was previously thought. Indeed, in the human genome almost two-thirds of the protein-coding genes are concentrated in the GC-richest isochore families H2 and H3 (only representing 15% of the genome), which was called the “genome core” (28, 29). The rest is spread over the vast (≈85%) GC-poor “empty space,” or “empty quarter” (from the name of the Arabian desert), eventually called the “genome desert” (3032), namely the GC-poor isochore families L1, L2, and H1. These two “gene spaces” (30) are different not only in gene density but also in a number of other basic properties, which are summarized in Fig. 2. Recent data have revealed that the isochore map (19) matches the replicon cluster map of Watanabe et al. (33), and that both isochore size and average GC of isochore families are conserved in vertebrates (M. Costantini, F. Auletta, and G.B., unpublished data), reinforcing the concept that isochores represent “a fundamental level of genome organization” (34).

Compositional Correlations: The Genomic Code

Positive compositional correlations were found to hold between coding and contiguous noncoding sequences, both intergenic and intragenic (20). Moreover, GC1, GC2, and GC3 (the GC levels of the three codon positions) were shown to be correlated with each other (14, 35). Such general rules were called (36) “genomic code” (not to be confused with the “genetic code”), a definition later extended to the correlations between the GC levels of the three codon positions with amino acid composition (37), hydrophobicity (38), and secondary structures of the encoded proteins (39, 40). Interestingly, the genomic code allows predicting the composition of flanking sequences and introns from the composition of coding sequences (and vice versa), as well as predicting protein properties from the composition of the corresponding coding sequences (and vice versa). Along the same line, plots of GC2 vs. GC3 revealed clusters of protein-coding genes corresponding to each isochore family and forming “gene landscapes” (41) that have led to simple proteomic checks for detecting noncoding RNA (42). Needless to say, all these results provided strong evidence for the genome as an integrated ensemble with little or no room left for what Ohno called “junk DNA” (43).

Compositional Evolution: Genome Phenotypes, and the Transitional and Conservative Modes of Evolution

In vertebrates, the compositional patterns of isochores and of coding sequences, as well as the GC3 levels of orthologous coding sequences, were found to be strikingly different between fish/amphibians on the one hand and mammals/birds on the other, reptiles showing intermediate patterns (17, 4447). More precisely, at the transition between cold- and warm-blooded vertebrates, the moderately GC-rich ancestral genome core of cold-blooded vertebrates underwent a remarkable GC increase to become the genome core of mammals and birds (48) (see Fig. 3). This implied the existence of a “transitional” (or “shifting”) “mode” of evolution that took place independently in the ancestral genomes of mammals and birds, and yet led to similar isochore patterns that were maintained in mammalian and avian genomes (49, 50). In contrast, the genome desert did not undergo any major compositional change (48), except for the decrease of methylcytosine and CpG, which affects the whole genome (see below).

Fig. 3.

Fig. 3.

Scheme of the compositional evolution of vertebrate genomes. At the transition from cold- to warm-blooded vertebrates, the gene-dense, moderately GC-rich “ancestral genome core” (pink box) became the gene-dense, GC-rich genome core (red box), but the GC-poor and gene-poor (blue box) genome desert did not undergo any major compositional change. This transitional (or shifting) mode, which was accompanied by an overall decrease of CpG doublets and methylcytosine, was followed by a conservative mode of genome evolution in which compositional patterns were maintained (modified from ref. 32).

Compositional differences among genomes from vertebrates belonging to the same classes (fishes, amphibians, etc.) could, however, be observed as the result of “whole genome (or horizontal) shifts” (5153). These shifts apparently are the consequence of changes in genome size, in environmental conditions, in mutation rates, and in the intensity and duration of the mutational AT bias, i.e., the predominance of GC→AT vs. AT→GC changes (5463). This bias is likely to be due, as in the case of “mutator mutations” of prokaryotes (64), to mutations in the genes coding for subunits of the replication machinery (65).

The compositional transition between cold- and warm-blooded vertebrates could not be accounted for by the random nucleotide substitutions of the neutral theory (see also below) and required, therefore, another explanation. The “thermodynamic stability hypothesis” (14) proposed that the GC increase of the genome core accompanied the emergence of homeothermy and simultaneously provided the selective advantages of increased stabilities of DNA, RNA, and proteins. Indeed, high GC levels in coding sequences not only increase the stability of DNA and of the stem structures of transcribed RNAs, as demonstrated by the GC increases of corresponding stems of ribosomal RNAs from cold- to warm-blooded vertebrates (66, 67), but also favor amino acids that thermodynamically stabilize proteins, as indicated by observations of Argos et al. (68) and Zuber (69) that were recently confirmed by Nishio et al. (70, 71).

This selectionist explanation was supported (i) by the similar genome changes occurring in the independent lines of mammals and birds; (ii) by the decreases of CpG doublets and methylcytosine, which are correlated with increasing body temperatures in vertebrates ranging from Antarctic fishes to mammals (7276); (iii) by the variable compositional heterogeneity and methylation levels (intermediate between those of fishes/amphibians and mammals/birds) of the genomes from reptiles (51, 47, 77), which are known to have different body temperatures and thermal regulations; (iv) by the mammalian-like isochore organization of the genomes of subtropical and tropical insects (Drosophila, Anopheles) (78, 79); and (v) by the increase of GC levels that accompanies the increase in optimal growth temperatures in many families of prokaryotes (80).

The thermodynamic stability hypothesis stressed the molding effect of the environment on the genome (14). In fact, our use of the old-fashioned distinction between cold- and warm-blooded vertebrates was meant to suggest that the cause of genome changes was “body temperature.” The hypothesis could also explain why, at the emergence of a stable body temperature ≈40°C (homeothermy), the “open chromatin” of the genome core, which has an expanded configuration at the center of the interphase nucleus, required a GC increase to be stabilized, whereas the “closed chromatin” of the GC-poor genome desert did not, being stabilized by its own compact structure packed against the nuclear membrane (81). The different chromatin structure could also explain why retroviral sequences initially integrate in GC-rich regions (see ref. 32 for a review).

The transitional mode of evolution stopped in mammals after they emerged from their common ancestor, suggesting that an equilibrium was reached (the decline of the isochore structure in some mammalian orders, proposed by some authors, still is under debate) (see refs. 33 and 8285). The evidence for this compositionally “conservative mode” of evolution was initially provided by the similarity of the compositional distribution of coding sequences and of GC3 levels of orthologous coding sequences of mammals from different orders (44). Recent results showed that, in addition, the patterns of orthologous (or syntenic) isochores are conserved in different mammalian orders (e.g., Fig. 4), including the human/mouse case in which one of the two genomes, the mouse genome, is characterized by a great acceleration in substitution rate (86, 87). This is an astonishing result in view of the mutational AT bias of vertebrates and of the fact that the species under consideration have been diverging for 90 million years (88). Because the neutral substitution rate has been estimated as 3.9 × 10−9 substitutions per year (10, 89), ≈70% nucleotide substitutions took place. This would approximately correspond to 54% sequence identity if one accounts for multiple and parallel substitutions. Some differences in size and in GC levels between syntenic isochores do exist, but they are largely accounted for by inserted mobile elements, and by the whole-genome shifts mentioned above. Needless to say, the remarkable conservation of both isochore size and average base composition of isochore families in all vertebrates (M. Costantini, F. Auletta, and G.B., unpublished data) adds two very important properties to the general picture just presented.

Fig. 4.

Fig. 4.

An example of isochore conservation between syntenic chromosome regions of dog and human (from M. Costantini, F. Auletta, and G.B., unpublished data).

The conservative mode of evolution was originally explained by “negative selection acting at a regional (isochore) level to eliminate any strong deviation from presumably functionally optimal composition of isochores” (44), a general conclusion supported by Gillespie (90). It should be stressed that the compositional conservation of syntenic isochores (see Fig. 4) not only reinforces the original idea of negative selection but also implies a functional role of noncoding sequences in the genome, which is also suggested by a number of other considerations (see, for example, ref. 91). Objections to selection and alternative hypotheses (in particular the biased gene conversion) (see ref. 84) for the formation and maintenance of the GC-rich isochores of mammals and birds have been discussed in detail elsewhere and shown not to hold (31, 32).

In conclusion, selection appears to account for both the formation and the maintenance of isochores, two phenomena that cannot be explained by the neutral theory, which, incidentally, was elaborated without any knowledge of isochore patterns. In fact, comparisons of orthologous mammalian genes or proteins (such as those used at that time) unfortunately missed not only the compositional heterogeneity of mammalian genes within the same genome, but also the compositional changes that took place between cold- and warm-blooded vertebrates. However, whereas a random fixation of neutral mutants (or a biased genome conversion) could not generate GC-rich isochores and would erode the isochore pattern over evolutionary time, the majority of mutations per se could only be neutral and nearly neutral. Indeed, in genomes in which noncoding sequences may represent 99% of DNA, no single-nucleotide change could have a strong enough selective advantage or disadvantage, at least in the vast majority of cases. There was, therefore, a need to reconcile the neutrality or near-neutrality of the point mutation process with selection at the regional level.

Neoselectionist Theory of Evolution

Let us consider now the conservative mode and follow the evolution of a gene-dense, GC-rich region from a warm-blooded vertebrate. Initially, the region is assumed to be at a compositionally acceptable GC level (Fig. 5). Because of the local accumulation of AT-biased mutations, some regions will drift toward lower GC values over evolutionary time. This ratchet process goes on until a certain GC threshold is trespassed (we called “critical changes” the changes responsible for trespassing the threshold). This leads to a regional change in chromatin structure that can spread (see refs. 92 and 93 for the spreading of silent chromatin) and interfere with DNA replication. Needless to say, the regional changes in chromatin structure would also have deleterious effects on the expression of genes in the region (see ref. 94 for chromatin structure control on gene expression and ref. 95 for insulators). The end result of this “epigenomic process” is negative selection on the carriers of such deleterious changes, the survivors showing a conserved isochore pattern. Interestingly, the proposed selection process does not create an impossible “genetic load” (96), as is the case for the selectionist theory of the neo-Darwinians, and does not substantially increase the “cost of natural selection” (97) because of its low frequency. Incidentally, the development of the model of Fig. 5 and some pioneering views along similar lines (91, 98) were described elsewhere (32).

Fig. 5.

Fig. 5.

Time course of typical compositional changes of a GC-rich region from a warm-blooded vertebrate in the conservative mode of evolution. In an early phase, the average GC level of the region, initially visualized at its compositional optimum (arbitrarily set here at 54% GC), is decreasing because of the mutational AT bias (the vertical blue bars crossing the black DNA line represent the “excess” GC→AT changes), but remains within a tolerated range (whose arbitrary thresholds are indicated by the thick horizontal broken lines). In a late phase, the average GC level trespasses the lower threshold (arbitrarily fixed here at 52% GC), because of the last changes, the critical changes. The chromatin (red boxes) then undergoes a structural change (broken blue box) that is deleterious for transcription and replication (see text). Until then, the changes may be neutral or, more probably, nearly neutral (modified from ref. 32).

To sum up, the essential points of the model are the deleterious chromatin changes, and the negative selection of the carriers of such changes. At present, the best explanations for initiating the regional chromatin changes are the compositional drift due to the AT bias and the trespassing of a lower GC threshold. It is possible, however, to think about other ways that allow one to reach the crucial changes in chromatin structure. For example, the insertion of a retroviral sequence or a mobile element might conceivably replace nucleotide substitutions as critical changes.

We will now consider in more detail some features of the model. (i) First, until the threshold is reached, the mutational process essentially consists of neutral or nearly neutral changes. (ii) After the threshold is trespassed and chromatin changes have occurred, the effects will be more or less deleterious according to the affected genome regions and the extent of the epigenomic changes. (iii) The lowered GC level of a region may be revealed by a comparison with a syntenic region from another species or another population, so providing a test for the theory. (iv) The time required for the actual disappearance of the carriers of epigenomic changes from the population will depend on the severity of the deleterious changes, and should be comparable with the time needed for the elimination of classical deleterious changes in coding and regulatory sequences located in the same regions. (v) The deleterious effects of chromatin alteration may also concern physiological chromatin remodeling. For instance, in the course of spermatogenesis, chromatin is drastically remodeled when histones are replaced by protamines, the reverse process occurring after fertilization. It is conceivable that regional compositional changes in DNA/chromatin may interfere with this process and cause gamete or zygote death.

The model of Fig. 5 also helps understand the “transitional mode” of genome evolution (Fig. 6) by positing (i) a GC increase of the threshold, which accompanies the increasing body temperature of vertebrates on the way to the higher body temperature of mammals and birds; and (ii) a ratchet mechanism, which only requires negative selection because of the deleterious chromatin changes associated with sequences trespassing the threshold. In this case, negative selection, in combination with an environmental factor (body temperature), far from simply maintaining the “status quo” (and being “boring”) (99) can help genome adaptation (and be “creative”) (100). Two points should be stressed here. The first is that the GC increase process just outlined took a very long time and comprised at least two major phases, involving the compositional transitions of amphibians to reptiles and of reptiles to mammals/birds, respectively. The second point is that the process took place at the level of isochores, the common basic structure of all vertebrate genomes.

Fig. 6.

Fig. 6.

A scheme of the transitional mode of evolution describing the GC increase of a gene-dense DNA region during the emergence of homeothermy. The basic feature is an increase in the GC level of the lower threshold (broken blue line) by a ratchet mechanism, which leads to an increased GC level of the region (pink-to-red lines).

We know, however, that positive selection can also play a role in the transitional mode by increasing GC levels according to the thermodynamic hypothesis. For example, nonsynonymous changes characterized by GC increases are predominant over synonymous changes in many coding sequences and lead to amino acid replacements that favor protein stability in Gillichthys seta, a fish living at temperatures up to 40°C, compared with Gillichthys mirabilis, a congeneric fish living at a lower temperature (G. Bucciarelli, D. Costagliola, and G.B., unpublished data).

In conclusion, the major novelties of the neoselectionist theory are the demonstrations (i) that the compositional compartmentalization, the compositional correlations between coding and noncoding sequences, and the transitional and conservative modes of evolution cannot be accounted for by an AT-biased point mutation process occurring at random locations (nor by a random gene conversion process, for that matter); (ii) that the structural and evolutionary features of the vertebrate genome can only be explained by a regional process; incidentally, the main reason why the neutralist/selectionist debate went on for so long probably was its being focused on the neutrality/nonneutrality of point mutations; and (iii) that such a regional process can only be visualized as due to an epigenomic event such as a change in chromatin structure, which interferes with replication and transcription; the coincidence between isochores and replicon clusters explains why the functional problems may have an isochore dimension.

Needless to say, invoking the participation of chromatin in genome evolution is a paradigm shift that justifies calling the neoselectionist model (32) a theory (101). It should be noted that, strictly speaking, the definition of neoselectionist theory applies only to the “selective mode” of Fig. 5. In this case, the combination of the neoselectionist theory and the nearly neutral theory of Ohta should be called the “neosynthetic theory” (the synthetic theory combined Darwinism and Mendelism). For the sake of simplicity, the definition of neoselectionist theory might be extended, however, to incorporate the nearly neutral theory of Ohta, as depicted in the bottom scheme of Fig. 1, where the critical changes are indicated as the most deleterious ones because they affect whole genome regions instead of individual nucleotides.

It should be stressed that the neutral theory and the neoselectionist theory proposed here lead to very different general views of evolution. Indeed, although Kimura's view that “increases and decreases in the mutant frequencies are due mainly to chance” led him, logically, to the idea of the survival of the luckiest, obviously the control of natural selection on the genome phenotype leads us back to the survival of the fittest. In fact, the neoselectionist theory is an extension of Darwin's theory, and could be defined an “ultra-Darwinian theory,” in that even neutral and nearly neutral changes are eventually controlled by natural selection. Some general implications of the neoselectionist theory concerning the deterministic vs. the stochastic nature of the evolutionary process, the central dogma, and the genetic code will be dealt with elsewhere.

Finally, the neoselectionist theory not only accounts for the organization and evolution of the vertebrate (and eukaryotic) genome but also makes some predictions, the main one being that an organism or a gamete may be endowed with a different degree of “genomic fitness” depending on the different numbers and locations of altered chromatin regions, some of the latter being possibly responsible for “genomic (not genetic) diseases” in which gene expression is not necessarily affected by nucleotide changes in the coding or in the promoter sequences, but by changes in chromatin structure.

Acknowledgments

I thank Tomoko Ohta for important suggestions, Emile Zuckerkandl for innumerable discussions and encouragement, and Fernando Alvarez-Valin, Giacomo Bernardi, Oliver Clay, Bernardino Fantini, and Vittorio Sgaramella for very useful comments.

Footnotes

The author declares no conflict of interest.

This article is a PNAS Direct Submission.

References

  • 1.Darwin C. On the Origin of Species, a Facsimile of the First Edition with an Introduction by Ernst Mayr. Cambridge, MA: Harvard Univ Press; 1859. reprinted (1964) [Google Scholar]
  • 2.Wallace AR. J Proc Linn Soc (Zoology) 1858;3:53–62. [Google Scholar]
  • 3.Fisher RA. The Genetical Theory of Natural Selection. Oxford: Clarendon; 1930. [Google Scholar]
  • 4.Haldane JBS. The Causes of Evolution. London: Longmans, Green, and Co; 1932. [Google Scholar]
  • 5.Wright S. Genetics. 1931;16:97–159. doi: 10.1093/genetics/16.2.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Mayr E. Animal Species and Evolution. Cambridge, MA: Belknap; 1963. [Google Scholar]
  • 7.Kimura M. Nature. 1968;217:624–625. doi: 10.1038/217624a0. [DOI] [PubMed] [Google Scholar]
  • 8.Kimura M. The Neutral Theory of Molecular Evolution. Cambridge, UK: Cambridge Univ Press; 1983. [Google Scholar]
  • 9.King JL, Jukes TH. Science. 1969;164:788–798. doi: 10.1126/science.164.3881.788. [DOI] [PubMed] [Google Scholar]
  • 10.Graur D, Li WH. Fundamentals of Molecular Evolution. 2nd Ed. Sunderland, MA: Sinauer; 2000. [Google Scholar]
  • 11.Ohta T. Nature. 1973;246:96–98. doi: 10.1038/246096a0. [DOI] [PubMed] [Google Scholar]
  • 12.Ohta T. Annu Rev Ecol Syst. 1992;23:263–286. [Google Scholar]
  • 13.Ohta T. Proc Nat Acad Sci USA. 2002;99:16134–16137. doi: 10.1073/pnas.252626899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bernardi G, Bernardi G. J Mol Evol. 1986;24:1–11. doi: 10.1007/BF02099946. [DOI] [PubMed] [Google Scholar]
  • 15.Corneo G, Ginelli E, Soave C, Bernardi G. Biochemistry. 1968;7:4373–4379. doi: 10.1021/bi00852a033. [DOI] [PubMed] [Google Scholar]
  • 16.Filipski J, Thiery JP, Bernardi G. J Mol Biol. 1973;80:177–197. doi: 10.1016/0022-2836(73)90240-4. [DOI] [PubMed] [Google Scholar]
  • 17.Thiery JP, Macaya G, Bernardi G. J Mol Biol. 1976;108:219–235. doi: 10.1016/s0022-2836(76)80104-0. [DOI] [PubMed] [Google Scholar]
  • 18.Macaya G, Thiery JP, Bernardi G. J Mol Biol. 1976;108:237–254. doi: 10.1016/s0022-2836(76)80105-2. [DOI] [PubMed] [Google Scholar]
  • 19.Costantini M, Clay O, Auletta F, Bernardi G. Genome Res. 2006;16:536–541. doi: 10.1101/gr.4910606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bernardi G, Olofsson B, Filipski J, Zerial M, Salinas J, Cuny G, Meunier-Rotival M, Rodier F. Science. 1985;228:953–957. doi: 10.1126/science.4001930. [DOI] [PubMed] [Google Scholar]
  • 21.Yunis JJ, Tsai MY, Willey AM. In: Molecular Structure of Human Chromosomes. Yunis JJ, editor. New York: Academic; 1977. [Google Scholar]
  • 22.Ma H, Samarabandu J, Devdhar RS, Acharya R, Cheng PC, Meng C, Berezney R. J Cell Biol. 1998;143:1415–1425. doi: 10.1083/jcb.143.6.1415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Costantini M, Clay O, Federico C, Saccone S, Auletta F, Bernardi G. Chromosoma. 2007;116:29–40. doi: 10.1007/s00412-006-0078-0. [DOI] [PubMed] [Google Scholar]
  • 24.Caspersson TO. Am J Hum Genet. 1989;44:441–451. [PMC free article] [PubMed] [Google Scholar]
  • 25.Bostock CJ, Sumner AT. The Eukaryotic Chromosome. Amsterdam: North–Holland; 1978. [Google Scholar]
  • 26.Mouchiroud D, D'Onofrio G, Aïssani B, Macaya G, Gautier C, Bernardi G. Gene. 1991;100:181–187. doi: 10.1016/0378-1119(91)90364-h. [DOI] [PubMed] [Google Scholar]
  • 27.Zoubak S, Clay O, Bernardi G. Gene. 1996;174:95–102. doi: 10.1016/0378-1119(96)00393-9. [DOI] [PubMed] [Google Scholar]
  • 28.Bernardi G. In: Chromosomes Today. Sumner AT, Chandley A, editors. Vol 11. London: Chapman and Hall; 1993. pp. 49–60. [Google Scholar]
  • 29.Bernardi G. Annu Rev Genet. 1995;29:445–476. doi: 10.1146/annurev.ge.29.120195.002305. [DOI] [PubMed] [Google Scholar]
  • 30.Bernardi G. Gene. 2000;241:3–17. doi: 10.1016/s0378-1119(99)00485-0. [DOI] [PubMed] [Google Scholar]
  • 31.Bernardi G. Gene. 2001;276:3–13. doi: 10.1016/s0378-1119(01)00644-8. [DOI] [PubMed] [Google Scholar]
  • 32.Bernardi G. Structural and Evolutionary Genomics, Natural Selection in Genome Evolution. Amsterdam: Elsevier; 2004. reprinted (2005) [Google Scholar]
  • 33.Watanabe Y, Fujiyama A, Ichiba Y, Hattori M, Yada T, Sakaki Y, Ikemura T. Hum Mol Genet. 2002;11:13–21. doi: 10.1093/hmg/11.1.13. [DOI] [PubMed] [Google Scholar]
  • 34.Eyre-Walker A, Hurst LD. Nat Rev Genet. 2001;2:549–555. doi: 10.1038/35080577. [DOI] [PubMed] [Google Scholar]
  • 35.D'Onofrio G, Bernardi G. Gene. 1992;110:81–88. doi: 10.1016/0378-1119(92)90447-w. [DOI] [PubMed] [Google Scholar]
  • 36.Bernardi G. Biofutur. 1990;94:43–46. [Google Scholar]
  • 37.D'Onofrio G, Mouchiroud D, Aïssani B, Gautier C, Bernardi G. J Mol Evol. 1991;32:504–510. doi: 10.1007/BF02102652. [DOI] [PubMed] [Google Scholar]
  • 38.D'Onofrio G, Jabbari K, Musto H, Bernardi G. Gene. 1999;238:3–14. doi: 10.1016/s0378-1119(99)00257-7. [DOI] [PubMed] [Google Scholar]
  • 39.Chiusano ML, D'Onofrio G, Alvarez-Valin F, Jabbari K, Colonna G, Bernardi G. Gene. 1999;238:23–31. doi: 10.1016/s0378-1119(99)00258-9. [DOI] [PubMed] [Google Scholar]
  • 40.D'Onofrio G, Ghosh TC, Bernardi G. Gene. 2002;300:179–187. doi: 10.1016/s0378-1119(02)01045-4. [DOI] [PubMed] [Google Scholar]
  • 41.Cruveiller S, Jabbari K, Clay O, Bernardi G. Genome Res. 2004;14:886–892. doi: 10.1101/gr.2246704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Cruveiller S, Clay O, Jabbari K, Bernardi G. Proteomics. 2007;7:361–363. doi: 10.1002/pmic.200600813. [DOI] [PubMed] [Google Scholar]
  • 43.Ohno S. Brookhaven Symp Biol. 1972;23:366–370. [PubMed] [Google Scholar]
  • 44.Bernardi G, Mouchiroud D, Gautier C, Bernardi G. J Mol Evol. 1988;28:7–18. doi: 10.1007/BF02143493. [DOI] [PubMed] [Google Scholar]
  • 45.Aïssani B, Bernardi G. Gene. 1991;106:173–183. doi: 10.1016/0378-1119(91)90197-j. [DOI] [PubMed] [Google Scholar]
  • 46.Aïssani B, Bernardi G. Gene. 1991;106:185–195. doi: 10.1016/0378-1119(91)90198-k. [DOI] [PubMed] [Google Scholar]
  • 47.Hughes S, Clay O, Bernardi G. Gene. 2002;295:323–329. doi: 10.1016/s0378-1119(02)00732-1. [DOI] [PubMed] [Google Scholar]
  • 48.Bernardi G. Mol Biol Evol. 1993;10:186–204. doi: 10.1093/oxfordjournals.molbev.a039994. [DOI] [PubMed] [Google Scholar]
  • 49.Bernardi G. Gene. 2000;259:31–43. doi: 10.1016/s0378-1119(00)00441-8. [DOI] [PubMed] [Google Scholar]
  • 50.Cruveiller S, D'Onofrio G, Bernardi G. Gene. 2000;261:71–83. doi: 10.1016/s0378-1119(00)00520-5. [DOI] [PubMed] [Google Scholar]
  • 51.Bernardi G, Bernardi G. J Mol Evol. 1990;31:265–281. doi: 10.1007/BF02101122. [DOI] [PubMed] [Google Scholar]
  • 52.Bernardi G, Bernardi G. J Mol Evol. 1990;31:282–293. doi: 10.1007/BF02101123. [DOI] [PubMed] [Google Scholar]
  • 53.Bernardi G, Hughes S, Mouchiroud D. J Mol Evol. 1997;44:S41–S51. doi: 10.1007/pl00000051. [DOI] [PubMed] [Google Scholar]
  • 54.Gojobori T, Li WH, Graur D. J Mol Evol. 1982;18:360–369. doi: 10.1007/BF01733904. [DOI] [PubMed] [Google Scholar]
  • 55.Kunkel TA, Alexander PS. J Biol Chem. 1986;261:160–166. [PubMed] [Google Scholar]
  • 56.Eyre-Walker A. Genetics. 1999;152:675–683. doi: 10.1093/genetics/152.2.675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Smith NG, Eyre-Walker A. Mol Biol Evol. 2001;18:982–986. doi: 10.1093/oxfordjournals.molbev.a003899. [DOI] [PubMed] [Google Scholar]
  • 58.Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
  • 59.Alvarez-Valin F, Lamolle G, Bernardi G. Gene. 2002;300:161–168. doi: 10.1016/s0378-1119(02)01043-0. [DOI] [PubMed] [Google Scholar]
  • 60.Arndt PF, Petrov DA, Hwa T. Mol Biol Evol. 2003;20:1887–1896. doi: 10.1093/molbev/msg204. [DOI] [PubMed] [Google Scholar]
  • 61.Arndt PF, Hwa T, Petrov DA. J Mol Evol. 2005;60:748–763. doi: 10.1007/s00239-004-0222-5. [DOI] [PubMed] [Google Scholar]
  • 62.Santini S, Bernardi G. Gene. 2005;346:51–61. doi: 10.1016/j.gene.2004.10.027. [DOI] [PubMed] [Google Scholar]
  • 63.Lipatov M, Arndt PF, Hwa T, Petrov DA. J Mol Evol. 2006;62:168–175. doi: 10.1007/s00239-005-0207-z. [DOI] [PubMed] [Google Scholar]
  • 64.Treffers HP, Spinelli V, Belser NO. Proc Natl Acad Sci USA. 1954;40:1064–1071. doi: 10.1073/pnas.40.11.1064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Cox EC, Yanofsky C. Proc Natl Acad Sci USA. 1967;58:1895–1902. doi: 10.1073/pnas.58.5.1895. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Xia X, Xie Z, Kjer KM. Syst Biol. 2003;52:283–295. doi: 10.1080/10635150390196948. [DOI] [PubMed] [Google Scholar]
  • 67.Wang HC, Xia X, Hickey D. J Mol Evol. 2006;63:120–126. doi: 10.1007/s00239-005-0255-4. [DOI] [PubMed] [Google Scholar]
  • 68.Argos P, Rossmann MG, Grau UM, Zuber A, Franck G, Tratschin JD. Biochemistry. 1979;18:5698–5703. doi: 10.1021/bi00592a028. [DOI] [PubMed] [Google Scholar]
  • 69.Zuber H. In: Structural and Functional Aspects of Enzyme Catalysis. Eggerer H, Huber R, editors. Berlin: Springer; 1981. pp. 114–127. [Google Scholar]
  • 70.Nishio Y, Nakamura Y, Kawarabayasi Y, Usuda Y, Kimura E, Sugimoto S, Matsui K, Yamagishi A, Kikuchi H, Ikeo K, et al. Genome Res. 2003;13:1572–1579. doi: 10.1101/gr.1285603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Nishio Y, Nakamura Y, Usuda Y, Sugimoto S, Matsui K, Kawarabayasi Y, Kikuchi H, Gojobori T, Ikeo K. Mol Biol Evol. 2004;21:1683–1691. doi: 10.1093/molbev/msh175. [DOI] [PubMed] [Google Scholar]
  • 72.Cacciò S, Jabbari K, Matassi G, Guermonprez F, Desgrès J, Bernardi G. Gene. 1997;205:119–124. doi: 10.1016/s0378-1119(97)00560-x. [DOI] [PubMed] [Google Scholar]
  • 73.Jabbari K, Cacciò S, Païs de Barros JP, Desgrès J, Bernardi G. Gene. 1997;205:109–118. doi: 10.1016/s0378-1119(97)00475-7. [DOI] [PubMed] [Google Scholar]
  • 74.Jabbari K, Clay O, Bernardi G. Gene. 2003;317:161–163. doi: 10.1016/s0378-1119(03)00699-1. [DOI] [PubMed] [Google Scholar]
  • 75.Jabbari K, Bernardi G. Gene. 2004;333:179–181. doi: 10.1016/j.gene.2004.02.048. [DOI] [PubMed] [Google Scholar]
  • 76.Varriale A, Bernardi G. Gene. 2006;385:111–121. doi: 10.1016/j.gene.2006.05.031. [DOI] [PubMed] [Google Scholar]
  • 77.Varriale A, Bernardi G. Gene. 2006;385:122–127. doi: 10.1016/j.gene.2006.05.034. [DOI] [PubMed] [Google Scholar]
  • 78.Jabbari K, Bernardi G. Gene. 2000;247:287–292. doi: 10.1016/s0378-1119(00)00128-1. [DOI] [PubMed] [Google Scholar]
  • 79.Jabbari K, Bernardi G. Gene. 2004;333:183–186. doi: 10.1016/j.gene.2004.02.038. [DOI] [PubMed] [Google Scholar]
  • 80.Musto H, Naya H, Zavala A, Romero H, Alvarez-Valin F, Bernardi G. FEBS Lett. 2004;573:73–77. doi: 10.1016/j.febslet.2004.07.056. [DOI] [PubMed] [Google Scholar]
  • 81.Saccone S, Federico C, Bernardi G. Gene. 2002;300:169–178. doi: 10.1016/s0378-1119(02)01038-7. [DOI] [PubMed] [Google Scholar]
  • 82.Alvarez-Valin F, Clay O, Cruveiller S, Bernardi G. Mol Phylogenet Evol. 2004;31:788–793. doi: 10.1016/j.ympev.2004.01.016. [DOI] [PubMed] [Google Scholar]
  • 83.Antezana MA. J Mol Evol. 2005;61:834–836. doi: 10.1007/s00239-005-0158-4. [DOI] [PubMed] [Google Scholar]
  • 84.Duret L, Eyre-Walker A, Galtier N. Gene. 2006;385:71–74. doi: 10.1016/j.gene.2006.04.030. [DOI] [PubMed] [Google Scholar]
  • 85.Gu J, Li WH. Gene. 2006;385:50–56. doi: 10.1016/j.gene.2006.03.026. [DOI] [PubMed] [Google Scholar]
  • 86.Wu CI, Li W. Proc Natl Acad Sci USA. 1985;82:1741–1745. doi: 10.1073/pnas.82.6.1741. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Gu X, Li WH. Mol Phylogenet Evol. 1992;1:211–214. doi: 10.1016/1055-7903(92)90017-b. [DOI] [PubMed] [Google Scholar]
  • 88.van Rheede T, Bastiaans T, Boone DN, Hedges SB, de Jong WW, Madsen O. Mol Biol Evol. 2006;23:587–597. doi: 10.1093/molbev/msj064. [DOI] [PubMed] [Google Scholar]
  • 89.Fleishman SJ, Dagan T, Graur D. Mol Biol Evol. 2003;20:1876–1880. doi: 10.1093/molbev/msg202. [DOI] [PubMed] [Google Scholar]
  • 90.Gillespie JH. The Causes of Molecular Evolution. New York: Oxford Univ Press; 1991. [Google Scholar]
  • 91.Zuckerkandl E. J Mol Evol. 1986;24:12–27. doi: 10.1007/BF02099947. [DOI] [PubMed] [Google Scholar]
  • 92.Talbert PB, Henikoff S. Nat Rev Genet. 2006;7:793–803. doi: 10.1038/nrg1920. [DOI] [PubMed] [Google Scholar]
  • 93.Zuckerkandl E, Cavalli G. Gene. 2007;390:232–242. doi: 10.1016/j.gene.2006.12.001. [DOI] [PubMed] [Google Scholar]
  • 94.Felsenfeld G, Groudine M. Nature. 2003;421:448–453. doi: 10.1038/nature01411. [DOI] [PubMed] [Google Scholar]
  • 95.Gaszner M, Felsenfeld G. Nat Rev Genet. 2006;7:703–713. doi: 10.1038/nrg1925. [DOI] [PubMed] [Google Scholar]
  • 96.Muller HJ. Am J Hum Genet. 1950;2:111–176. [PMC free article] [PubMed] [Google Scholar]
  • 97.Haldane JBS. J Genet. 1957;55:511–524. [Google Scholar]
  • 98.Zuckerkandl E. J Mol Evol. 1975;7:1–57. doi: 10.1007/BF01732178. [DOI] [PubMed] [Google Scholar]
  • 99.Kondrashov AS. Trends Genet. 2000;16:580. [Google Scholar]
  • 100.Schmalhausen II. Factors of Evolution, the Theory of Stabilizing Selection. Philadelphia: Blakiston; 1949. [Google Scholar]
  • 101.Honderich T. The Oxford Companion to Philosophy. Oxford: Oxford Univ Press; 1995. [Google Scholar]
  • 102.Ohta T. J Mol Evol. 1972;1:305–314. [PubMed] [Google Scholar]
  • 103.Bromham L, Penny D. Nat Rev Genet. 2003;4:216–224. doi: 10.1038/nrg1020. [DOI] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES