Abstract
Selfish genes, such as meiotic drive elements, propagate themselves through a population without increasing the fitness of host organisms. X-linked (or Y-linked) meiotic drive elements reduce the transmission of the Y (X) chromosome and skew progeny and population sex ratios, leading to intense conflict among genomic compartments. Drosophila simulans is unusual in having a least three distinct systems of X chromosome meiotic drive. Here, we characterize naturally occurring genetic variation at the Winters sex-ratio driver (Distorter on the X or Dox), its progenitor gene (Mother of Dox or MDox), and its suppressor gene (Not Much Yang or Nmy), which have been previously mapped and characterized. We survey three North American populations as well as 13 globally distributed strains and present molecular polymorphism data at the three loci. We find that all three genes show signatures of selection in North America, judging from levels of polymorphism and skews in the site-frequency spectrum. These signatures likely result from the biased transmission of the driver and selection on the suppressor for the maintenance of equal sex ratios. Coalescent modeling indicates that the timing of selection is more recent than the age of the alleles, suggesting that the driver and suppressor are coevolving under an evolutionary “arms race.” None of the Winters sex-ratio genes are fixed in D. simulans, and at all loci we find ancestral alleles, which lack the gene insertions and exhibit high levels of nucleotide polymorphism compared to the derived alleles. In addition, we find several “null” alleles that have mutations on the derived Dox background, which result in loss of drive function. We discuss the possible causes of the maintenance of presence–absence polymorphism in the Winters sex-ratio genes.
MEIOTIC drive can leave signatures in the genome similar to positive natural selection without increasing the fitness of an organism (Lyttle 1993). Drive elements are preferentially transmitted during meiosis by disrupting the development or function of sperm carrying the homologous chromosome (Zimmering et al. 1970, meiotic drive sensu lato), or by true chromosome segregation defects during meiosis (Sandler and Novitski 1957, meiotic drive sensu stricto; Tao et al. 2007a). While drive elements may arise on any chromosome, sex-linked drivers have higher population invasion probabilities than autosomal drivers and are more easily detected due to their impact on progeny sex ratios (Hurst and Pomiankowski 1991). To survive, a driver must maintain tight linkage with an insensitive target locus lest it drive against itself, a condition ensured by the lack of recombination between sex chromosomes (Charlesworth and Hartl 1978). Because of the impact drive elements have on sex ratios, sex-linked drivers are often referred to as “sex-ratio distorters” and the phenotype of skewed progeny sex ratios is termed “sex-ratio.” The mere transmission advantage of a driver, unless balanced by some detrimental fitness effect or masked by a suppressor, can cause it to sweep through a population in a manner similar to a positively selected mutation (Edwards 1961; Vaz and Carvalho 2004).
Obviously, a complete sweep of a sex-linked driver dooms a male-less (or female-less) population to extinction (Hamilton 1967), and natural selection strongly favors genetic factors that suppress drive and restore Mendelian segregation. Fisher (1930) presented a qualitative argument for the maintenance of an equal sex ratio, which predicts selection on any heritable variant that increases the production of the rarer sex. Fisher's principle has been formalized mathematically and demonstrated empirically (e.g., Bodmer and Edwards 1960; Carvalho et al. 1998). Suppressors have been identified in a wide variety of meiotic drive systems and are predicted to be strongly favored by natural selection for the maintenance of equal sex ratios (reviewed by Jaenike 2001). Furthermore, the evolution of linked enhancer genes may enable drivers to evade suppression, setting off another bout of Fisherian selection for equal sex ratios (Hartl 1975).
Meiotic drive is widespread, with systems identified in mammals, insects, and plants (Jaenike 2001). Drosophila is the most extensively studied insect taxon, and sex-chromosome meiotic drive systems have been identified in more than a dozen species (Jaenike 2001). Cryptic (i.e., suppressed) distorters may be identified when the association between driver and suppressor is lost, such as in hybrids between species or populations that do not share meiotic drive systems (Mercot et al. 1995). The coevolutionary arms race between drivers and suppressors likely contributes to Haldane's rule (the preferential sterility or inviability of heterogametic hybrids) and is a leading explanation for the importance of X-linked loci in causing hybrid male sterility (Frank 1991; Hurst and Pomiankowski 1991; Tao et al. 2007b; Presgraves 2008). Indeed, two recently characterized hybrid male sterility factors are also sex-ratio distorters—direct evidence of a link between meiotic drive and speciation (Tao et al. 2001; Orr and Irving 2005; Phadnis and Orr 2009).
The three X-linked drive systems of Drosophila simulans are genetically distinct and have been termed Paris, Durham, and Winters (Tao et al. 2007a). Here, we focus on the Winters sex-ratio (SR), whose driver and suppressor have been mapped to the gene level and whose molecular and cellular features have been elucidated (Tao et al. 2007a,b). Distortion requires two genes, Distorter on the X (Dox) and Mother of Dox (MDox); Dox is a duplicate copy of MDox (Tao et al. 2007a; Y. Tao, personal communication). The dominant suppressor, Not Much Yang (Nmy), is a retrotransposed copy of Dox on chromosome 3R (Tao et al. 2007b). Nmy likely suppresses Dox through an RNA interference mechanism by forming a double stranded RNA with homology to the distorter RNAs (Tao et al. 2007b). The genes of the Winters sex-ratio are not found in D. melanogaster, which diverged from D. simulans ∼2.3 million years ago (Li et al. 1999). Initial surveys of the genes in the simulans clade indicate that a functional Nmy gene is present in D. mauritiana (Tao et al. 2007b). Thus, the Winters genes are >250,000 years old, the speciation time of D. simulans, D. mauritiana, and D. sechellia (McDermott and Kliman 2008).
Signatures of positive selection have been previously detected at genomic regions linked to Drosophila sex-ratio distorters. However, this study represents the first evidence of selection acting directly on a sex-ratio distorter gene and its suppressor gene. In D. recens, driving X chromosomes show reduced nucleotide and haplotype variability relative to standard (nondriving) X chromosomes, and linkage disequilibrium extends over 130 cM of the driving chromosome (Dyer et al. 2007). The Paris driver has been localized to a pair of duplicated loci 150 kb apart; recent work shows reduced haplotype diversity and linkage disequilibrium between variants associated with drive (Derome et al. 2008). In this study, we characterize patterns of genetic variation in natural populations of North American D. simulans and find signatures of recent and strong positive selection at all three genes of the Winters sex-ratio.
MATERIALS AND METHODS
Population samples:
Samples from three North American populations of D. simulans were examined in this study (supporting information, Table S1). Two sets of isofemale lines were established from Massachusetts in September 2006: Tremont, collected in a backyard grape arbor on Tremont Street in Cambridge (n = 34), and Nicewicz, collected at the Nicewicz family farm in Bolton (n = 12), ∼30 miles west of Cambridge. F1 males were frozen and used for DNA extraction. In addition, a set of isofemale lines collected in Winters, CA, in the summer of 1995 (Begun and Whitley 2000) was kindly donated by Sergey Nuzhdin. We also obtained 13 lines of diverse geographic origins from the Tucson Species Stock Center: 5 African (Madagascar 14021.0251.196, 14021.0251.197; Kenya 14021.0251.199; Congo 14021.0251.184; and South Africa 14021.0251.169), 2 North American (California 14021.0251.194 and unknown 14021.0251.195), 2 European (Scotland 14021.0251.216 and Greece 14021.0251.181), and 4 Oceanian (New Guinea 14021.0251.009, New Zealand 14021.0251.007, Australia 14021.0251.176, and New Caledonia 14021.0251.198). All strains were sampled randomly with respect to sex-ratio phenotype and genotype.
Data collection:
Genomic DNA was extracted from single males using a modified protocol of the Wizard Genomic DNA Purification kit from Promega. From the Massachusetts populations F1 males from wild-caught females were used, and both autosomal alleles were included in our sample. All other stocks are inbred lab lines and the autosomal loci were found to be homozygous. Polymerase chain reaction was performed using Takara LA Taq polymerase according to manufacturer's instructions. Previously published PCR primers for Dox, MDox, and Nmy were used that amplified complete genes as well as flanking sequence (Tao et al. 2007a,b, Figure 1). Internal sequencing primers were used to obtain 2× coverage (forward and reverse reads) for PCR amplicons. Primers were designed using Primer3Plus (Untergasser et al. 2007) and Amplify v. 3.14 (Engels 2005). Sequencing was performed on an ABI3730 capillary sequencer according to manufacturer's protocols. Sequences were edited using Sequencher v. 4.8 (Gene Codes) and aligned by eye with the aid of the bl2seq program of the BLAST package (Tatusova and Madden 1999). Additional editing was performed using BioEdit (Hall 1999). At the Nmy locus, singleton variants that were observed as heterozygous sites in chromatograms were confirmed with repeated PCR and sequencing. Two samples from the Tremont population, T44 and T62, are double heterozygotes at the Nmy locus; both heterozygous sites for these samples feature a singleton variant. Haplotype phase was resolved for these two samples by assuming that each singleton variant arose independently on the lineage with the most frequent haplotype, rather than the less likely case of both mutations having arisen on the same lineage. For each sample we collected 6.2 kb from Dox, 4.5 kb from MDox, and 7.5 kb from Nmy (Figure 1). A total of 1.6 Mb of resequence data was obtained.
Figure 1.—
Regions sequenced of the genes of the Winters sex-ratio. Chromosomal location of the distorter locus and suppressor locus are shown at the top. The two genes of the distorter are Distorter on the X (Dox) and Mother of Dox (MDox). The suppressor gene is called Not Much Yang (Nmy). Dox and MDox are separated by ∼70 kb of DNA sequence on the X chromosome. Triangles indicate the location of the PCR primers used. Arrows indicate direction of transcription of the genes (Tao et al. 2007a,b).
Data analysis:
We calculated population genetic summary statistics using DnaSP (Rozas et al. 2003). The population mutation rate was estimated as the average pairwise diversity, π (Tajima 1983) and Watterson's estimator, θW (Watterson 1975). The site frequency spectrum was summarized by both Tajima's D (Tajima 1989) and Fay and Wu's H (Fay and Wu 2000). To summarize linkage disequilibrium (LD) across each gene, we estimated the statistic, ZnS (Kelly 1997), which is the average pairwise R2 value among all variable sites (Hill and Robertson 1968) and h, the number of haplotypes. To calculate the age of the origin of the genes, we estimated divergence as the average number of nucleotide substitutions, DXY (Nei 1987, equation 10.20). The fit of various summary statistics to the standard neutral model was assessed through coalescent simulations using the observed number of segregating sites, the conservative assumption of no recombination, and 1000 simulations, as implemented in DnaSP. HKA tests were performed using the HKA software (Hudson et al. 1987; Hey 2004); significance was determined from 10,000 coalescent simulations.
Modeling selection:
A Bayesian approach was taken to estimate the time since selection on the Winters sex-ratio genes in each of the three North American populations, using coalescent simulations of neutral variants linked to a site under selection. The simulation has two phases (going forward in time), a complete selective sweep of a new beneficial variant followed by a neutral (recovery) phase. We used a modified version of a computer program by Przeworski (2003), which models the selected phase as the structured coalescent in which recombination between neutral variants and the site under selection is treated analogously to migration between demes (Kaplan et al. 1989). The neutral locus evolves according to the infinite sites model, with population mutation rate, θ = 4NμL (where N is the effective population size, μ is the per-site, per-generation mutation rate, and L is the length of the sequence) and population recombination rate, ρ = 4NrL (where r is the per-site, per-generation recombination rate). Recombination between the neutral and selected sites occurs with rate C = 4NrK (where K is the distance between the closest neutral site and the selected site). The method estimates the posterior probability distribution for the intensity (4Ns) and the time (T) since the completion of the selective sweep using a summary likelihood method, in which the data are summarized by the number of segregating sites (S), number of haplotypes (h), and Tajima's D.
The selection model has the following parameters: N, effective population size; s, the selection coefficient; μ, mutation rate; r, recombination rate; and T, time since fixation of the beneficial variant. The posterior probability distributions for model parameters was generated using a rejection algorithm (Tavare et al. 1997). Briefly, parameter values are sampled from a prior distribution, a genealogy is simulated with the sampled parameters, and S segregating sites are placed randomly onto the genealogy. The data summaries described above are calculated from the simulated genealogy and compared to the summaries from the observed data. Parameter values that generate the observed number of haplotypes and a Tajima's D values within some user-specified interval (ɛ) are accepted and output to the posterior. To capture uncertainty in model parameters, the prior distribution of μ, r, and N are γ-distributed, whereas s is sampled from a uniform prior.
Choice of prior distributions of parameters:
In an effort to ensure that the prior distribution of model parameters accurately reflects neutral variation in North American populations of D. simulans, we calculated the mean θW and ρ (Hudson 1987) for 29 loci on the X and chromosome 3R sequenced in the same Winters, California population (Begun and Whitley 2000, see Table S2). We used γ-distributed priors for N, r, and μ that yielded priors of the model parameters, θ and ρ, with these empirically observed means. We estimated θW and ρ separately for loci on the X vs. 3R and included all variable sites. The empirically estimated mean per site θW and ρ for the X loci are 0.00488 and 0.01947 and for the 3R loci are 0.01029 and 0.08431 (Table S2). The inheritance scalar for the effective size of the X chromosome to that of the autosomes is accounted for in the joint prior probability distributions for θ and ρ. The analysis outputs time scaled in units of 4N generations and that scaling can be considered arbitrary. To avoid confusion, we have reported scaled times in unit of N generations.
Mutation rate was calculated from whole-genome divergence between D. simulans and D. melanogaster. Begun et al. (2007) estimated lineage-specific divergence for D. simulans in 10-kb windows across the entire genome. We calculated μ for each window as the lineage-specific divergence divided by 2.3 MY, the divergence time for D. simulans and D. melanogaster (Li et al. 1999). Assuming 10 generations per year, this calculation gives a median per-site, per-generation mutation rate for chromosomes X and 3R of 1.2 × 10−9 and 1.0 × 10−9, respectively. These estimates are within the range of other estimated mutation rates for Drosophila (Andolfatto and Przeworski 2000), but slightly lower than a commonly used mutation rate on the basis of synonymous sites only (Sharp and Li 1989). If we assume there is a single effective population size for a population, the per-site, per-generation r can be calculated as (ρ*μ)/θW. For the Winters, California population data (Begun and Whitley 2000), we calculated r = 4.8 × 10−9 for the X and r = 8.2 × 10−9 for 3R. The prior distributions of μ for the X and 3R were γ with shape parameter 12 and 10, respectively, and scale parameter 10−10; thus, the means of these distributions are 1.2 × 10−9 and 1.0 × 10−9, respectively (Table 1). The prior distributions of r for the X and 3R were γ-distributed with shape parameter 48 and 82, respectively, and scale parameter 10−10; thus, the means of these distributions are 4.8 × 10−9 and 8.2 × 10−9, respectively. The prior for the selection coefficient, s, was uniform between 5 × 10−4 and 0.5 because we have no biological expectation for the strength of selection on the driver or suppressor genes.
TABLE 1.
Prior distributions of parameters for selection model
|
Dox |
MDox |
Nmy |
|||||||
|---|---|---|---|---|---|---|---|---|---|
| Prior | Mean | 95% density | Prior | Mean | 95% density | Prior | Mean | 95% density | |
| μ | Gam (10−10, 12) | 1.2 × 10−9 | 6.3 × 10−10–2.0 × 10−9 | Gam (10−10, 12) | 1.2 × 10−9 | 6.3 × 10−10–2.0 × 10−9 | Gam (10−10, 10) | 1.0 × 10−9 | 4.8 × 10−10–1.7 × 10−9 |
| r | Gam (10−10, 48) | 4.8 × 10−9 | 3.5 × 10−9–6.3 × 10−9 | Gam (10−10, 48) | 4.8 × 10−9 | 3.5 × 10−9–6.3 × 10−9 | Gam (10−10, 82) | 8.2 × 10−9 | 6.5 × 10−9–10 × 10−9 |
| N | Gam (4 × 104, 25) | 1.0 × 106 | 6.5 × 105–1.4 × 106 | Gam (4 × 104, 25) | 1.0 × 106 | 6.5 × 105–1.4 × 106 | Gam (1 × 105, 25) | 2.5 × 106 | 1.6 × 106–3.7 × 106 |
| s | U (5 × 10−4, 0.5) | — | — | U (5 × 10−4, 0.5) | — | — | U (5 × 10−4, 0.5) | — | — |
| θ | — | 30 | 4.9–110 | — | 22 | 3.6–80 | — | 75 | 9.7–320 |
| ρ |
— |
120 |
35–330 |
— |
87 |
25–240 |
— |
610 |
160–1540 |
μ, per-site mutation rate; r, per-site recombination rate; N, effective population size; s, selection coefficient; θ, per-locus population mutation parameter; ρ, per-locus population recombination parameter.
We chose to estimate r using a population genetic estimate rather than genetic map data for several reasons. First, recombination rates estimated from genetic maps are systematically higher than rates based on population genetic data (Andolfatto and Przeworski 2000; O'Reilly et al. 2008). While this pattern may be shaped by selection, demographic factors such as population bottlenecks or population expansions may also increase levels of LD in natural populations (Stumpf and McVean 2003). Second, recombination rate in Drosophila is sensitive to maternal age, temperature, and genetic background, and recombination estimates in laboratory stocks do not take into account these biological factors (Ashburner et al. 2005). Third, our use of the lower, population-based estimates of recombination is conservative with regard to the estimated strength of selection and timing of selection (i.e., time since selection may be over estimated and strength of selection may be under estimated).
RESULTS
Ancestral alleles observed at all loci:
For each of the three sequenced loci we observe multiple chromosomes that lack the gene insertion, which represents the ancestral state of each locus (Table S1). For convenience we refer to the presence of the gene insertion as the “derived” allele. At the Dox locus, four strains (two from Madagascar, one from New Caledonia, and one from New Zealand) lack the 3833-bp Dox gene insertion; at MDox, four samples lack the 3549-bp gene insertion (two from Madagascar, one from Congo, and one from New Zealand); and at Nmy, two North American samples lack the 2041-bp gene insertion (one individual each from Winters, California and the Tremont population from Massachusetts).
Null mutations at Dox:
Three different alleles at Dox were observed, which have the derived gene insertion but have lost their ability to drive (see Figure 2). The wild-type allele is the functional distorter Dox and is present in 75% of the sampled lines (n = 53). A previously characterized null allele dox[del105] is present in three copies (4%) (Tao et al. 2007a). This allele has a 105-bp deletion overlapping intron 2 and exon 3, which removes a region that is critical for distortion. Ten samples (14%) have the dox[del150] null allele, which has a total of 150 bp deleted in exon 4, including one large 135-bp deletion and two smaller deletions of 12 bp and 3 bp. We found a single copy of dox[del585], which shares the exon 4 deletions with dox[del150] but has an additional 435-bp deletion spanning exon 1 and intron 1. We tested the ability of dox[del150] and dox[del105] to distort sex ratios in a nonsupressing nmy background, where nmy is a loss-of-function mutant of the Nmy gene (Tao et al. 2007b). These crosses yielded progeny with equal sex ratios (see Table S3 and Figure S1). We assume that dox[del585] is a loss-of-function mutant because it shares the dox[del150] deletions in addition to the large deletion in exon 1.
Figure 2.—
Predicted exon structure of the loss-of-function mutants at Dox. The allele name is followed by the frequency of the mutant in the total pooled sample. Exons are illustrated as shaded boxes; deletions are shown as dashed lines.
Insertion–deletion polymorphism:
Insertion–deletion (indel) polymorphisms at the Dox locus were already discussed in the context of loss-of-function mutations. At MDox, we observe one copy of MDox[del105], which has a 105-bp deletion that spans exons 2 and 3, one copy of MDox[ins135], which has a total of 135 bp inserted into exon 3, and one copy of MDox[ins32], which has 32 bp inserted in exon 1. The functional implications of these mutations are not known. In some cases, the same indel polymorphisms were observed at Dox and MDox, and evidently derive from gene conversion between the two paralogs (see next section). In addition to indel polymorphism in the MDox gene sequence, we observe variable numbers and lengths of the 360-bp repetitive elements that flank the MDox gene (Tao et al. 2007a). (Copies of this element also flank the Dox gene and may facilitate gene conversion between the paralogs). The New Zealand and Kenyan samples have an additional full-length repeat element 5′ of the MDox gene, and one of the Madagascar samples (14021.0251.197) is missing the two 3′ repeat elements. At Nmy, three samples (two from Madagascar and one from Congo) have a 6-bp insertion in one of the inverted repeats necessary for suppression by Nmy; we refer to this allele as Nmy[ins6]. Two of these three samples (the Congolese sample and one Madagascar sample, 14021.0251.196) also have a 201-bp insertion adjacent to a deletion of 77 bp between the inverted repeats, which is in the putative loop region of the RNA secondary structure (Tao et al. 2007a). The functional implications of these mutations at Nmy are not known.
Nucleotide polymorphism and divergence:
Estimates of nucleotide polymorphism for the full data set at all three genes are relatively low, but not unusually so compared to other data sets for D. simulans (Begun and Whitley 2000; Andolfatto 2001). Importantly, the derived alleles have highly reduced levels of nucleotide polymorphism compared to ancestral alleles (Table 2). Derived alleles have 2.22% of the ancestral allele diversity at Dox when measured as π (4.38% when measured as θW), and the corresponding parameters estimated for MDox are 0.99% (4.62%) and for Nmy 2.29% (14.65%). To test the significance of this reduced diversity, we performed association tests between the number of chromosomes sampled and the number of segregating sites among derived vs. ancestral alleles. These results were significant for all three loci (Dox, χ2 = 92.2, P < 0.00001; MDox, χ2 = 103, P < 0.00001; and Nmy, χ2 = 71.1, P < 0.00001).
TABLE 2.
Population genetic summary statistics
| n (nanc) | L | S | π | θW | h | ZnS | TD | FWH | |
|---|---|---|---|---|---|---|---|---|---|
| Dox | |||||||||
| All data | 71 | 2342 | 155 | 0.00509 | 0.01396 | 6*** | 0.52** | −2.19*** | −25.95 |
| Population | |||||||||
| Nicewicz | 12 (0) | 5521 | 19 | 0.00057 | 0.00114 | 4* | 0.80** | −2.17** | — |
| Tremont | 34 (0) | 5956 | 12 | 0.00045 | 0.00049 | 7 | 0.35 | −0.29 | 0.06 |
| Winters | 12 (0) | 60601 | 8 | 0.0004 | 0.00044 | 3 | 0.76* | −0.34 | — |
| Africa | 5 (2) | 2343 | 116 | 0.02945 | 0.02376 | 3* | 0.97** | 1.82* | −1.20 |
| Allele | |||||||||
| Derived | 67 | 5511 | 22 | 0.00044 | 0.00084 | 8* | 0.31 | −1.48* | 0.03 |
| Ancestral | 4 | 2388 | 84 | 0.01982 | 0.01919 | 4 | 0.5 | 0.35 | 2.33 |
| MDox | |||||||||
| All data | 69 | 2788 | 118 | 0.0023 | 0.00918 | 10*** | 0.32* | −2.58*** | −24.57 |
| Population | |||||||||
| Nicewicz | 12 (0) | 4401 | 12 | 0.00045 | 0.0009 | 3* | 0.83* | −2.09*** | 0.15 |
| Tremont | 33 (0) | 4507 | 9 | 0.00017 | 0.00049 | 5 | 0.25 | −1.98** | 0.18 |
| Winters | 12 (0) | 4508 | 1 | 0.00004 | 0.00007 | 2 | — | −1.14* | 0.15 |
| Africa | 5 (3) | 2788 | 103 | 0.01772 | 0.01859 | 4 | 0.37 | −0.36 | 6.40 |
| Allele | |||||||||
| Derived | 65 | 4400 | 18 | 0.00018 | 0.00086 | 8 | 0.26 | −2.40*** | 0.15 |
| Ancestral | 4 | 2815 | 92 | 0.01812 | 0.0186 | 4 | 0.41 | −0.27 | 4.33 |
| Nmy | |||||||||
| All data | 115 | 5335 | 155 | 0.0009 | 0.00553 | 11*** | 0.40* | −2.76*** | −91.63*** |
| Population | |||||||||
| Nicewicz | 24 (0) | 7461 | 0 | 0 | 0 | 1 | — | — | — |
| Tremont | 66 (1) | 5403 | 60 | 0.00034 | 0.00233 | 7*** | 0.84*** | −2.88*** | −36.19** |
| Winters | 12 (1) | 5385 | 121 | 0.00374 | 0.00744 | 2*** | 1.00*** | −2.32** | −72.88*** |
| Africa | 5 (0) | 7311 | 60 | 0.00372 | 0.00359 | 4*** | 0.47 | 0.315 | −3.50 |
| Allele | |||||||||
| Derived | 113 | 7310 | 67 | 0.00028 | 0.00179 | 11*** | 0.41** | −2.69*** | −29.72** |
| Ancestral |
2 |
5402 |
66 |
0.01222 |
0.01222 |
2 |
1.00 |
— |
— |
n, number of chromosomes sampled; nanc, number of ancestral alleles present in each population sample; L, total number of sites analyzed, excluding alignment gaps; S, number of segregating sites; h, number of haplotypes; π, average number of pairwise differences (Nei 1987); θW, Watterson's estimator of population diversity (Watterson 1975); ZnS, average pairwise R2 (Kelly 1997); TD, Tajima's D (Tajima 1989); FWH, Fay and Wu's H (Fay and Wu 2000). *P < 0.05; **P < 0.01; ***P < 0.001.
To determine whether the Winters SR genes show signatures of positive selection, we conducted multilocus HKA tests in which we compared polymorphism and divergence at each of the three Winters SR genes (or flanking region, in the case of divergence) in the three North American populations to that of 13 unrelated loci sampled in the same Winters, California population (Table 3). For our “neutral” set of loci, we chose a subset of the 29 loci sampled by Begun and Whitley (2000) that had the largest number of sampled chromosomes (n = 8). The Winters SR genes are predicted to be non-protein coding RNA genes (Tao et al. 2007a,b) so we included all variable sites in our analysis because we cannot restrict our analysis to synonymous sites only, whose evolution is least likely to be influence by non-neutral processes (Andolfatto 2005; Halligan and Keightley 2006). The original Begun and Whitely (2000) study analyzed only synonymous sites, so we reanalyzed all sites in their data in order to directly compare the datasets. A multi-locus HKA test on these 13 loci does not show any departure from neutral expectations (χ2 = 17.99, P < 0.0764, Table 3). However, when we include the Winters SR genes we observe significant deviation from neutral expectations in all but one test (Table 3). We first conducted nine tests where we added data for a single Winters SR locus from a North American population to the 13 Begun and Whitely (2000) loci. All nine tests are significant except when we added Nmy from the Winters population (χ2 = 20.28, P = 0.0903). For the Nmy data, we conducted two additional tests for the Tremont and Winters populations where we excluded the single ancestral allele present in each population. Both of these tests are significant (Winters: χ2 = 59.92, P < 0.0001; Tremont: χ2 = 94.52, P < 0.0001). (Here we report the uncorrected P-value but all tests remain significant at P < 0.0011 after a Bonferonni correction for multiple tests.) If positive selection has acted on the Winters SR genes, we expect to see deviation in the test in the direction of elevated divergence and reduced polymorphism at the Winters SR genes. In five of the 11 tests conducted, the Winters SR gene showed the largest deviation from neutral expectations in both polymorphism and divergence. In the remaining five significant tests, the Winters SR gene had the largest deviation from neutral expectations for divergence but not polymorphism. Moreover, these deviations were in the direction of reduced polymorphism and elevated divergence.
TABLE 3.
HKA Tests
| Gene | Population | Chromosome | L | n | S | Div | χ2 | P-value |
|---|---|---|---|---|---|---|---|---|
| Winters SR data | ||||||||
| Dox | Nicewicz | X | 5521 | 12 | 19 | 0.0567 | 52.51 | <0.0001 |
| Dox | Tremont | X | 5956 | 34 | 12 | 0.0567 | 93.27 | <0.0001 |
| Dox | Winters | X | 6061 | 12 | 8 | 0.0567 | 72.17 | <0.0001 |
| MDox | Nicewicz | X | 4401 | 12 | 12 | 0.0611 | 41.22b | <0.0001 |
| MDox | Tremont | X | 4507 | 33 | 9 | 0.0611 | 59.31 | <0.0001 |
| MDox | Winters | X | 4508 | 12 | 1 | 0.0611 | 49.72 | <0.0001 |
| Nmy | Nicewicz | 3R | 7461 | 24 | 0 | 0.0516 | 77.34 | <0.0001 |
| Nmy | Tremont | 3R | 7460 | 65 | 0 | 0.0516 | 94.52 | <0.0001 |
| Nmy | Winters | 3R | 7461 | 11 | 1 | 0.0516 | 59.92 | <0.0001 |
| NmyAlla | Tremont | 3R | 5403 | 66 | 60 | 0.0513 | 42.85 | <0.0001 |
| NmyAll | Winters | 3R | 5385 | 12 | 121 | 0.0511 | 20.28 | 0.0903 |
| Begun and Whitley (2000) data | ||||||||
| bnb | Winters | X | 1015 | 8 | 11 | 0.0197 | 17.99 | 0.0764 |
| mei-218 | Winters | X | 1187 | 8 | 14 | 0.0687 | ||
| ovo | Winters | X | 1356 | 8 | 9 | 0.0270 | ||
| sn | Winters | X | 1437 | 8 | 28 | 0.0370 | ||
| sog | Winters | X | 1233 | 8 | 8 | 0.0251 | ||
| X | Winters | X | 1425 | 8 | 24 | 0.0281 | ||
| yp3 | Winters | X | 1227 | 8 | 8 | 0.0473 | ||
| AP-50 | Winters | 3R | 1398 | 8 | 58 | 0.0293 | ||
| fzo | Winters | 3R | 1360 | 8 | 22 | 0.0708 | ||
| hyd | Winters | 3R | 1786 | 8 | 26 | 0.0208 | ||
| Osbp | Winters | 3R | 1166 | 8 | 31 | 0.0266 | ||
| ry | Winters | 3R | 1362 | 8 | 54 | 0.0419 | ||
| T-cp1 |
Winters |
3R |
1201 |
8 |
9 |
0.0325 |
||
L, number of bases sequenced in D. simulans; n, number of D. simulans chromosomes sampled; S, number of segregating sites; Div, per-base divergence from D. melanogaster; χ2 and P-values correspond to multilocus HKA tests on 13 loci previously sequenced in D. simulans (bottom) and when data from single Winters SR genes for each North American population were added to the 13 loci (top). See text for details.
Ancestral alleles were not excluded from the analysis (Nicewicz has no ancestral alleles in sample).
Site-frequency spectrum:
To test for deviation from neutrality in the site-frequency spectrum, we estimated Tajima's D (Tajima 1989) and Fay and Wu's H (Fay and Wu 2000). Tajima's D (TD) is a summary of the folded frequency spectrum and compares two estimates of nucleotide polymorphism, π and θW, yielding a negative value if a locus has an excess of low-frequency variants and a positive value if a locus has an excess of intermediate frequency variants. Fay and Wu's H (FWH) is a summary of the unfolded frequency spectrum and is sensitive to the frequency of derived mutations such that it is negative when there is an excess of high frequency derived variants. We calculated TD and FWH at the Winters SR genes for three partitions of our data: (1) the full data set, (2) each of the three North American populations and five African samples, and (3) the derived and ancestral alleles (Table 2; not significant, NS; test not performed due to lack of appropriate data, NA). In the full data set, we observe significantly negative Tajima's D values at each gene (Dox: −2.19, P < 0.00001; MDox: −2.58, P < 0.00001; and Nmy: −2.76, P < 0.00001). For the North American populations, all but two population samples for which we could conduct tests have significantly negative Tajima's D values (Dox: Nicewicz, −2.17, P = 0.003; Tremont, −0.29, NS; Winters, −0.34, NS; MDox: Nicewicz, −2.09, P < 0.00001; Tremont, −1.98, P = 0.008; Winters, −1.14, P < 0.05; Nmy: Nicewicz, NA; Tremont, −2.88, P < 0.00001; and Winters, −2.32, P = 0.008). The African sample has a significantly positive TD at Dox (1.82, P = 0.001) and TD values close to zero for the other loci (MDox: −0.36, NS; Nmy: 0.32, NS). The derived alleles have a significantly negative TD at all loci (Dox: −1.48, P = 0.041; MDox: −2.40, P < 0.00001; and Nmy: −2.69, P < 0.00001) whereas the ancestral alleles have TD's close to zero (Dox: 0.35, NS; MDox: −0.27, NS; and Nmy: NA). In summary, TD estimates are compatible with positive selection acting at all three Winters SR genes. At each gene, samples including all chromosomes as well as only the derived alleles show significantly negative TD values. Pooling among structured populations results in spuriously negative TD values (Ptak and Przeworski 2002; Hammer et al. 2003). The estimates for individual North American populations minimize this problem (but may not eliminate it as the geographic scale of population structure in North American D. simulans is not well understood). For the individual populations, we observe significantly negative TD values for all tests except for Tremont and Winters at Dox. This pattern is not likely to result from demographic forces such as population growth because significantly negative TD values are not observed at any of the reanalyzed Begun and Whitley (2000) loci (Table S2), which were sampled in the same Winters, California population.
For the complete data set, we observe significant FWH at Nmy, and marginal significance at the driver loci (Dox: −25.95, P = 0.067; MDox: −24.57, P = 0.052; and Nmy: −91.63, P < 0.00001). North American populations and samples of derived alleles have significant FWH at Nmy only (Dox: Nicewicz, NA; Tremont, 0.06, NS; Winters, NA; derived, 0.03, NS; MDox: Nicewicz, 0.15, NS; Tremont, 0.18, NS; Winters, 0.15, NS; derived, 0.15, NS; Nmy: Nicewicz, NA; Tremont, −36.19, P = 0.005; and Winters, −72.88, P < 0.00001, derived, −29.72 P = 0.003). None of the tests are significant for the African samples or the ancestral alleles.
Gene conversion between Dox and MDox:
Alignment of the paralogous region of the Dox and MDox loci reveal three gene-conversion tracts. The dox[del150] allele has a sequence motif of three deletions and a cluster of five single nucleotide polymorphisms (SNPs) that is shared with the wild-type MDox haplotype. In addition, we find one MDox haplotype that resembles the wild-type Dox haplotype in that it lacks these same three deletions and the SNP motif. Finally, the 105-bp deletion that characterizes the dox[del105] allele is also found in one MDox haplotype. These gene-conversion tracts were identified by eye and confirmed with the method of Betran et al. (1997) using the DnaSP software.
Linkage disequilibrium:
To test for elevated levels of LD at the Winters SR genes that may result from a selective sweep, we summarized LD as the average pairwise R2 value across each gene, ZnS (Kelly 1997) and tested for a reduction in the number of haplotypes (h) (Nielsen 2005). The results of this test are largely parallel with the estimates of ZnS (Table 2). In the full data set, we observe significantly elevated LD at all three loci (Dox: 0.52 P = 0.003; MDox: 0.32, P = 0.046; Nmy: 0.40, P = 0.01). Six of the 10 populations for which we could calculate ZnS show elevated LD (Dox: Nicewicz, 0.80, P = 0.007; Tremont, 0.35, NS; Winters, 0.76, P = 0.038; Africa, 0.97, P = 0.003; MDox: Nicewicz, 0.83, P = 0.015; Tremont, 0.25, NS; Winters, NA; Africa, 0.37, NS; Nmy: Nicewicz, NA; Tremont, 0.84, P < 0.00001; and Winters, 1.00, P < 0.00001, Africa, 0.47, NS).
Several factors besides selection may increase levels of LD in a sample. These include pooling derived and ancestral alleles (particularly when alleles differ by large genomic insertions that may inhibit recombination), paralogous gene conversion, and pooling samples from different biological populations. To explore these effects, we first calculated ZnS separately for derived and ancestral alleles. The derived alleles at Nmy showed elevated LD (n = 113, ZnS = 0.41, P = 0.005) but we see no significant ZnS values at other loci (Table 2). When we exclude the ancestral alleles in the Tremont and Winters populations at Nmy (no ancestral alleles were observed at Dox or MDox in North America), the signature of LD is no longer evident (Tremont, ZnS = 0.0002, NS; Winters ZnS = NA, no segregating sites), meaning that the elevated LD was caused by the presence of the single divergent ancestral allele. Next, gene conversion between Dox and MDox may have introduced several nonindependent mutations, which will initially be in linkage disequilibrium with each other until the association is eroded by recombination or mutation. We performed a second analysis of LD after encoding all mutations within gene conversion tracts as a single mutation. This reanalysis only differed from our initial analysis in the LD estimates at Dox and MDox, and resulted in no significant LD in either the North American populations or the derived alleles (data not shown). Finally, pooling among subpopulations can result in spuriously high levels of LD (Hartl and Clark 2007). The African samples include several lines from populations that are genetically differentiated from each other (Baudry et al. 2006), which may be the cause of the elevated LD in the full data set at each locus as well as the African sample at Dox. In summary, we do not observe elevated LD in samples of derived alleles in our North American populations after correcting for gene conversion or in the population samples after excluding ancestral alleles.
Age of derived alleles:
To estimate the age of the insertion events that gave rise to the Winters SR genes, the nucleotide divergence between the flanking sequence in the ancestral and derived alleles was calculated at each locus. From the sequence divergence, the age can be estimated as t = d/(2μg) if we assume no inter-allelic recombination (where d is the divergence per site, μ is the per-site, per-generation mutation rate, and g is generation time in years). We used the mutation rates calculated above for the modeling of selection. The per-site divergence between the ancestral and derived alleles for Dox, MDox, and Nmy are 0.0467, 0.0198, and 0.0165, yielding age estimates of 1.96 MY, 832,000 years and 817,000 years, respectively. On the basis of this result, the Dox gene appears to be much older than the other genes. It is possible that the duplication and transposition event that created the Dox gene may also be associated with extensive sequence changes, particularly in the repetitive sequences that flank the gene. A more accurate method of dating the Dox gene insertion is to determine the divergence between Dox and MDox at the gene insertion sequence, which is 0.0206, giving an age of 864,000 years, an estimate that is closer to the estimated ages of the other two genes. At MDox and Dox, we observed no shared polymorphisms and 22 and 77 fixed differences, respectively, between the ancestral and derived alleles. At Nmy, there are 12 shared polymorphism and 45 fixed differences—these shared polymorphisms result from a recombination event in the middle of the sequenced region such that sample T37a has the ancestral haplotype at the Nmy gene and a derived haplotype in the region distal to the gene. The per-site divergence between ancestral and derived alleles in the proximal sequenced region alone is 0.0217, which gives a divergence time of 1.08 MY.
Timing of selection:
To estimate the time since selection on the three genes of the Winters sex-ratio, we implemented a model of a selective sweep followed by a neutral (recovery) phase in each of the three North American populations (Przeworski 2003). We assume the selective sweep was complete and therefore restrict our analysis to the derived alleles at each gene, which leads us to exclude one ancestral Nmy allele from each of the Tremont and Winters populations. We were unable to perform the analysis for the Nicewicz population at the Nmy locus, because only one segregating site is present and Tajima's D could not be calculated. By assuming fixation, we may be upwardly biasing our estimates of the time since selection at Nmy (in North America, Dox and MDox are fixed in our sample, so this is less likely to be a problem at these loci). If ancestral alleles are segregating in the population, recombination between derived and ancestral alleles may introduce mutations onto the derived background, which would make derived alleles seem more diverse, and it would appear that selection occurred longer ago than it actually did. In view of the results actually obtained, therefore, excluding the ancestral Nmy sequences is conservative. In addition, the presence of gene conversion between Dox and MDox is expected to result in conservative estimates of the time since selection. Gene conversion increases the number of segregating sites by introducing multiple nonindependent mutations, thus increasing the length of the recovery phase after selection is complete.
We generate 1000 sets of model parameters that are compatible with our data summaries at each locus. For five of the data sets, we accepted simulated Tajima's D values within ɛ = 0.1 of the observed data. However, three data sets (Dox-Tremont, Dox-Winters, and Nmy-Winters) exhibited low acceptance rates, which led us to increase ɛ to 0.5. The fit of the selection model to the data summaries can be evaluated on the basis of the shape of the posterior distribution for T, the time since the sweep in coalescent time units of N generations (see Figure S2). If the posterior is flat, it suggests that selection is either too old to be detected (i.e., more than 4N generations ago) or else did not occur (Przeworski 2003). On the basis of an effective population size on the order of 1 × 106 years and 10 generations per year, we should be able to detect selection that occurred up to 4 million generations, or 400,000 years ago. All eight data sets are compatible with the model of selection (Figure S2). The median time since selection for Dox and MDox ranges from 0.0304 × N generations to 0.0348 × N generations (Table 4). At Nmy, selection is more recent, with a median time of 0.0068 × N generations for the Tremont population and 0.0164 × N generations for the Winters population. The time since selection in years can be calculated as t = TNg where T is the time in coalescent time units, N is the effective population size, and g is the generation time in years, in this case 0.1, or 10 generations per year. At Dox and MDox, selection occurred ∼3000 years ago, with median times ranging from 2800 years for the Tremont population at MDox to 3500 years ago for the Nicewicz population at Dox (Table 4). Selection in the Tremont population at Nmy is most recent (median time = 1600 years), while in the Winters population at Nmy the median time since selection is 3800 years. Importantly, the 95% credible interval for all eight data sets excludes the origin of the genes more than 250,000 years ago (Tao et al. 2007a). Selection most likely occurred <14,000 years ago, well after the genes of the Winters SR had evolved in the ancestor of the D. simulans clade.
TABLE 4.
Posterior distribution of parameters for selection model
|
Dox |
MDox |
Nmy |
||||
|---|---|---|---|---|---|---|
| Median | 95% C.I. | Median | 95% C.I. | Median | 95% C.I. | |
| Nicewicz | ||||||
| T (N gen) | 0.0348 | 0.0064–0.112 | 0.0308 | 0.0037–0.112 | ||
| T (years) | 3500 | 610–10,000 | 2900 | 330–11,000 | ||
| θ | 28 | 15–49 | 20 | 11–35 | ||
| ρ | 120 | 70–180 | 86 | 52–130 | ||
| s | 0.063 | 0.0019–0.46 | 0.063 | 0.0015–0.46 | ||
| Tremont | ||||||
| T (N gen) | 0.0312 | 0.0072–0.156 | 0.0304 | 0.0040–0.104 | 0.0068 | 0.0020–0.0212 |
| T (years) | 2900 | 760–12,000 | 2800 | 360–9700 | 1600 | 550–4500 |
| θ | 21 | 9.6–43 | 17 | 8.9–32 | 61 | 29–130 |
| ρ | 110 | 63–170 | 83 | 50–130 | 560 | 340–850 |
| s | 0.1 | 0.0023–0.48 | 0.055 | 0.0017–0.47 | 0.28 | 0.014–0.49 |
| Winters | ||||||
| T (N gen) | 0.034 | 0.0035–0.136 | 0.0328 | 0.0040–0.148 | 0.0164 | 0.0032–0.064 |
| T (years) | 3200 | 300–12,000 | 3100 | 400–14,000 | 3800 | 790–14,000 |
| θ | 22 | 11–42 | 18 | 8.5–36 | 59 | 26–120 |
| ρ | 110 | 68–180 | 81 | 49–133 | 570 | 340–890 |
| s |
0.059 |
0.0021–0.48 |
0.26 |
0.021–0.48 |
0.27 |
0.023–0.49 |
T (N gen), time since selection in coalescent time units; T (years), time since selection in years; θ, per-locus population mutation rate; ρ, per-locus population recombination rate; s, selection coefficient; 95% C.I., 95% credible interval.
DISCUSSION
In this study, we characterize patterns of genetic variation in North American populations of D. simulans at the genes of the Winters sex-ratio, one of three X-linked meiotic drive systems in this species (Tao et al. 2007a). We find that the presence of all genes—the distorter locus, Dox, its progenitor gene, MDox, and the suppressor, Nmy—are polymorphic in this species. The frequencies of the ancestral form of the driver loci (the allele that lacks the gene insertion) are highest in African and Oceanian samples, while ancestral Nmy is rare in the North American samples and absent in samples from other geographic localities. We also find evidence of gene conversion between Dox and MDox, the paralogous gene pair responsible for sex-ratio distortion in this system. Finally, we find several loss-of-function mutations on the derived Dox background, consistent with virtually complete suppression of the sex-ratio system in North American populations.
All three genes of the Winters sex-ratio show signatures consistent with recent positive selection. In this context, we use the term “selection” to also include the transmission-ratio advantage of the meiotic drive locus. The evidence for selection is twofold. First, nucleotide variability on the derived allele background is greatly reduced compared to the ancestral allele background (Table 2). Second, all genes show skews in the site-frequency spectrum with an excess of low-frequency variants observed in all genes, and an excess of high-frequency derived variants observed at Nmy. These site-frequency skews are reflected in significant negative Tajima's D and Fay and Wu's H statistics (Table 2). Both of these patterns are consistent with a hitchhiking model where a new mutation rapidly increases in frequency due to natural selection or biased transmission during meiosis. In addition, we find our data to be compatible with a coalescent model of a recent selective sweep at all loci that occurred well after the origins of the genes (Table 4 and Figure 3). In fact, the 95% credible interval for the time since selection at all loci is more recent than the split between D. simulans and D. mauritiana, ∼250,000 years ago (McDermott and Kliman 2008). This result is consistent with the theoretical prediction that meiotic drive systems experience repeated bouts of drive and suppression, and thus multiple rounds of selection (Frank 1991; Hall 2004).
Figure 3.—
Posterior distributions of the time since selection (in years) for a hitchhiking model.
Selection on the Winters sex-ratio is older than on the Paris sex-ratio, the other system in D. simulans that has been extensively studied. Derome et al. (2008) estimated that selection acted on the Paris driver only 88 years ago, on the basis of an analysis of linkage disequilibrium across a region linked to the driver. Our results indicate that selection acted <15,000 years ago, with an average age across loci of 3000 years. Consistent with this estimate, we do not observe elevated linkage disequilibrium in derived gene copies at any of the Winters sex-ratio genes, after correcting for gene conversion. Significant linkage disequilibrium would be a signature of very recent selection. This signal is absent, whereas the signal of reduced polymorphism and skewed site frequencies are evident. At the time that selection was most likely acting on the genes of the Winters sex-ratio, the geographic range of D. simulans was restricted to Africa, the Indian Ocean islands, and Eurasia (Lachaise et al. 1988). North America was likely settled ∼500 years ago during the European colonization of the New World, facilitated by commensalism with humans (Lachaise and Silvain 2004). Interestingly, the most recent round of selection on the Winters SR occurred around the time of the expansion into Eurasia, 6500–5000 years ago (Lachaise and Silvain 2004). Female-biased populations have higher growth rates than populations with even sex ratios (Hamilton 1967), suggesting that the Winters driver and the resulting excess of females may have facilitated the colonization of new habitats. However, due to the large credible intervals of the estimated time since selection, we cannot necessarily exclude the possibility that selection occurred when the species range was restricted to Africa.
Could other evolutionary forces besides selection have caused these departures from neutral patterns? Demographic forces such as population-size changes or population subdivision can have profound effects on genetic variation. However, these factors shape variation across all loci whereas selection targets particular genes or functional regions. Patterns of variation at Dox, MDox, and Nmy are unusual when compared to other loci sampled in North American populations (Begun and Whitley 2000). In all three populations, each gene has either reduced polymorphism or elevated divergence, or both, as evidenced by significant multilocus HKA tests (Table 3). Population growth can result in skews in the site-frequency spectrum similar to what we observed (i.e., an excess of rare variants and negative Tajima's D). However, previous work indicates that populations of D. simulans experienced a population bottleneck during their colonization of the New World (Wall et al. 2002). Recent population bottlenecks are expected to result in an excess of intermediate frequency variants (Wakeley 2009), whereas we observe a dearth in our data. Indeed, the Tajima's D estimates for the Begun and Whitley data (2000) are slightly positive, consistent with a population bottleneck (Table S2). Combined with our detailed knowledge of the function of these genes (Tao et al. 2007a,b), we are confident that the observed departures from neutral equilibrium expectations at the genes of the Winters sex-ratio are due to selection.
If all three genes show signatures of positive selection, why are they not fixed in D. simulans? Even under a simple model of selective neutrality and drift, mutations are not expected to persist beyond an average of 4N generations (or 3N generations for X-linked loci), which is equivalent to ∼400,000 years (300,000 for the X) in the case of D. simulans, assuming 10 generations per year and an effective population size on the order of one million (Hartl and Clark 2007). Four copies of the ancestral distorter alleles were found in African and Oceanian populations and two ancestral suppressors were found in North America. Polymorphism at the suppressor can be explained from a simple model of selection to maintain Fisherian sex ratios. Assume, after Fisher (1930), that the total reproductive value of males and females is equal,
where Wi is the fitness of the ith male, Wj is the fitness of the jth female, and there are m males and f females in the population. If we apportion fitness evenly among individuals of each sex, the fitness of each male is then simply equal to the sex ratio, Wi = f/m. In a female-biased population, members of the “rarer sex” (males) have higher fitness. Under a model where a sex-ratio distorter invades a population and fixes due to its transmission advantage, selection on a new suppressor is negative frequency dependent. At low frequency, a population is female biased and selection on a suppressor for the maintenance of equal sex ratios is strong; but at high frequency, most copies of the distorter are masked, the population sex ratio is close to 50/50, and selection on a suppressor is much weaker. This result explains why selection for Fisherian sex ratios may be inefficient at removing the last few copies of a nonsuppressor allele, even though under a deterministic model, the suppressor will eventually fix (Vaz and Carvalho 2004). In addition, selection is expected to be even less efficient at purging null suppressors if the functional suppressor is dominant, as vanishingly few individuals will express sex-ratio. This verbal model makes many simplifying assumptions such as panmixia, infinite population size, no pleiotropic fitness effects of drivers or distorters, and dominant suppression, but it could nevertheless explain the presence of ancestral Nmy alleles in North American populations that are fixed for the derived allele at both Dox and MDox.
Understanding the presence of null alleles of Dox and MDox is more complex. Under simple, single-population models of sex-chromosome drive, polymorphism between driving (SR) and standard (ST) X chromosomes can result from three conditions (Vaz and Carvalho 2004). First, the transmission advantage of an SR chromosome may be balanced by deleterious effects of either the driving locus itself or linked variants. Experimental work in a variety of Drosophila species indicates that when mated multiple times, SR males suffer reduced fertility as well as reduced sperm competitive ability; these are examples of pleiotropic effects of the drive locus due to reduced sperm production (Jaenike 2001). Linked deleterious mutations may affect either male or female fitness and are common when drivers occur in chromosomal inversions. In D. recens, females homozygous for SR chromosomes have reduced fertility, presumably due to a mutation at an unrelated locus trapped in the large inversion that contains the drive locus (Dyer et al. 2007). The last two conditions for SR/ST polymorphism require the evolution of suppressors by selection for Fisherian sex ratios or genomic conflict, which mask the expression of drive. If suppression is complete (i.e., suppressors are fixed) the meiotic drive system is essentially “dead” and both loci evolve neutrally. If the suppression is partial (i.e., suppressors are polymorphic) polymorphism in the driver may be maintained.
For the Winters sex-ratio, we may argue against an offsetting deleterious effect on the basis of several lines of evidence. First, the distorter is not located within a chromosomal inversion and is unlikely to be associated with linked deleterious variants. Second, theoretical work indicates that SR chromosomes balanced by deleterious effects cannot reach a frequency high enough to skew sex ratios and induce selection for suppressors (Vaz and Carvalho 2004). So the mere presence of Nmy indicates the Dox/MDox is not maintained as a balanced polymorphism. However, rejection of this hypothesis requires careful measurement of the fitness of each genotype. Interestingly, experiments suggest that the fertility of males expressing drive may be reduced relative to that of males with suppressed drivers (Tao et al. 2007b). Although rates of female remating in D. simulans is low (Markow 1996), in female-biased populations, sperm limitation may be an issue for males. One difficulty in testing this hypothesis stems from the fact that small fitness effects may have important consequences in natural populations yet be undetectable in the laboratory.
The partial suppression hypothesis seems unlikely because, although Nmy is not fixed, the frequency of males homozygous for nonsuppressing Nmy is very low. On the basis of the observed allele frequencies in our sample, nonsuppressing males are expected to occur at 0.6% in the Winters population and at 0.02% in the Tremont population. Thus, the neutral explanation seems most likely as it is supported by the presence of loss-of-function mutations on the derived Dox background and the nearly complete suppression of driving chromosomes on the basis of observed allele frequencies in our sample.
Our inability to distinguish among these three hypotheses for the polymorphism in the Winters driver is complicated by the fact that D. simulans violates many assumptions of the simple population-genetic models implicit in the discussion above. The species exhibits high levels of population structure, particularly in Africa (Hamblin and Veuille 1999), and it is possible that the ancestral alleles were sampled in populations that do not exchange migrants with populations that currently harbor the Winters sex-ratio genes. More extensive population sampling of the Madagascar, Congolese, New Caledonia, and New Zealand populations may shed light on this possibility. The possibility of competitive exclusion of the Winters driver by the Paris driver also exists. Notably, the frequency of the Paris driver is highest in central Africa and the Indian Ocean islands (Jutier et al. 2004), where, on the basis of our coarse global sampling, ancestral copies of the Winters drivers are found. Consistent with the competitive exclusion hypothesis, the intensity of drive is higher in the Paris system than in the Winters system, ∼96 vs. ∼81% (Montchamp-Moreau et al. 2006; Tao et al. 2007b). Neither driver appears to be a balanced polymorphism that would limit the spread of the drivers through the population, so differential intensity of drive would in large part determine the frequency of the drivers in the population (Thomson and Feldman 1975). Testing the competitive exclusion hypothesis will require more extensive sampling of the Winters driver, particularly in Africa and the Indian Ocean islands, as well as competition experiments between the two drivers in population cages in the laboratory.
Our analysis indicates that selection is much more recent than the actual origin of the Winters sex-ratio genes ∼850,000 years ago. The date is based on sequence analysis and is consistent with the species distribution of the genes. All are absent in D. melanogaster but preliminary data indicates that the genes are present in D. mauritiana (Tao et al. 2007b; S. Kingan, unpublished data). Moreover, the D. sechellia Y chromosome is sensitive to drive by Dox (Tao et al. 2007a). An old origin but recent selection is suggestive of a genetic “arms-race” model for the evolution of drivers and suppressors, whereby multiple rounds of suppression and distortion occur due to ongoing genetic conflict between the loci (Frank 1991; Hall 2004). In fact, the structure of the driving locus for Winters supports this arms-race model. Dox may have evolved as an enhancer or modifier of an original distorter, most likely MDox, which had been suppressed by an unknown locus or an earlier form of Nmy. The most recent suppressor, Nmy, may then have evolved to suppress the new, compound distorter. This model is testable by substituting chromosomes with a derived MDox and ancestral Dox into a variety of autosomal backgrounds. If drive is observed for some genotypes, it would confirm that MDox was once able to drive alone. In addition, if there is polymorphism in the drive phenotype, one may be able to map the original suppressor of MDox.
The Winters sex-ratio is not the only trans-specific meiotic drive system: in mice, stalk-eyed flies, and Drosophila, shared drive systems are found in multiple closely related species (Jaenike 2001). The genomic conflict that results from a single meiotic drive system can have profound effects on patterns of genomic diversity in multiple species over a period of millions of years. On the molecular level, these patterns are indistinguishable from those caused by adaptation on the basis of novel variation. It is only with a detailed understanding of the functional importance of genomic regions that one can attribute genomic signatures of selection to processes that increase the fitness of individual organisms.
Acknowledgments
We thank Yun Tao for generously sharing research materials, fly stocks, and unpublished results as well as his expertise and insight into this system; also Luciana Araripe, Horacio Montenegro, Kalsang Namgyal, Erik Dopman, and Nguyen Nguyen for technical assistance, and Noemi Velazguez for administrative assistance. The Nicewicz family farm kindly gave us access to their farm for fly collections. We are grateful to John Wakeley and Molly Przeworski for help with the coalescent modeling. Daven Presgraves, Yun Tao, and two anonymous reviewers provided thoughtful comments, which greatly improved the manuscript. This work was supported by National Institutes of Health grant GM065169 to D.L.H and a National Science Foundation graduate research fellowship to S.B.K.
Supporting information is available online at http://www.genetics.org/cgi/content/full/genetics.109.109587/DC1.
References
- Andolfatto, P., 2001. Contrasting patterns of X-linked and autosomal nucleotide variation in Drosophila melanogaster and Drosophila simulans. Mol. Biol. Evol. 18 279–290. [DOI] [PubMed] [Google Scholar]
- Andolfatto, P., 2005. Adaptive evolution of non-coding DNA in Drosophila. Nature 437 1149–1152. [DOI] [PubMed] [Google Scholar]
- Andolfatto, P., and M. Przeworski, 2000. A genome-wide departure from the standard neutral model in natural populations of Drosophila. Genetics 156 257–268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ashburner, M., K. G. Golic and R. S. Hawley, 2005. Drosophila: A Laboratory Handbook. Cold Spring Harbor Laboratory Press, Cold Sping Harbor, NY.
- Baudry, E., N. Derome, M. Huet and M. Veuille, 2006. Contrasted polymorphism patterns in a large sample of populations from the evolutionary genetics model Drosophila simulans. Genetics 173 759–767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Begun, D. J., A. K. Holloway, K. Stevens, L. W. Hillier, Y. P. Poh et al., 2007. Population genomics: whole-genome analysis of polymorphism and divergence in Drosophila simulans. PLoS Biol. 5 e310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Begun, D. J., and P. Whitley, 2000. Reduced X-linked nucleotide polymorphism in Drosophila simulans. Proc. Natl. Acad. Sci. USA 97 5960–5965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Betran, E., J. Rozas, A. Navarro and A. Barbadilla, 1997. The estimation of the number and the length distribution of gene conversion tracts from population DNA sequence data. Genetics 146 89–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bodmer, W. F., and A. W. Edwards, 1960. Natural selection and the sex ratio. Ann. Hum. Genet. 24 239–244. [DOI] [PubMed] [Google Scholar]
- Carvalho, A. B., M. C. Sampaio, F. R. Varandas and L. B. Klaczko, 1998. An experimental demonstration of Fisher's principle: evolution of sexual proportion by natural selection. Genetics 148 719–731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charlesworth, B., and D. L. Hartl, 1978. Population dynamics of the segregation distorter polymorphism of Drosophila melanogaster. Genetics 89 171–192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Derome, N., E. Baudry, D. Ogereau, M. Veuille and C. Montchamp-Moreau, 2008. Selective sweeps in a 2-locus model for sex-ratio meiotic drive in Drosophila simulans. Mol. Biol. Evol. 25 409–416. [DOI] [PubMed] [Google Scholar]
- Dyer, K. A., B. Charlesworth and J. Jaenike, 2007. Chromosome-wide linkage disequilibrium as a consequence of meiotic drive. Proc. Natl. Acad. Sci. USA 104 1587–1592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edwards, A. W. F., 1961. The population genetics of “sex-ratio” in Drosophila pseudoobscura. Heredity 16 291–304. [Google Scholar]
- Engels, W. R., 2005. Amplify, version 3.1.4. Madison, WI. http://engels.genetics.wisc.edu/amplify/
- Fay, J. C., and C. I. Wu, 2000. Hitchhiking under positive Darwinian selection. Genetics 155 1405–1413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fisher, R. A., 1930. The Genetical Theory of Natural Selection. Oxford University Press, Oxford.
- Frank, S. A., 1991. Divergence of meiotic drive-suppression systems as an explanation for sex-biased hybrid sterility and inviability. Evolution 45 262–267. [DOI] [PubMed] [Google Scholar]
- Hall, D. W., 2004. Meiotic drive and sex chromosome cycling. Evolution 58 925–931. [DOI] [PubMed] [Google Scholar]
- Hall, T. A., 1999. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp. Ser. 41 95–98. [Google Scholar]
- Halligan, D. L., and P. D. Keightley, 2006. Ubiquitous selective constraints in the Drosophila genome revealed by a genome-wide interspecies comparison. Genome Res. 16 875–884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hamblin, M. T., and M. Veuille, 1999. Population structure among African and derived populations of Drosophila simulans: evidence for ancient subdivision and recent admixture. Genetics 153 305–317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hamilton, W. D., 1967. Extraordinary sex ratios. A sex-ratio theory for sex linkage and inbreeding has new implications in cytogenetics and entomology. Science 156 477–488. [DOI] [PubMed] [Google Scholar]
- Hammer, M. F., F. Blackmer, D. Garrigan, M. W. Nachman and J. A. Wilder, 2003. Human population structure and its effects on sampling Y chromosome sequence variation. Genetics 164 1495–1509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hartl, D. L., 1975. Modifier theory and meiotic drive. Theor. Popul. Biol. 7 168–174. [DOI] [PubMed] [Google Scholar]
- Hartl, D. L., and A. G. Clark, 2007. Principles of Population Genetics. Sinauer Associates, Sunderland, MA.
- Hey, J., 2004. HKA, version February 2005. Piscataway, NJ. http://genfaculty.rutgers.edu/hey/software#HKA
- Hill, W. G., and A. Robertson, 1968. Linkage disequilibrium in finite populations. Theor. Appl. Genet. 38 226–231. [DOI] [PubMed] [Google Scholar]
- Hudson, R. R., 1987. Estimating the recombination parameter of a finite population model without selection. Genet. Res. 50 245–250. [DOI] [PubMed] [Google Scholar]
- Hudson, R. R., M. Kreitman and M. Aguadé, 1987. A test of neutral molecular evolution based on nucleotide data. Genetics 116 153–159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hurst, L. D., and A. Pomiankowski, 1991. Causes of sex ratio bias may account for unisexual sterility in hybrids: a new explanation of Haldane's rule and related phenomena. Genetics 128 841–858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jaenike, J., 2001. Sex chromosome meiotic drive. Annu. Rev. Ecol. Syst. 32 25–48. [Google Scholar]
- Jutier, D., N. Derome and C. Montchamp-Moreau, 2004. The sex-ratio trait and its evolution in Drosophila simulans: a comparative approach. Genetica 120 87–99. [DOI] [PubMed] [Google Scholar]
- Kaplan, N. L., R. R. Hudson and C. H. Langley, 1989. The “hitchhiking effect” revisited. Genetics 123 887–899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kelly, J. K., 1997. A test of neutrality based on interlocus associations. Genetics 146 1197–1206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lachaise, D., M.-L. Cariou, J. R. David, F. Lemeunier, L. Tsacas et al., 1988. Historical biogeography of the Drosophila melanogaster species subgroup. Evol. Biol. 22 159–225. [Google Scholar]
- Lachaise, D., and J. F. Silvain, 2004. How two Afrotropical endemics made two cosmopolitan human commensals: the Drosophila melanogaster–D. simulans palaeogeographic riddle. Genetica 120 17–39. [DOI] [PubMed] [Google Scholar]
- Li, Y. J., Y. Satta and N. Takahata, 1999. Paleo-demography of the Drosophila melanogaster subgroup: application of the maximum likelihood method. Genes Genet. Syst. 74 117–127. [DOI] [PubMed] [Google Scholar]
- Lyttle, T. W., 1993. Cheaters sometimes prosper: distortion of Mendelian segregation by meiotic drive. Trends Genet. 9 205–210. [DOI] [PubMed] [Google Scholar]
- Markow, T. A., 1996. Evolution of Drosophila mating systems. Evol. Biol. 29 73–106. [Google Scholar]
- McDermott, S. R., and R. M. Kliman, 2008. Estimation of isolation times of the island species in the Drosophila simulans complex from multilocus DNA sequence data. PLoS ONE 3 e2442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mercot, H., A. Atlan, M. Jacques and C. Montchamp-Moreau, 1995. Sex-ratio distortion in Drosophila simulans: co-occurence of a meiotic drive and a suppressor of drive. J. Evol. Biol. 8 283–300. [Google Scholar]
- Montchamp-Moreau, C., D. Ogereau, N. Chaminade, A. Colard and S. Aulard, 2006. Organization of the sex-ratio meiotic drive region in Drosophila simulans. Genetics 174 1365–1371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nei, M., 1987. Molecular Evolutionary Genetics. Columbia University Press, New York.
- Nielsen, R., 2005. Molecular signatures of natural selection. Annu. Rev. Genet. 39 197–218. [DOI] [PubMed] [Google Scholar]
- O'Reilly, P. F., E. Birney and D. J. Balding, 2008. Confounding between recombination and selection, and the Ped/Pop method for detecting selection. Genome Res. 18 1304–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orr, H. A., and S. Irving, 2005. Segregation distortion in hybrids between the Bogota and USA subspecies of Drosophila pseudoobscura. Genetics 169 671–682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Phadnis, N., and H. A. Orr, 2009. A single gene causes both male sterility and segregation distortion in Drosophila hybrids. Science 323 376–379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Presgraves, D. C., 2008. Sex chromosomes and speciation in Drosophila. Trends Genet. 24 336–343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Przeworski, M., 2003. Estimating the time since the fixation of a beneficial allele. Genetics 164 1667–1676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ptak, S. E., and M. Przeworski, 2002. Evidence for population growth in humans is confounded by fine-scale population structure. Trends Genet. 18 559–563. [DOI] [PubMed] [Google Scholar]
- Rozas, J., J. C. Sanchez-DelBarrio, X. Messeguer and R. Rozas, 2003. DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19 2496–2497. [DOI] [PubMed] [Google Scholar]
- Sandler, L., and E. Novitski, 1957. Meiotic drive as an evolutionary force. Am. Nat. 91 105–110. [Google Scholar]
- Sharp, P. M., and W. H. Li, 1989. On the rate of DNA sequence evolution in Drosophila. J. Mol. Evol. 28 398–402. [DOI] [PubMed] [Google Scholar]
- Stumpf, M. P., and G. A. McVean, 2003. Estimating recombination rates from population-genetic data. Nat. Rev. Genet. 4 959–968. [DOI] [PubMed] [Google Scholar]
- Tajima, F., 1983. Evolutionary relationship of DNA sequences in finite populations. Genetics 105 437–460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tajima, F., 1989. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123 585–595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tao, Y., L. Araripe, S. B. Kingan, Y. Ke, H. Xiao et al., 2007. a A sex-ratio meiotic drive system in Drosophila simulans. II: an X-linked distorter. PLoS Biol. 5 e293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tao, Y., D. L. Hartl and C. C. Laurie, 2001. Sex-ratio segregation distortion associated with reproductive isolation in Drosophila. Proc. Natl. Acad. Sci. USA 98 13183–13188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tao, Y., J. P. Masly, L. Araripe, Y. Ke and D. L. Hartl, 2007. b A sex-ratio meiotic drive system in Drosophila simulans. I: an autosomal suppressor. PLoS Biol. 5 e292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tatusova, T. A., and T. L. Madden, 1999. BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiol. Lett. 174 247–250. [DOI] [PubMed] [Google Scholar]
- Tavare, S., D. J. Balding, R. C. Griffiths and P. Donnelly, 1997. Inferring coalescence times from DNA sequence data. Genetics 145 505–518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomson, G. J., and M. W. Feldman, 1975. Population genetics of modifiers of meiotic drive. IV. on the evolution of sex-ratio distortion. Theor. Popul. Biol. 8 202–211. [DOI] [PubMed] [Google Scholar]
- Untergasser, A., H. Nijveen, X. Rao, T. Bisseling, R. Geurts et al., 2007. Primer3Plus, an enhanced web interface to Primer3. Nucleic Acids Res. 35 W71–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vaz, S. C., and A. B. Carvalho, 2004. Evolution of autosomal suppression of the sex-ratio trait in Drosophila. Genetics 166 265–277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wakeley, J., 2009. Coalescent Theory: An Introduction. Roberts and Company, Greenwood Village, CO.
- Wall, J. D., P. Andolfatto and M. Przeworski, 2002. Testing models of selection and demography in Drosophila simulans. Genetics 162 203–216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watterson, G. A., 1975. On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7 256–276. [DOI] [PubMed] [Google Scholar]
- Zimmering, S., L. Sandler and B. Nicoletti, 1970. Mechanisms of meiotic drive. Annu. Rev. Genet. 4 409–436. [DOI] [PubMed] [Google Scholar]



