Abstract
Populations of widely distributed species encounter and must adapt to local environmental conditions. However, comprehensive characterization of the genetic basis of adaptation is demanding, requiring genome-wide genotype data, multiple sampled populations, and an understanding of population structure and potential selection pressures. Here, we used single-nucleotide polymorphism genotyping and data on numerous environmental variables to describe the genetic basis of local adaptation in 21 populations of teosinte, the wild ancestor of maize. We found complex hierarchical genetic structure created by altitude, dispersal events, and admixture among subspecies, which complicated identification of locally beneficial alleles. Patterns of linkage disequilibrium revealed four large putative inversion polymorphisms showing clinal patterns of frequency. Population differentiation and environmental correlations suggest that both inversions and intergenic polymorphisms are involved in local adaptation.
Keywords: Zea mays, parviglumis, mexicana, inversion, population structure, admixture
Introduction
One of the enduring goals of evolutionary genetics is to understand the genomic and geographic extent of adaptation in natural populations. Adaptive differences among closely related contemporary populations are prime examples of the impact of natural selection on genetic polymorphisms. While many examples of adaptation in nature are known (Ford and Ford 1930; Clausen et al. 1940; Endler 1980; Grant and Grant 2002), our theoretical understanding (Lewontin and Krakauer 1973; Feder and Nosil 2010; Le Corre and Kremer 2012) of the genomic signatures of adaptation has so far outpaced natural studies, and a number of outstanding questions remain. For example, what kinds of traits are adaptive? What are the prevailing forms (i.e., clinal, local) of adaptation in nature? What parts of the genome and what kinds of loci are involved?
One approach to link adaptation to genetic diversity is the reverse ecology (Li et al. 2008) strategy of identifying genomic regions that have been the target of natural selection in order to understand ecologically important variation. The advantage of this approach is that it is not dependent on experimental setup or identification of relevant, measurable traits. This approach uses a number of statistical and population genetic methods to identify putatively selected loci. Such loci are usually expected to show higher genetic differentiation among populations and lower genetic diversity within populations (e.g., Lewontin and Krakauer 1973; Storz 2005), and should be correlated with environmental variation that is directly or indirectly causing the selection pressure (Coop et al. 2010; Eckert et al. 2010a). Several related methods to identify loci that have exceptionally high differentiation are available (Beaumont and Nichols 1996; Beaumont and Balding 2004; Foll and Gaggiotti 2008). These are typically based on identifying FST outliers by generating null distributions of FST from a simple neutral model. Results of such methods may depend strongly on the accuracy of the null model: hierarchical population structure, for example, can lead to a large number of false positives (Excoffier et al. 2009). An alternative approach involves identifying associations between environmental variables and genetic polymorphisms while controlling for population structure. This method has been successfully applied to identify adaptively important loci among populations in humans (Coop et al. 2010; Hancock et al. 2011a) and Arabidopsis thaliana (Fournier-Level et al. 2011; Hancock et al. 2011b).
For plants, adaptation to local conditions is especially important because, as sedentary organisms, they cannot readily migrate to more benign conditions. The advent of high-throughput genotyping has begun to enable researchers to answer some of the questions related to the genetic basis of local adaptation in plants. For example, recent studies in Arabidopsis have shown geographically localized fitness effects (Fournier-Level et al. 2011), enrichment for non-synonymous variants at adaptive loci (Hancock et al. 2011b), and adaptation across a number of environmental variables (e.g., soil type, temperature, and precipitation) (Turner et al. 2010; Fournier-Level et al. 2011). However, although both global (Platt et al. 2010) and local (Bomblies et al. 2010) population structure in Arabidopsis can be complex, little is known of how this has impacted studies of local adaptation. Moreover, Arabidopsis has an unusually small and relatively simple genome among plants (Meyerowitz and Pruitt 1985), and it is not yet clear how generally applicable results from this model species are to other taxa.
The wild relatives of domesticated maize offer an excellent opportunity to investigate questions of local adaptation in natural populations (Hufford, Bilinski, et al. 2012). In addition to the well-known domesticated maize, Zea mays ssp. mays, the species Zea mays includes three wild subspecies, collectively known as teosintes. Two of these, Zea mays ssp. parviglumis (hereafter parviglumis) and Zea mays ssp. mexicana (hereafter mexicana), are widespread taxa found in natural populations spanning central and southwest Mexico (fig. 1). Their combined range extends across varied environments, from the warmer low elevations of the Balsas River Valley to the cooler high elevations of Mexico’s Central Plateau, encompassing 2-fold differences in precipitation and several soil types. Ecological niche modeling suggests that both taxa have moved little in response to climate change since the Last Glacial Maximum (approximately 20,000 base pair) (Hufford, Martínez-Meyer, et al. 2012), and their annual life history and outcrossing mating system have likely facilitated adaptation to local conditions. The teosintes also exhibit complex hierarchical population structure (Fukunaga et al. 2005) and have a genome of relatively average size and complexity among angiosperms (Gregory et al. 2007), thus serving as a valuable counterpoint to results from Arabidopsis.
Fig. 1.—
Map of sampled Zea mays ssp. parviglumis and ssp. mexicana populations.
Previous work has investigated the genetic architecture of morphological differences in the teosintes (Lauter et al. 2004; Weber et al. 2007, 2008), and overall patterns of genetic diversity and structure among populations have been detailed using various genetic markers (Smith et al. 1984; Doebley et al. 1987; Fukunaga et al. 2005; Moeller et al. 2007). Our understanding of patterns of variation across the genome, however, has been limited to a handful of accessions or analyses of individual populations (van Heerwaarden et al. 2010; Hufford et al. 2012). Even less is known regarding the genetics of local adaptation, which has only been studied at a few loci in the context of plant immunity (Moeller and Tiffin 2008).
Clinal patterns of large inversions in Drosophila were one of the first observed adaptive patterns in genetic polymorphisms (Dobzhansky 1970). A major effect of chromosomal inversions is recombination suppression, and early on it was suggested that clinal patterns were a result of selection acting on large co-adapted gene complexes caught by inversions (Wright and Dobzhansky 1946; Dobzhansky 1970). Recent theoretical work has shown that no functional connection among genes within inversions is necessary; for selection to maintain inversion polymorphisms they only need to capture two or more locally beneficial alleles (Kirkpatrick and Barton 2006; Kirkpatrick 2010). Large genomic rearrangements are often observed in maize and its wild relatives (Ting 1964; Kato 1975), but their adaptive significance is generally not known. The single exception to this is an observation of an altitudinal cline and unusual haplotype structure at a large inversion found segregating in multiple parviglumis and mexicana populations (Fang et al. 2012).
Here we present a detailed population genetic examination of local adaptation across 21 populations of mexicana and parviglumis (fig. 1) based on single-nucleotide polymorphism (SNP) genotyping. We document complex, hierarchical population structure patterned by elevation, dispersal, and recent admixture. We scan the genome for alleles associated with excess differentiation or strong correlation with environmental gradients and show that population structure complicates our ability to identify putatively adaptive loci. Nonetheless, we identify a number of candidate SNPs, four large, putatively adaptive inversion polymorphisms, and an excess of loci in intergenic regions of the genome. These results differ markedly from previous work in other plant species, suggesting that the genetics of local adaptation may vary considerably among taxa.
Materials and Methods
Sampling
Ten to 12 individuals were sampled from each of 11 populations of parviglumis and 10 populations of mexicana (fig. 1 and supplementary table S1, Supplementary Material online). Seeds from four populations of parviglumis (Ejutla A and B, San Lorenzo, and La Mesa) were sampled by Matthew B. Hufford in 2007, while the remaining seven populations were obtained from nonregenerated bulked seed from accessions in the USDA germplasm collections. All mexicana populations were collected in 2008 and kindly provided by Pesach Lubinsky and Norman Ellstrand. Two mexicana individuals were removed from subsequent analysis after being identified as recent hybrids with maize in an initial principal component analysis (PCA) that included data from 279 maize inbred lines (Cook et al. 2012). A Tripsacum dactyloides (Nei's divergence to Zea = 0.06, Ross-Ibarra et al. 2009) sample provided by Sherry Flint-Garcia was used as an outgroup. The geodetic distances among populations vary from 3 km between Santa Clara and Opopeo to 1,503 km between El Rodeo and Nabogame, and the populations cover an altitudinal range from 590 to 2,609 m above sea level. Previously published genotype data (Cook et al. 2012) and phenotypic data from 278 maize inbred lines (Flint-Garcia et al. 2005) were used for population structure and trait association analyses.
SNP Genotyping
Leaf tips of seedlings at the five-leaf stage were collected, stored at −80 °C overnight, and lyophilized for 48 h. DNA was extracted from homogenized tissue following a modified CTAB protocol (Saghai-Maroof et al. 1984) and quantified using a NanoDrop spectrophotometer (NanoDrop Technologies, Inc., Wilmington, DE, USA) and Wallac VICTOR2 fluorescence plate reader (Perkin-Elmer Life and Analytical Sciences, Torrance, CA) with the Quant-iT™ PicoGreen® dsDNA Assay Kit (Invitrogen, Grand Island, NY, USA).
SNP genotyping was conducted at the UC Davis Genome Center using the MaizeSNP50 BeadChip and Infinium® HD Assay (Illumina, San Diego, CA, USA). SNP genotypes were called using GenomeStudio V2009.1 (Illumina). After dropping SNPs with >10% missing data in either subspecies and manual adjustment of clustering, average call rates (the proportion of successfully called genotypes) per individual were 99% for both parviglumis and mexicana. Genotyping error estimated from three technical replicates of parviglumis was approximately 0.0015% after mismatches caused by missing data were excluded. In total, 43,701 SNPs were called in parviglumis and 43,694 in mexicana. SNPs within a region on chromosome 8 that showed long regions of deviation from Hardy–Weinberg equilibrium in individual populations and showed discrepancies between the physical and genetic maps in Ganal et al. (2011) were removed. After removing monomorphic and duplicate SNPs and SNPs that did not map or mapped multiply to the maize reference genome (release 5b.60), the final data set consisted of 36,719 SNPs genotyped in both subspecies. SNP data, map positions, and annotations are available at http://datadryad.org/resource/doi:10.5061/dryad.8m648.
Additional, more rigorous filtering was applied in haplotype-based analyses that are especially sensitive to the relationship between physical and genetic map positions. Genetic map coordinates for fastPHASE were obtained from a modified map (Gerke et al. 2013) of the maize Intermated B73 × Mo17 (IBM) population (Ganal et al. 2011). The 3,615 SNPs for which genetic and physical map disagreed or information was not available were omitted from haplotype-based analyses.
Diversity and Population Structure
Observed (HO) and expected (HE) heterozygosity and deviation from Hardy–Weinberg equilibrium were calculated separately for each population using the “genetics” package in R (R Core Team 2012). The inbreeding coefficient FIS was calculated as (HE − HO)/HE.
The prcomp function in R was used to perform PCA of SNP genotypes. The number of significant principal components was estimated based on the Tracy-Widom distribution (Patterson et al. 2006). Individuals were assigned to groups based on significant principal components (PCs) using Ward clustering via the hclust function of R (van Heerwaarden et al. 2010).
Pairwise FST (Weir and Cockerham 1984) was calculated for each pair of genetic groups using all polymorphic SNPs. The relationships between genetic, geodetic, and altitudinal distance were evaluated using Mantel tests and partial Mantel tests in the R package “vegan” (Oksanen et al. 2011). The significance of correlations between pairwise genetic distances measured as FST/(1 − FST) and matrices of both geodetic distance and altitude differences among populations were estimated using 9999 permutations of rows and columns of the distance matrices.
Admixture and population structure were estimated using the software package STRUCTURE (Falush et al. 2003) based on genotypes at 10,000 random SNPs. This subset is sufficient to represent population structure and was used to shorten the computational time required by STRUCTURE. Data from 279 inbred lines (Cook et al. 2012) were used in analyses that included cultivated maize. STRUCTURE was run under the admixture and correlated allele frequency model. Two independent runs of 2,500 burn-in MCMC iterations followed by 20,000 retained iterations were performed for each number of groups (K) from 2 to 6 for analyses including maize and from 2 to 9 for analyses including only teosinte samples. Based on comparisons of independent runs, 20,000 iterations were sufficient for convergence. Results were inspected with STRUCTURE HARVESTER v0.6.8 (Earl and vonHoldt 2012) and visualized using DISTRUCT (Rosenberg 2004).
In initial analyses the Ahuacatitlan population appeared to be intermediate between subspecies. Its phylogenetic placement relative to mexicana and parviglumis was evaluated based on the configuration of shared derived alleles using the D statistic of Green et al. (2010) under three different trees. D is a measure of deviation from the expected number of discordant gene trees given a suggested genealogy. For each chromosome, a haplotype was sampled from each of three groups (Ahuacatitlan, parviglumis excluding Ahuacatitlan, and mexicana), and the D value was calculated based on all sites that were polymorphic among the three groups. Sampling was repeated 1000 times to obtain the distribution of D. Homozygous SNPs from the sister genus Tripsacum were used to determine the ancestral state.
Haplotype Sharing and Linkage Disequilibrium
Haplotypes were inferred with fastPHASE (Scheet and Stephens 2006) using known haplotypes of 18 teosinte inbred lines (Chia et al. 2012) and default parameters. Sites with residual heterozygosity in the inbred lines were excluded. Segments of identity-by-state (IBS) and runs of homozygosity were identified using GERMLINE (Gusev et al. 2009), with a seed segment size of 50 SNPs and allowing zero heterozygous and homozygous mismatches.
The software TASSEL (Bradbury et al. 2007) was used to estimate linkage disequilibrium (LD) (r2 and its P value) using phased data from all individuals and pairs of sites with minimum allele frequency >0.1. Blocks of LD were identified based on visual inspection of LD plots.
Candidate SNP Identification
Due to evidence of admixture, the Ahuacatitlan population was excluded from candidate SNP analyses. BAYENV (Coop et al. 2010) was used to evaluate the correlation between environmental variables and allele frequencies of individual SNPs. A random set of 10,000 SNPs was used to construct three covariance matrices: all populations and separately for each subspecies without the Ahuacatitlan parviglumis population. The maximum observed differences between two independent (50,000 iterations each) estimates of the covariance matrices were always less than approximately 10% of the smallest estimated covariance, indicating good convergence between runs.
Seventy-six environmental variables were analyzed, including 8 soil variables, 19 bioclimatic variables, monthly precipitation and monthly mean, maximum and minimum temperature, and altitude (supplementary table S2, Supplementary Material online). For climatic variables, 30 arc-second (∼1 km) resolution climate data were downloaded from www.worldclim.org (last accessed August 12, 2013), and DIVA-GIS (Hijmans et al. 2001) was used to extract climate data for the population locations. Data for three key soil qualities (rooting conditions, oxygen availability to roots, and workability) that varied among populations were downloaded from the Harmonized World Soil Database, and data on five varying (cracking clays, volcanic, top soil clay, top soil sand, and top soil loam) key modifier layers (Sanchez et al. 2003) were downloaded from www.harvestchoice.org (last accessed August 12, 2013). All variables were standardized to a mean of 0 and standard deviation of 1. The dimensionality of environmental data was reduced using PCA (prcomp in R), and the PCs that captured 95% of the variance in environments among populations were used for analysis in BAYENV (supplementary table S3, Supplementary Material online).
Five independent BAYENV runs with 1,000,000 iterations were used to identify SNPs associated with these PCs. SNPs were considered candidates if they showed average Bayes factors across runs in the 99th percentile and were consistently in the 95th percentile of each run. A gene was considered a candidate when a candidate SNP was in the transcribed part of the gene.
To obtain the expected distribution of heterozygosity and the hierarchical differentiation statistics FCT (among subspecies) and FST (among 20 populations, excluding Ahuacatitlan), 100,000 coalescent simulations were conducted under a hierarchical island model of two groups of 100 demes using the software Arlequin (Excoffier and Lischer 2010). The maximum expected heterozygosity for simulations was 0.5. FST and FCT outliers were identified by comparing observed values to this model-based distribution of heterozygosity and differentiation.
The data consist of multiple populations and estimation of the overall deviation from random mating due to population structure (FST) may obscure the signal of differentiation in individual populations. To identify SNPs underlying differentiation at the level of individual populations, F statistics were calculated in a hierarchical framework using the R package “hierfstat” (Yang 1998; Goudet 2005). Variance components were calculated for three levels within each subspecies: population, focal, and individual. The focal component was calculated for each population and locus, yielding
the proportion of total variance due to differentiation between a focal population and all other populations, where
,
,
, and
are the estimates of variance between focal and other populations, among all populations, among all individuals, and error, respectively. Negative variance components were set to zero. For each population, SNPs with observed FFT values above the 99th percentile were considered candidates.
As a complement to the above approaches focusing on individual SNPs, we additionally applied the pairwise haplotype sharing (PHS) method of Toomajian et al. (2006). The PHS statistic estimates the average genetic map length of shared haplotypes around a given allele, with significance evaluated by comparison to shared haplotype length for that position across all individuals. We used the same genetic map and phased haplotypes as in the analysis of runs of homozygosity (ROH) in GERMLINE. The PHS test was conducted first for all samples and then separately for each individual population.
Trait Association Analysis
To investigate the potential relationship between phenotypic traits and local adaptation, we asked whether putatively adaptive SNPs identified here showed enrichment for loci significantly associated with traits in a maize diversity panel. Association mapping tests were carried out on 278 inbred maize lines (supplementary text S1, Supplementary Material online) from the association panel described in Flint-Garcia et al. (2005). These lines were genotyped using the Illumina MaizeSNP50 array (Cook et al. 2012) and phenotyped for 36 traits (127 trait/environment combinations) (Hung et al. 2012). The 51,253 SNPs with minor allele frequency >0.05 were tested against at least one of the traits. Associations were tested using the model
where
is the estimated genetic value of a given trait,
the trait mean,
the tested SNP,
the SNP effect,
the matrix of the panel structure as estimated by Flint-Garcia et al. (2005), β the vector of structure effects,
an incidence matrix,
a vector of random effects assumed to follow a distribution
,where
is the genetic variance-covariance matrix modeled by a shared allele matrix, and
is the model residual assumed to follow
. Only SNPs significantly associated to a given trait with P values <0.01 were retained for further analyses. To test whether enrichment for flowering time loci was affected by a loss of power due to correction for population structure, we also conducted association analyses for flowering time traits using a simplified model that did not include population structure and coancestry.
Enrichment Analyses
Enrichment of candidates in genic (within the transcribed region of a gene) versus nongenic regions and nonsynonymous versus synonymous classes was calculated by sampling the number of SNPs in each candidate list randomly from the genome 1000 times. Enrichment was assessed both with and without SNPs in the four inversion polymorphisms (Inv1n [Fang et al. 2012], Inv4m, Inv9e, and Inv9d) that were identified based on LD patterns (see Results for details). Enrichment of SNPs within inversions was further inspected by ranking SNPs in each candidate list and calculating the proportion of SNPs within inversions in each candidate list. The joint effect of all inversions across all candidate lists was determined by assigning each SNP a maximum rank (more significant ranks being higher) across all candidate lists and calculating the proportion of SNPs from inversions above the 99th quantile. Bootstrapping was used again to determine statistical significance.
Results
Genomic Diversity in Teosinte
We sampled 21 populations of the wild subspecies parviglumis and mexicana from across their native ranges in central and southern Mexico (fig. 1). Ten to 12 individuals from each population were genotyped using the Illumina MaizeSNP50 chip. After quality control, the full data set consisted of 248 samples genotyped at 36,719 SNPs. Although SNPs on the MaizeSNP50 chip were ascertained in a small sample of maize lines, ascertainment appears to have had minimal effect on analyses performed here, as shown by comparisons of minor allele frequencies between subspecies (supplementary fig. S1, Supplementary Material online) or between SNPs with different-sized ascertainment panels (supplementary fig. S2, Supplementary Material online).
Overall, heterozygosity differed only slightly between the two subspecies, but considerable variation was evident among populations in both taxa (fig. 2 and supplementary table S4, Supplementary Material online). Both parviglumis and mexicana showed generally high levels of heterozygosity and little evidence of inbreeding. Within parviglumis, the population of Los Guajes showed high diversity, an inbreeding coefficient (FIS) close to zero, no deviation from Hardy–Weinberg equilibrium, and relatively short runs of ROH (supplementary fig. S3, Supplementary Material online), as might be expected of a large, outcrossing population. Other populations, however, showed evidence of recent demographic changes: diversity in San Lorenzo parviglumis was only half that of Los Guajes, and mexicana individuals from Xochimilco were characterized by extremely long ROH and extensive within-population haplotype sharing, consistent with a recent population bottleneck.
Fig. 2.—
Diversity statistics. (A) Proportion of SNPs deviating from Hardy–Weinberg Equilibrium (HWE), proportion of polymorphic SNPs, and mean inbreeding coefficient FIS. (B) Length and number of ROH and average pairwise length of genomic segments IBS.
Patterns of LD varied greatly along the genome. Although LD generally decays quickly in teosinte (Hufford et al. 2012) and the median LD within 5-kb windows in our data is low (r2 = 0.04), we observed several discrete blocks of elevated LD in multiple populations. We identified four large (>10 Mb) regions of high (r2 ≥ 0.2) LD (fig. 3 and supplementary fig. S4, Supplementary Material online). Three of these regions appear to correspond to inversions previously described cytologically (Inv9e; Ting 1964), in mapping populations (Inv4m, Mano et al. 2012), or by population genetic analysis (Inv1n, Fang et al. 2012; Hufford et al. 2012). The size of these blocks and their chromosomal location effectively rules out recent selective sweeps or centromeric regions as alternative explanations, and we interpret all four as inversion polymorphisms. Clear haplotype structure was observed in IBS analysis for three of these putative inversions, and simple genetic-distance-based clustering, including Tripsacum and maize, suggested that the nonmaize haplotype was likely the derived state (supplementary fig. S5, Supplementary Material online). The inverted and noninverted haplotypes of Inv9d were less clearly demarcated, however, due to low frequency and high diversity of the putative inverted haplotype (supplementary fig. S5, Supplementary Material online).
Fig. 3.—

LD reveals structural rearrangements in teosinte. Shown are LD (r2, red) and permutation P value (black) for pairs of SNPs across chromosomes 1, 4, and 9. Dashed black lines delineate the likely boundaries of structural variants discussed in the text. P value and r2 cutoffs were chosen to visualize the LD blocks against the background level of LD.
Population Structure
Allele frequency in teosinte varied at multiple hierarchical levels. PCA of our SNP data revealed the strongest signal of population structure between the two subspecies (PC1, ∼12% of the variation explained; fig. 4A). Differentiation between subspecies was also evident in results from STRUCTURE (fig. 4B), haplotype sharing (supplementary figs. S6 and S7, Supplementary Material online), and higher pairwise FST between subspecies (average of all parviglumis–mexicana pairs: 0.33) than within subspecies (among parviglumis populations 0.24; among mexicana populations 0.23) comparisons. Putative inversion polymorphisms also separated the subspecies: Inv9d and Inv9e were found only in mexicana, whereas the derived haplotype at Inv4m showed a strong frequency difference between subspecies (supplementary fig. S8, Supplementary Material online).
Fig. 4.—

Population structure in teosinte. (A) Principal component analysis of all individuals, labeled according to the sampled population. (B) STRUCTURE results for all individuals. Individuals are grouped by population and populations ordered by increasing altitude.
Additional levels of structure were observed within subspecies as well (supplementary fig. S9, Supplementary Material online) with STRUCTURE and PCA analysis identifying groups of related populations largely consistent with previous work (Fukunaga et al. 2005). In all, the 20 significant principle components identified 21 clusters that generally corresponded to sampling locales (fig. 4A). Three populations of parviglumis did not follow this trend however: the two Ejutla populations formed a single genetic cluster and the Ahuacatitlan population split into two clusters.
Both geodetic distance (Mantel test P value: 0.006, Mantel statistic r = 0.40) and altitude (P value: 0.0008, r = 0.37) appeared to correlate with population differentiation in the combined data set (supplementary fig. S10A, Supplementary Material online; partial Mantel tests for geodetic distance P value: 0.0005, r = 0.39; altitude P value: 0.004, r = 0.41), consistent with analyses suggesting an important role for altitude in patterning diversity in a range-wide analysis of teosinte (Bradburd et al. 2013). Differentiation within each subspecies significantly correlated with geodetic distance (P value: 0.004, r = 0.55 in parviglumis; P value: 0.046, r = 0.48 in mexicana; supplementary fig. S10B and C, Supplementary Material online). However, this trend was driven mostly by the genetically and geographically distant populations of Nabogame and El Rodeo and was no longer significant after their removal.
Finally, both STRUCTURE and PCA results suggested that the Ahuacatitlan population of parviglumis was admixed (fig. 4). Consistent with this, analysis of haplotype sharing showed that the Ahuacatitlan population shared fewer long haplotypes with parviglumis and more long haplotypes with mexicana than any other parviglumis population (supplementary figs. S6 and S7, Supplementary Material online). The average length of shared haplotypes between Ahuacatitlan and mexicana (mean 4.8 cM) was, however, shorter than the average length of haplotypes shared among mexicana populations (mean 30.7 cM), arguing against extensive recent admixture. To explicitly test the origin of Ahuacatitlan, we applied a test of the configuration of shared derived alleles (Green et al. 2010) with populations of each of the two subspecies. The observed distribution of the D test statistic was inconsistent with models in which Ahuacatitlan is ancestral to both parviglumis and mexicana or a sister group to mexicana (P < 0.05 for both; supplementary fig. S11, Supplementary Material online), whereas the model in which Ahuacatitlan was sister to parviglumis could not be rejected (P value: 0.2).
Candidate SNP Identification
We applied four approaches to identify SNPs underlying local adaptation; candidate SNPs and genes from each of these approaches are listed in supplementary table S5, Supplementary Material online. We first applied the environmental association method implemented in BAYENV (Coop et al. 2010) to principal components summarizing 95% of the variation found among populations from a set of 76 climate and soil variables. Bayes factors estimated for a joint data set of parviglumis and mexicana identified 1,598 SNPs associated with one or more principle component(s), while 370 SNPs and 533 SNPs were identified for each subspecies respectively (table 1 and supplementary figs. S12–S14, Supplementary Material online).
Table 1.
Summary of Environmental Correlation and Differentiation Outlier Results
| Analysis | Variable | Major Loadings | BF, 99tha | No. Candidate SNPs | No. Candidate Genes |
|---|---|---|---|---|---|
| Both | PC1 | Altitude, temperature | 116.8 | 262 | 162 |
| PC2 | Temperature seasonality, soil quality and precipitation | 400.0 | 359 | 222 | |
| PC3 | Precipitation | 40.4 | 308 | 201 | |
| PC4 | Topsoil variables and precipitation seasonality | 39.4 | 229 | 145 | |
| PC5 | Mean diurnal range of temperature plus some soil variables | 72.9 | 291 | 184 | |
| PC6 | Mean diurnal range of temperature plus some soil variables | 10.2 | 225 | 151 | |
| parviglumis | PC1 | Altitude, temperature | 2.2 | 60 | 36 |
| PC2 | Temperature range and seasonality | 6.4 | 173 | 118 | |
| PC3 | Soil type and precipitation | 2.0 | 102 | 69 | |
| PC4 | Monthly precipitation and mean diurnal range of temperature | 4.8 | 42 | 24 | |
| mexicana | PC1 | Altitude, temperature | 6.9 | 108 | 69 |
| PC2 | Temperature seasonality and range and precipitation | 52.3 | 309 | 204 | |
| PC3 | Precipitation and volcanic soil | 31.1 | 27 | 16 | |
| PC4 | Top soil variables and temperature variability | 3.8 | 93 | 58 | |
| FCT | 731 | 728 | |||
| FST | 1363 | 411 |
aBayes factor (BF) at 99th percentile of distribution.
Our second approach identified locally adapted SNPs using an FST outlier method (Lewontin and Krakauer 1973; Excoffier et al. 2009) to scan for SNPs showing an excess of differentiation between subspecies (FCT) or among populations (FST). Based on simulations under the hierarchical model, 731 FCT (2.0%) and 1363 FST (3.7%) outliers were identified at a P value cutoff of 0.01. A striking peak in differentiation between both subspecies and populations was observed within inversion Inv4m, which showed 12-fold and 6-fold enrichment of outlier SNPs for FCT and FST, respectively. Both sets of outliers (fig. 5 and supplementary table S5, Supplementary Material online) differed substantially from SNPs identified by BAYENV, and correlations between Bayes factors and differentiation were low (supplementary table S6, Supplementary Material online).
Fig. 5.—
Differentiation and environmental correlation in inversions compared with genome-wide distribution of test statistics. Joint distribution of FCT (A) and FST (B) versus heterozygosity under a hierarchical island model for all SNPs. The black line indicates the 1% tail based on simulations. Values for each inversion when treated as a single locus indicated with yellow. Extreme outlier loci indicated with red. (C) Bayes factors for PC1 in mexicana. Values are plotted across all 10 chromosomes, with each chromosome in a different color. Black dots represent outlier SNPs, and black horizontal bars below the plots indicate the positions of inversions.
Third, we estimated the proportion of variation due to differentiation in a single focal population, FFT (see Materials and Methods). Mean FFT across populations was lower than 0.1, suggesting modest deviation from random mating due to individual populations. SNPs in the extreme 1% tail of the FFT distribution generally showed little overlap or correlation with FST or FCT values, but populations at range extremes such as El Rodeo and Nabogame had higher FFT, and high FFT SNPs from these populations were also overrepresented among FST (Wilcoxon rank-sum test P values <0.001) and BAYENV outliers (Wilcoxon rank-sum test P values <0.001 for PC1, PC2, and PC3). This effect was most evident in Nabogame, where FFT outliers constituted approximately 9% of all PC2 outliers for BAYENV.
Finally, to complement the above approaches focusing on individual SNP frequencies, we implemented the PHS test (Toomajian et al. 2006) to identify regions with extensive haplotype sharing as might be expected from a recent partial sweep. When the PHS test was applied to our entire sample (supplementary fig. S15, Supplementary Material online), outliers (P value cutoff 0.01) showed significant overlap (P value < 0.001) with PC1 BAYENV, FFT, and FST outliers, due primarily to haplotype structure created by the putative inversions on chromosomes 4 (position 168,832,447–182,596,678) and 9 (positions 0–29,403,252 and 118,862,807–143,163,957). Significant overlap was also observed between FFT outliers and PHS outliers identified in tests of individual populations (11 out of 21 populations had P values <0.01 based on permutations). These combined results indicate that SNPs identified as differentiated in a specific population often reside in a region with unexpectedly long shared haplotypes.
Characterization of Candidate Loci
We assessed additional evidence for local adaptation at candidate loci by further dissecting signals in putative inversions and gauging enrichment of candidates in described functions. All four putative inversions contained an excess of differentiated SNPs or SNPs associated with environmental variables (fig. 6 and table 2). To partially control for the nonindependence of loci within inversions, each SNP was given a maximum rank across all environmental association candidate lists (see Materials and Methods for details). Inversions were 2-fold enriched for candidate SNPs (P value < 0.001), containing 5.6% of all SNPs but 11% of SNPs in the 99th percentile of the maximum rank distribution of these lists.
Fig. 6.—
Fold enrichment of observed ratios of genic/nongenic SNPs among candidates. Fold of enrichment for each set of candidates (red line) from BAYENV, FST, and FCT, SNPs significant in any of the population-based FFT and PHS analysis, and phenotypic association analysis (GWAS). Expected distribution of fold of enrichment ratios (gray dots) was obtained by random sampling of the same amount of SNPs as in the candidate set 1000 times and calculating the ratio and fold enrichment. SNPs within inversions were excluded.
Table 2.
Enrichment of Sets of Candidate SNPs within Four Inversions
| Analysis | Variable |
P Valuea |
Biallelic | ||||
|---|---|---|---|---|---|---|---|
| Inv1n | Inv4m | Inv9d | Inv9e | All Inversions | |||
| Both | PC1 | <0.001 (4.5) | 0.003 (3.7) | <0.001 (7.6) | 0.027 (2.2) | 0.0001 (3.3.) | Inv9d |
| PC2 | 0.032 (1.7) | 0.824 | <0.001 (3.9) | 0.331 | 0.0016 (1.8) | ||
| PC3 | <0.001 (5.7) | 1.000 | 0.819 | 0.186 | 0.0001 (2.2) | Inv1n | |
| PC4 | 0.002 (2.7) | 0.574 | 0.001 (3.1) | <0.001 (4.0) | 0.0001 (2.5) | Inv9e | |
| PC5 | 0.440 | 1.000 | 1.000 | 0.938 | 0.9942 | ||
| PC6 | 0.672 | 1.000 | 0.859 | 0.687 | 0.9745 | ||
| parviglumis | PC1 | 0.450 | 1.000 | 0.290 | 1.000 | 0.0181 (1.5) | |
| PC2 | 0.616 | 0.016 (3.2) | 0.045 (2.3) | 0.728 | 0.1555 | ||
| PC3 | 0.261 | 1.000 | 1.000 | 0.812 | 0.3764 | ||
| PC4 | 0.271 | 1.000 | 1.000 | 0.474 | 0.4543 | ||
| mexicana | PC1 | 0.001 (3.8) | 1.000 | <0.001 (33.0) | 0.095 | 0.0001 (3.2) | Inv9d |
| PC2 | 0.748 | 0.694 | <0.001 (4.3) | 0.027 (2.0) | 0.0007 (1.8) | ||
| PC3 | 0.136 | 1.000 | 1.000 | 1.000 | 7.99 | ||
| PC4 | 0.002 (3.9) | 1.000 | 1.000 | <0.001 (8.2) | 0.0001 (2.0) | Inv9e | |
| FCT | <0.001 (2.9) | <0.001 (11.7) | 0.049 (1.6) | 0.705 | 0.0001 (3.4) | Inv4m | |
| FST | <0.001 (2.7) | <0.001 (6.3) | <0.001 (1.8) | 0.866 | 0.0001 (3.2) | Inv4m | |
Note.—Inversions significant when analyzed as a single biallelic locus are listed in the column “biallelic.”
aFor each P value <0.05, the magnitude of enrichment is reported in parenthesis. P values are based on bootstrapping.
To further control for nonindependence of SNPs within putative inversions, we conducted an additional BAYENV analysis with inversions treated as single biallelic loci. We assigned each individual a genotype based on neighbor-joining trees of SNPs within the inversion (supplementary fig. S5, Supplementary Material online). All four putative inversions were identified in environmental association analysis when treated as biallelic loci. Three of these inversions (Inv1n, Inv4m, and Inv9d) showed altitudinal clines (supplementary fig. S8, Supplementary Material online), though the signal at Inv1n was less clear than originally reported in Fang et al. (2012) as the derived (inverted) arrangement was absent from the low elevation Crucero Lagunitas population of parviglumis.
We tested candidate SNPs for enrichment in functional categories (genic vs. nongenic, synonymous vs. nonsynonymous) as well as for association with maize phenotypic traits (Flint-Garcia et al. 2005; Hung et al. 2012), putative adaptive introgressions from mexicana into maize (Hufford et al. 2013), and ±5 Mb around centromeres (Wolfgruber et al. 2009). PHS outliers were the only set of candidates enriched for genic SNPs (P value <0.001 for outliers based on the total sample, with 11 of 21 populations having P <0.01). In contrast, FST outliers were strongly enriched for nongenic SNPs (P values < 0.001, supplementary table S7, Supplementary Material online), and both FST and FCT outliers were farther from genes than other SNPs (P values < 0.001, two-sample Kolmogorov–Smirnov test). FST outliers were also enriched for SNPs associated with the architecture of the male inflorescence in a diverse maize panel (Flint-Garcia et al. 2005; Cook et al. 2012; supplementary table S8, Supplementary Material online). SNPs significantly associated with flowering time, ear size, and stand count (plant survival and density) were overrepresented among both FCT and FST outliers (supplementary table S9, Supplementary Material online), but did not show enrichment in an association model that took into account structure and relatedness (both K + Q; see Materials and Methods). Among genes that showed evidence of introgression from mexicana into maize, both FST (P value < 0.001) and FCT (P value < 0.001) candidates as well as SNPs associated with PC2 (temperature range) within mexicana (P value 0.0061) were more common than expected by chance (Hufford et al. 2013). Excluding SNPs within inversions did not qualitatively change any of these results (data not shown).
In centromeres, weak but significant enrichment of environmentally correlated SNPs was observed for PC1 (2.7-fold, P < 0.001) and PC2 (1.5-fold, P = 0.046) in joint analysis, and PC3 in mexicana (4.3-fold, P = 0.009) (supplementary figs. S12–S14, Supplementary Material online). Both FST and FCT outliers were 5-fold enriched in centromeres (P < 0.001), mostly due to centromeres on chromosomes 2 and 4, where enrichment was 21- and 17-fold for FST and FCT, respectively (supplementary fig. S17, Supplementary Material online).
Discussion
Complex Population Structure
The presence of population structure is a crucial consideration when determining the potential for local adaptation. Population structure can bias estimates of demographic history and the inference of selection through its effect on the allele frequency spectrum (Siol et al. 2010). In addition, Eckert et al. (2010b) point out that when environmental gradients and population structure coincide, environmental correlation methods may suffer from increased rates of false positives and negatives if the analysis under- or overcorrects for population structure. Previous theoretical work under simple island (Städler et al. 2009) or stepping stone (De and Durrett 2007) models has shown that biases in demographic inference can be ameliorated by sampling single individuals from each of many populations. However, such a sampling scheme does not take into account uneven or hierarchical population structure (St. Onge et al. 2012).
Our results indicate that population structure in teosinte is complex and affected by multiple factors. Across all populations, the effects of altitude on population structure are significant, even after correcting for geodetic distance (supplementary fig. S10A, Supplementary Material online). The mexicana populations of Santa Clara and Opopeo, for example, show higher FST with the La Cadena parviglumis population 60 km away than with distant parviglumis populations at the same elevation (supplementary fig. S16, Supplementary Material online). These results are in agreement with other studies that have noted the important role of altitude in determining genetic distance, genome size, and morphological characters in Z. mays subspecies (Smith et al. 1981; Poggio et al. 1998; Buckler et al. 2006; van Heerwaarden et al. 2011; Bradburd et al 2013). There is also clear hierarchical population structure (fig. 4 and supplementary fig. S9, Supplementary Material online), with divisions between subspecies and among previously identified groups within subspecies (Fukunaga et al. 2005). Within subspecies, the effect of altitude is no longer statistically significant, but isolation by distance continues to explain a meaningful portion of the genetic structure observed in both taxa (supplementary fig. S10B and C, Supplementary Material online). In addition to distance and altitude, stochastic founding events have likely played a role in patterning diversity. This is most evident in the extensive haplotype sharing within the geographically dispersed La Mesa/El Sauz/San Lorenzo group (supplementary fig. S6, Supplementary Material online). The importance of founder events is consistent with previous analyses by Fukunaga et al. (2005) and Buckler et al. (2006) who observed that genetic distance was more correlated with specific dispersal routes than with geodetic distance.
Population structure had a considerable effect on many of our outlier detection approaches. Parviglumis and mexicana do not conform to a simple two-level hierarchical island model where gene flow between populations within subspecies would be equal. The impact of this structure when determining candidates for local adaptation was most clearly seen in a subset of strongly differentiated populations. FFT outlier SNPs specific to the isolated El Rodeo and Nabogame populations have smaller FST P values than FFT outliers from other populations (Wilcoxon rank-sum test, P < 0.001). Similarly, the Nabogame mexicana population has extreme temperature and precipitation, and SNPs that are strongly differentiated in Nabogame (FFT outliers) have significantly higher Bayes factors for PC2 (temperature seasonality) in the BAYENV analysis than FFT outliers from other mexicana populations (Wilcoxon rank-sum test, P < 0.001). While these findings may reflect true local adaptation, they should be interpreted with caution since such geographically restricted environmental variation has been shown to cause false positives in association studies even when population structure is taken into account (Mathieson and McVean 2012). This may be one explanation for the observation that new mutations with narrow geographic distributions were enriched among climate-associated loci in Arabidopsis (Hancock et al. 2011b). Because most plants likely show complex patterns of structure (Loveless and Hamrick 1984), it is clear that careful population level sampling and consideration of population structure will continue to be important in studies of plant adaptation and evolutionary history.
In addition to providing insight regarding local adaptation, our examination of population structure identified a putative hybrid zone between parviglumis and mexicana. While hybridization between these teosinte taxa is known to occur (Fukunaga et al. 2005; van Heerwaarden et al. 2011), details of admixture within individual populations have not been well documented. Our data suggest that our highest elevation parviglumis population, Ahuacatitlan, is extensively admixed, with mean assignment probabilities to mexicana of approximately 50% across individuals in the population. Patterns of haplotype sharing (supplementary figs. S6 and S7, Supplementary Material online) and derived allele counts (supplementary fig. S11, Supplementary Material online), however, support a model of continual gene flow with parviglumis over a relatively long period of time. Several of the parviglumis individuals sampled by Ross-Ibarra et al. (2009) are from localities very near to Ahuacatitlan, likely contributing to these authors’ inference of continuous gene flow between mexicana and parviglumis. Reports of hybrids from localities near Ahuacatitlan (Fukunaga et al. 2005), together with our results showing little admixture in other parviglumis populations in geographic proximity to mexicana, indicate the presence of a geographically restricted hybrid zone between mexicana and parviglumis occurring at mid elevations in the Eastern Balsas region.
Adaptive Inversion Polymorphisms
Cytological studies of both mexicana and parviglumis have previously identified a number of inversion polymorphisms (Ting 1964; Wilkes 1967; Kato 1975). In our genotyping data, we observe four blocks of LD that we interpret as large inversions. Together they comprise approximately 5% of the maize reference genome, and their role in patterning both molecular and phenotypic variation (Fang et al. 2012) among populations is significant.
Three of the inversion polymorphisms we have identified here based on LD have previously been observed. Inversion Inv1n was first identified by Hufford et al. (2012) and its population genetics subsequently described by Fang et al. (2012). Inv9e was identified cytologically by Ting (1964), who found the inverted arrangement in both of the mexicana populations in which we see the derived haplotype. Finally, the LD block we identify as Inv4m is at or near the physical positions of several inversions identified cytologically in maize and teosinte (Ting 1964; Wilkes 1967) and by marker map order in Zea nicaraguensis (Mano et al. 2012). Synteny maps identify the proximal breakpoint of Inv4m as an ancient chromosomal fusion involving the homeolog of the distal end of the telomere of chromosome 5 (Schnable et al. 2009; Wang and Bennetzen 2012), and the breakpoint appears common to several distinct inversions in maize (Doyle 1994). In addition, there appears to be long-distance LD between Inv4m and the centromere of chromosome 4 (fig. 3). All in all, this suggests that the region of Inv4m may be prone to structural rearrangements.
All four observed inversions appear to play a role in local adaptation: they show significant enrichment for SNPs with high Bayes factors for PC1 (table 2) and are characterized by altitudinal clines in haplotype frequency (supplementary fig. S8, Supplementary Material online) reflecting environmental differences such as temperature. Inversion Inv1n is enriched for SNPs with high Bayes factors for PC1, PC3, and PC4, and the clinal patterns observed are broadly consistent with those of Fang et al. (2012). Extreme patterns of precipitation in Crucero Lagunitas and shared ancestry with the distant Oaxacan population of El Rodeo suggest that distinct selection pressures or unusual history may explain the surprising lack of Inv1n in this low-elevation population. Inv4m is associated with a striking peak in differentiation among subspecies (supplementary fig. S17, Supplementary Material online) and is also an extreme FST and FCT outlier when treated as a single locus (fig. 5). This suggests an adaptive role for Inv4m at higher altitudes, consistent with evidence of introgression from mexicana into sympatric maize populations at Inv4m and its rarity in maize outside of the Mexican highlands (Hufford et al. 2013). Interestingly, quantitative trait loci (QTL) for differences between parviglumis and mexicana for pigment intensity and macrohair count—traits thought to be adaptive at high elevations—co-localize to this region as well (Lauter et al. 2004). Inv9d was only polymorphic in mexicana, but showed a strong enrichment of SNPs associated with PC1 (fig. 5C and supplementary fig. S14A, Supplementary Material online). Finally, inversion Inv9e showed 8-fold enrichment of SNPs associating with PC4 (top soil) within mexicana (table 2 and supplementary fig. S14D, Supplementary Material online). While all four inversions show evidence of natural selection, further experiments will be required to identify the precise targets of selection within the inversions and directly measure their effects on traits and fitness.
The number of polymorphic inversions observed here underscores the potential importance of inversions in plant local adaptation (Kirkpatrick 2010). The accumulation of locally adaptive loci in inversions has been predicted by theory (Kirkpatrick and Barton 2006), and is consistent with observations in a number of plant and animal taxa (Dobzhansky 1970; Etges and Levitan 2004; Huynh et al. 2010; Lowry and Willis 2010; Cheng et al. 2012). Although, in principle the observation of high differentiation in inversions could be due to the effects of background selection in regions of low recombination (Charlesworth et al. 1995), SNP correlations with environmental variables and clinal patterns of inversion haplotypes are difficult to explain under a model of background selection.
Although similar patterns of enrichment (excess of BAYENV, FST, and FCT outliers) were observed in some centromere regions, the low levels of recombination observed in maize centromeres (Gore et al. 2009) imply that these results, as well as previous observations of haplotype differentiation (Hufford et al. 2012), may be due to the effects of selection on linked sites (Cutter and Payseur 2013) rather than on centromeres themselves.
Genome-Wide Patterns of Local Adaptation
Candidate SNPs identified here as outliers for environmental association or allele frequency differentiation show a pronounced enrichment in nongenic regions (fig. 6). This result stands in contrast to the genic enrichment found in similar analyses of both Arabidopsis (Hancock et al. 2011a) and human (Hancock et al. 2011b) data. One potential explanation for this difference is that the complexity of maize and teosinte genomes, which are >85% noncoding sequence (Schnable et al. 2009), may provide greater opportunity for the evolution of functional noncoding elements. Indeed, the causal polymorphisms underlying traits including flowering time (Salvi et al. 2007) and branching (Studer et al. 2011) have recently been identified as repeat insertions which impact gene regulation. The functional role of our candidate SNPs is supported by their enrichment among relevant phenotypic traits (supplementary tables S8 and S9, Supplementary Material online). For example, FCT candidates are enriched for SNPs significantly associated with tassel morphology in maize, a major phenotypic difference between parviglumis and mexicana (Iltis and Doebley 1980). Both FCT and FST candidates are enriched for SNPs significantly associated with flowering time, a common form of local adaptation observed among plant populations (Olsson and Ågren 2002; Lowry et al. 2008). We see little evidence that the observed nongenic enrichment is due to biases in the genotyping platform, as differences in heterozygosity between genic (0.275) and nongenic (0.281) SNPs are minor, and both our PHS outliers and some association analyses show enrichment for genic SNPs. These results do not argue that all noncoding DNA is functional; however, a recent genome-wide sampling of noncoding variants finds significantly fewer phenotypic associations than expected (Chia et al. 2012).
Although our candidate loci are not enriched for genes, we nonetheless identify a number of compelling genes showing signals of local adaptation (supplementary table S5, Supplementary Material online). A SNP in the gene b1, for example, correlated with PC1—largely made up of temperature and altitude—among mexicana populations. b1 is a gene in the anthocyanin synthesis pathway that has been identified as a QTL for sheath color differences among mexicana and parviglumis (Lauter et al. 2004). Pigmentation has been suggested to be an important adaptive trait as a response to lower temperatures and changing ultraviolet light conditions (Galinat 1967; Barthakur 1974; Chalker-Scott 1999). Two SNPs in the 3′-UTR of the well-known domestication gene teosinte branched1, a locus known to determine plant architecture in parviglumis populations (Weber et al. 2007, 2008), show unusually strong patterns of differentiation and association with PCs related to temperature range and soil types (supplementary table S5, Supplementary Material online). Other candidate genes with a well-known function in maize include abph1 that controls leaf arrangement (Jackson and Hake 1999) and sh1 that encodes the production of sucrose synthase (Sheldon et al. 1983). Additionally, a synonymous SNP (PZE-104028461) in the maize filtered gene GRMZM2G000471 was both an extreme outlier for FST and associated with PC2 in mexicana. The gene containing this SNP is orthologous to the Arabidopsis gene, At4g10380, NIP5;1, which encodes a boron channel (Kato et al. 2009) and has been identified as a candidate for adaptation to different soil types in Arabidopsis lyrata (Turner et al. 2008).
Finally, environmental association and frequency-based approaches produced sets of candidates with little overlap (supplementary table S6, Supplementary Material online, the overlap between PHS and environmental association was always less than 15 SNPs). Rather than indicating a lack of evidence for local adaptation, we suspect that the weak correspondence between methods is likely reflective of biological factors. For example, SNPs showing environmental association are only weakly correlated with FST and FCT P values (supplementary table S3, Supplementary Material online), a result also reported in trees (Eckert et al. 2010a; Keller et al. 2012). Because FST outlier methods detect excess divergence among populations regardless of the distribution of environmental variation, they are likely more powerful for identification of loci related to factors that do not correlate with environmental variables included here. All of our measured environmental variables are abiotic, but biotic factors (e.g., herbivory, competition) undoubtedly play an important selective role and may not be perfectly reflected in our environmental PCs. For example, the presence of the Zea specialist leafhopper Dalbulus maidis is thought to depend on the local abundance of maize and proximity to bodies of water (Medina et al. 2012), and Moeller and Tiffin (2008) hypothesize localized selection pressure in herbivory as an explanation for evidence of local adaptation in their sequence analysis of immunity genes in parviglumis. Biotic interactions are likely critical in local adaptation in a number of species. For example, local pathogen diversity may play a larger role in human adaptation than climate (Fumagalli et al. 2011). Consistent with the importance of localized selection pressures, we note that many of our FFT outliers, which show less overlap with other methods, nonetheless exhibit patterns of haplotype sharing suggestive of recent positive selection. A full characterization of local environments is, in most instances, unfeasible. Due to this limitation, comprehensive studies of local adaptation should combine methods that assess allele frequency association with measured variables with those that describe extreme population differentiation based on genetic data alone.
Supplementary Material
Supplementary text S1, tables S1–S9 and figures S1–S17 are available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/).
Acknowledgments
The authors thank Jeff Glaubitz for SNP annotations and mapping; Lauren Sagara for technical assistance with genotyping; Shohei Takuno for PHS perl scripts; and Norm Ellstrand, Pesach Lubinsky, and Mark Millard (USDA-ARS GRIN) for seeds. Yaniv Brandvain, Graham Coop, John Doebley, Andrew Eckert, Torsten Günther, Joost van Heerwaarden, and Peter Morrell provided useful comments on an earlier version of this manuscript. This work was supported by National Science Foundation (IOS-0922703 to J.R.-I.); United States Department of Agriculture, National Institute of Food and Agriculture (2009-01864 to J.R.-I.), and Academy of Finland (to T.P.).
Literature Cited
- Barthakur N. Temperature differences between two pigmented types of corn plants. Int J Biometeorol. 1974;18:70–75. [Google Scholar]
- Beaumont MA, Balding DJ. Identifying adaptive genetic divergence among populations from genome scans. Mol Ecol. 2004;13:969–980. doi: 10.1111/j.1365-294x.2004.02125.x. [DOI] [PubMed] [Google Scholar]
- Beaumont MA, Nichols RA. Evaluating loci for use in the genetic analysis of population structure. Proc R Soc Lond B Biol Sci. 1996;263:1619–1626. [Google Scholar]
- Bomblies K, et al. Local-scale patterns of genetic variability, outcrossing, and spatial structure in natural stands of Arabidopsis thaliana. PLoS Genet. 2010;6:e1000890. doi: 10.1371/journal.pgen.1000890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bradburd G, Ralph P, Coop G. Disentangling the effects of geographic and ecological isolation on genetic differentiation. 2013 doi: 10.1111/evo.12193. arXiv:1302.3274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bradbury PJ, et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23:2633–2635. doi: 10.1093/bioinformatics/btm308. [DOI] [PubMed] [Google Scholar]
- Buckler ES, Goodman MM, Holtsford TP, Doebley JF, Sanchez G. Phylogeography of the wild subspecies of Zea mays. Maydica. 2006;51:123–134. [Google Scholar]
- Chalker-Scott L. Environmental significance of anthocyanins in plant stress responses. Photochem Photobiol. 1999;70:1–9. [Google Scholar]
- Charlesworth D, Charlesworth B, Morgan MT. The pattern of neutral molecular variation under the background selection model. Genetics. 1995;141:1619–1632. doi: 10.1093/genetics/141.4.1619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng C, et al. Ecological genomics of Anopheles gambiae along a latitudinal cline in Cameroon: a population resequencing approach. Genetics. 2012;190:1417–1432. doi: 10.1534/genetics.111.137794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chia J-M, et al. Capturing extant variation from a genome in flux: maize HapMap II. Nat Genet. 2012;44:803–807. doi: 10.1038/ng.2313. [DOI] [PubMed] [Google Scholar]
- Clausen J, Keck DD, Hiesey WM. Experimental studies on the nature of species. I. Effect of varied environments on western North American plants. Washington (DC): Gibson Brothers, Inc; 1940. [Google Scholar]
- Cook JP, et al. Genetic architecture of maize kernel composition in the nested association mapping and inbred association panels. Plant Physiol. 2012;158:824–834. doi: 10.1104/pp.111.185033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coop G, Witonsky D, Di Rienzo A, Pritchard JK. Using environmental correlations to identify loci underlying local adaptation. Genetics. 2010;185:1411–1423. doi: 10.1534/genetics.110.114819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cutter AD, Payseur BA. Genomic signatures of selection at linked sites: unifying the disparity among species. Nat Rev Genet. 2013;14:262–274. doi: 10.1038/nrg3425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De A, Durrett R. Stepping-stone spatial structure causes slow decay of linkage disequilibrium and shifts the site frequency spectrum. Genetics. 2007;176:969–981. doi: 10.1534/genetics.107.071464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dobzhansky TG. Genetics of the evolutionary process. New York: Columbia University Press; 1970. [Google Scholar]
- Doebley J, Goodman MM, Stuber CW. Pattens of isozyme variation between maize and Mexican annual teosinte. Econ Bot. 1987;41:234–246. [Google Scholar]
- Doyle GG. Inversions and list of inversions available. In: Freeling M, Walbot V, editors. The maize handbook. New York: Springer-Verlag; 1994. pp. 346–349. [Google Scholar]
- Earl DA, vonHoldt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2012;4:359–361. [Google Scholar]
- Eckert AJ, et al. Patterns of population structure and environmental associations to aridity across the range of loblolly pine (Pinus taeda L., Pinaceae) Genetics. 2010a;185:969–982. doi: 10.1534/genetics.110.115543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eckert AJ, et al. Back to nature: ecological genomics of loblolly pine (Pinus taeda, Pinaceae) Mol Ecol. 2010b;19:3789–3805. doi: 10.1111/j.1365-294X.2010.04698.x. [DOI] [PubMed] [Google Scholar]
- Endler JA. Natural selection on color patterns in Poecilia reticulata. Evolution. 1980;34:76–91. doi: 10.1111/j.1558-5646.1980.tb04790.x. [DOI] [PubMed] [Google Scholar]
- Etges WJ, Levitan M. Palaeoclimatic variation, adaptation and biogeography of inversion polymorphisms in natural populations of Drosophila robusta. Biol J Linn Soc. 2004;81:395–411. [Google Scholar]
- Excoffier L, Hofer T, Foll M. Detecting loci under selection in a hierarchically structured population. Heredity. 2009;103:285–298. doi: 10.1038/hdy.2009.74. [DOI] [PubMed] [Google Scholar]
- Excoffier L, Lischer HEL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10:564–567. doi: 10.1111/j.1755-0998.2010.02847.x. [DOI] [PubMed] [Google Scholar]
- Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics. 2003;164:1567–1587. doi: 10.1093/genetics/164.4.1567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fang Z, et al. Megabase-scale inversion polymorphism in the wild ancestor of maize. Genetics. 2012;191:883–894. doi: 10.1534/genetics.112.138578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feder JL, Nosil P. The efficacy of divergence hitchhiking in generating genomic islands during ecological speciation. Evolution. 2010;64:1729–1747. doi: 10.1111/j.1558-5646.2010.00943.x. [DOI] [PubMed] [Google Scholar]
- Flint-Garcia SA, et al. Maize association population: a high resolution platform for quantitative trait locus dissection. Plant J. 2005;44:1054–1064. doi: 10.1111/j.1365-313X.2005.02591.x. [DOI] [PubMed] [Google Scholar]
- Foll M, Gaggiotti O. A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics. 2008;180:977. doi: 10.1534/genetics.108.092221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ford HD, Ford EB. Fluctuation in numbers, and its influence on variation, in Melitaea aurinia, Rott. (Lepidoptera) Ecol Entomol. 1930;78:345–352. [Google Scholar]
- Fournier-Level A, et al. A map of local adaptation in Arabidopsis thaliana. Science. 2011;334:86–89. doi: 10.1126/science.1209271. [DOI] [PubMed] [Google Scholar]
- Fukunaga K, et al. Genetic diversity and population structure of teosinte. Genetics. 2005;169:2241–2254. doi: 10.1534/genetics.104.031393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fumagalli M, et al. Signatures of environmental genetic adaptation pinpoint pathogens as the main selective pressure through human evolution. PLoS Genet. 2011;7:e1002355. doi: 10.1371/journal.pgen.1002355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galinat WC. Plant habit and the adaptation of corn. Massachusetts Agri Exp Station Bulletin. 1967;565:1–16. [Google Scholar]
- Ganal MW, et al. A large maize (Zea mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome. PLoS One. 2011;6:e28334. doi: 10.1371/journal.pone.0028334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerke JP, Edwards JW, Guill KE, Ross-Ibarra J, McMullen MD. The genomic impacts of drift and selection for hybrid performance in maize. arXiv. 2013;1307.7313 doi: 10.1534/genetics.115.182410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gore MA, et al. A first-generation haplotype map of maize. Science. 2009;326:1115. doi: 10.1126/science.1177837. [DOI] [PubMed] [Google Scholar]
- Goudet J. HIERFSTAT, a package for R to compute and test hierarchical F-statistics. Mol Ecol Notes. 2005;5:184–186. [Google Scholar]
- Grant PR, Grant BR. Unpredictable evolution in a 30-year study of Darwin's finches. Science. 2002;296:707–711. doi: 10.1126/science.1070315. [DOI] [PubMed] [Google Scholar]
- Green RE, et al. A draft sequence of the Neandertal genome. Science. 2010;328:710–722. doi: 10.1126/science.1188021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gregory TR, et al. Eukaryotic genome size databases. Nucleic Acids Res. 2007;35:D332–D338. doi: 10.1093/nar/gkl828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gusev A, et al. Whole population, genome-wide mapping of hidden relatedness. Genome Res. 2009;19:318–326. doi: 10.1101/gr.081398.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hancock AM, et al. Adaptations to climate-mediated selective pressures in humans. PLoS Genet. 2011a;7:e1001375. doi: 10.1371/journal.pgen.1001375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hancock AM, et al. Adaptation to climate across the Arabidopsis thaliana genome. Science. 2011b;334:83–86. doi: 10.1126/science.1209244. [DOI] [PubMed] [Google Scholar]
- Hijmans RJ, Guarino L, Cruz M, Rojas E. Computer tools for spatial analysis of plant genetic resources data: 1. DIVA-GIS. Plant Genet Resour Newsl. 2001;127:15–19. [Google Scholar]
- Hufford MB, Bilinski P, Pyhäjärvi T, Ross-Ibarra J. Teosinte as a model system for population and ecological genomics. Trends Genet. 2012;28:606–615. doi: 10.1016/j.tig.2012.08.004. [DOI] [PubMed] [Google Scholar]
- Hufford MB, Martínez-Meyer E, Gaut BS, Eguiarte LE, Tenaillon MI. Inferences from the historical distribution of wild and domesticated maize provide ecological and evolutionary insight. PLoS One. 2012;7:e47659. doi: 10.1371/journal.pone.0047659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hufford MB, et al. Comparative population genomics of maize domestication and improvement. Nat Genet. 2012;44:808–811. doi: 10.1038/ng.2309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hufford MB, et al. The genomic signature of crop-wild introgression in maize. PLoS Genet. 2013;9:e1003477. doi: 10.1371/journal.pgen.1003477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hung H-Y, et al. ZmCCT and the genetic basis of daylength adaptation underlying the post-domestication spread of maize. Proc Natl Acad Sci U S A. 2012;109:11068–11069. doi: 10.1073/pnas.1203189109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huynh LY, Maney DL, Thomas JW. Chromosome-wide linkage disequilibrium caused by an inversion polymorphism in the white-throated sparrow (Zonotrichia albicollis) Heredity. 2010;106:537–546. doi: 10.1038/hdy.2010.85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iltis HH, Doebley JF. Taxonomy of Zea (Gramineae). II. Subspecific categories in the Zea mays complex and a generic synopsis. Am J Bot. 1980;67:994–1004. [Google Scholar]
- Jackson D, Hake S. Control of phyllotaxy in maize by the abphyl1 gene. Development. 1999;126:315–323. doi: 10.1242/dev.126.2.315. [DOI] [PubMed] [Google Scholar]
- Kato Y, Miwa K, Takano J, Wada M, Fujiwara T. Highly boron deficiency-tolerant plants generated by enhanced expression of NIP5;1, a boric acid channel. Plant Cell Physiol. 2009;50:58–66. doi: 10.1093/pcp/pcn168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kato YTA. Cytological studies of maize (Zea mays L.) and teosinte (Zea mexicana (Schrader) Kuntze) in relation to their origin and evolution. 1975. . Massachusetts Agri Exp Station Bulletin. 635:1–185. [Google Scholar]
- Keller SR, Levsen N, Olson MS, Tiffin P. Local adaptation in the flowering time gene network of balsam poplar, Populus balsamifera L. Mol Biol Evol. 2012;29:3143–3152. doi: 10.1093/molbev/mss121. [DOI] [PubMed] [Google Scholar]
- Kirkpatrick M. How and why chromosome inversions evolve. PLoS Biol. 2010;8:e1000501. doi: 10.1371/journal.pbio.1000501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirkpatrick M, Barton N. Chromosome inversions, local adaptation and speciation. Genetics. 2006;173:419–434. doi: 10.1534/genetics.105.047985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lauter N, Gustus C, Westerbergh A, Doebley J. The inheritance and evolution of leaf pigmentation and pubescence in teosinte. Genetics. 2004;167:1949–1959. doi: 10.1534/genetics.104.026997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Le Corre V, Kremer A. The genetic differentiation at quantitative trait loci under local adaptation. Mol Ecol. 2012;21:1548–1566. doi: 10.1111/j.1365-294X.2012.05479.x. [DOI] [PubMed] [Google Scholar]
- Lewontin RC, Krakauer J. Distribution of gene frequency as a test of the theory of the selective neutrality of polymorphisms. Genetics. 1973;74:175–195. doi: 10.1093/genetics/74.1.175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li YF, Costello JC, Holloway AK, Hahn MW. “Reverse ecology” and the power of population genomics. Evolution. 2008;62:2984–2994. doi: 10.1111/j.1558-5646.2008.00486.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loveless MD, Hamrick JL. Ecological determinants of genetic structure in plant populations. Annu Rev Ecol Syst. 1984;15:65–95. [Google Scholar]
- Lowry DB, Rockwood RC, Willis JH. Ecological reproductive isolation of coast and inland races of Mimulus guttatus. Evolution. 2008;62:2196–2214. doi: 10.1111/j.1558-5646.2008.00457.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lowry DB, Willis JH. A widespread chromosomal inversion polymorphism contributes to a major life-history transition, local adaptation, and reproductive isolation. PLoS Biol. 2010;8:e1000500. doi: 10.1371/journal.pbio.1000500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mano Y, Omori F, Takeda K. Construction of intraspecific linkage maps, detection of a chromosome inversion, and mapping of QTL for constitutive root aerenchyma formation in the teosinte Zea nicaraguensis. Mol Breed. 2012;29:137–146. [Google Scholar]
- Mathieson I, McVean G. Differential confounding of rare and common variants in spatially structured populations. Nat Genet. 2012;44:243–246. doi: 10.1038/ng.1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Medina RF, Reyna SM, Bernal JS. Population genetic structure of a specialist leafhopper on Zea: likely anthropogenic and ecological determinants of gene flow. Entomol Exp Appl. 2012;142:223–235. [Google Scholar]
- Meyerowitz EM, Pruitt RE. Arabidopsis thaliana and plant molecular genetics. Science. 1985;229:1214–1218. doi: 10.1126/science.229.4719.1214. [DOI] [PubMed] [Google Scholar]
- Moeller DA, Tenaillon MI, Tiffin P. Population structure and its effects on patterns of nucleotide polymorphism in teosinte (Zea mays ssp. parviglumis) Genetics. 2007;176:1799–1809. doi: 10.1534/genetics.107.070631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moeller DA, Tiffin P. Geographic variation in adaptation at the molecular level: a case study of plant immunity genes. Evolution. 2008;62:3069–3081. doi: 10.1111/j.1558-5646.2008.00511.x. [DOI] [PubMed] [Google Scholar]
- Oksanen J, et al. vegan: Community Ecology Package. 2011 R package version 1.17-6. [Google Scholar]
- Olsson K, Ågren J. Latitudinal population differentiation in phenology, life history and flower morphology in the perennial herb Lythrum salicaria. J Evol Biol. 2002;15:983–996. [Google Scholar]
- Patterson N, Price AL, Reich D. Population structure and eigenanalysis. PLoS Genet. 2006;2:e190. doi: 10.1371/journal.pgen.0020190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Platt A, et al. The scale of population structure in Arabidopsis thaliana. PLoS Genet. 2010;6:e1000843. doi: 10.1371/journal.pgen.1000843. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poggio L, Rosato M, Chiavarino AM, Naranjo CA. Genome size and environmental correlations in maize (Zea mays ssp. mays, Poaceae) Ann Bot. 1998;82:107–115. [Google Scholar]
- R Core Team. Vienna (Austria): R Foundation for Statistical Computing. 2012. R: a language and environment for statistical computing. [Google Scholar]
- Rosenberg NA. DISTRUCT: a program for the graphical display of population structure. Mol Ecol Notes. 2004;4:137–138. [Google Scholar]
- Ross-Ibarra J, Tenaillon M, Gaut BS. Historical divergence and gene flow in the genus Zea. Genetics. 2009;181:1399–1413. doi: 10.1534/genetics.108.097238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saghai-Maroof MA, Soliman KM, Jorgensen RA, Allard RW. Ribosomal DNA spacer-length polymorphisms in barley: Mendelian inheritance, chromosomal location, and population dynamics. Proc Natl Acad Sci U S A. 1984;81:8014–8018. doi: 10.1073/pnas.81.24.8014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salvi S, et al. Conserved noncoding genomic sequences associated with a flowering-time quantitative trait locus in maize. Proc Natl Acad Sci U S A. 2007;104:11376–11381. doi: 10.1073/pnas.0704145104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sanchez PA, Palm CA, Buol SW. Fertility capability soil classification: a tool to help assess soil quality in the tropics. Geoderma. 2003;114:157–185. [Google Scholar]
- Scheet P, Stephens M. A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Hum Genet. 2006;78:629–644. doi: 10.1086/502802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schnable PS, et al. The B73 maize genome: complexity, diversity, and dynamics. Science. 2009;326:1112–1115. doi: 10.1126/science.1178534. [DOI] [PubMed] [Google Scholar]
- Sheldon E, Ferl R, Fedoroff N, Curtis Hannah L. Isolation and analysis of a genomic clone encoding sucrose synthetase in maize: evidence for two introns in Sh. Mol Gen Genet. 1983;190:421–426. [Google Scholar]
- Siol M, Wright SI, Barratt SCH. The population genomics of plant adaptation. New Phytol. 2010;188:313–332. doi: 10.1111/j.1469-8137.2010.03401.x. [DOI] [PubMed] [Google Scholar]
- Smith JSC, Goodman MM, Lester RN. Variation within teosinte. I. Numerical analysis of morphological data. Econ Bot. 1981;35:187–203. [Google Scholar]
- Smith JSC, Goodman MM, Stuber CW. Variation within teosinte. III. Numerical analysis of allozyme data. Econ Bot. 1984;38:97–113. [Google Scholar]
- St. Onge KR, Palmé AE, Wright SI, Lascoux M. Impact of sampling schemes on demographic inference: an empirical study in two species with different mating systems and demographic histories. 2012 doi: 10.1534/g3.112.002410. G3 2:803–814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Städler T, Haubold B, Merino C, Stephan W, Pfaffelhuber P. The impact of sampling schemes on the site frequency spectrum in nonequilibrium subdivided populations. Genetics. 2009;182:205–216. doi: 10.1534/genetics.108.094904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Storz JF. Using genome scans of DNA polymorphism to infer adaptive population divergence. Mol Ecol. 2005;14:671–688. doi: 10.1111/j.1365-294X.2005.02437.x. [DOI] [PubMed] [Google Scholar]
- Studer A, Zhao Q, Ross-Ibarra J, Doebley J. Identification of a functional transposon insertion in the maize domestication gene tb1. Nat Genet. 2011;43:1160–1163. doi: 10.1038/ng.942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ting YC. Chromosomes of maize-teosinte hybrids. Cambridge (MA): The Bussey Institution of Harvard University; 1964. [Google Scholar]
- Toomajian C, et al. A nonparametric test reveals selection for rapid flowering in the Arabidopsis genome. PLoS Biol. 2006;4:e137. doi: 10.1371/journal.pbio.0040137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Turner TL, Bourne EC, Von Wettberg EJ, Hu TT, Nuzhdin SV. Population resequencing reveals local adaptation of Arabidopsis lyrata to serpentine soils. Nat Genet. 2010;42:260–263. doi: 10.1038/ng.515. [DOI] [PubMed] [Google Scholar]
- Turner TL, Von Wettberg EJ, Nuzhdin SV. Genomic analysis of differentiation between soil types reveals candidate genes for local adaptation in Arabidopsis lyrata. PLoS One. 2008;3:e3183. doi: 10.1371/journal.pone.0003183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Heerwaarden J, et al. Genetic signals of origin, spread, and introgression in a large sample of maize landraces. Proc Natl Acad Sci U S A. 2011;108:1088–1092. doi: 10.1073/pnas.1013011108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Heerwaarden J, et al. Fine scale genetic structure in the wild ancestor of maize (Zea mays ssp. parviglumis) Mol Ecol. 2010;19:1162–1173. doi: 10.1111/j.1365-294X.2010.04559.x. [DOI] [PubMed] [Google Scholar]
- Wang H, Bennetzen JL. Centromere retention and loss during the descent of maize from a tetraploid ancestor. Proc Natl Acad Sci U S A. 2012;109:21004–21009. doi: 10.1073/pnas.1218668109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weber A, et al. Major regulatory genes in maize contribute to standing variation in teosinte (Zea mays ssp. parviglumis) Genetics. 2007;177:2349–2359. doi: 10.1534/genetics.107.080424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weber AL, et al. The genetic architecture of complex traits in teosinte (Zea mays ssp. parviglumis): new evidence from association mapping. Genetics. 2008;180:1221–1232. doi: 10.1534/genetics.108.090134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358–1370. doi: 10.1111/j.1558-5646.1984.tb05657.x. [DOI] [PubMed] [Google Scholar]
- Wilkes HG. Teosinte: the closest relative of maize. Cambridge (MA): The Bussey Institution of Harvard University; 1967. [Google Scholar]
- Wolfgruber TK, et al. Maize centromere structure and evolution: sequence analysis of centromeres 2 and 5 reveals dynamic loci shaped primarily by retrotransposons. PLoS Genet. 2009;5:e1000743. doi: 10.1371/journal.pgen.1000743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wright S, Dobzhansky T. Genetics of natural populations. XII. Experimental reproduction of some of the changes caused by natural selection in certain populations of Drosophila pseudoobscura. Genetics. 1946;31:125–156. doi: 10.1093/genetics/31.2.125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang RC. Estimating hierarchical F-statistics. Evolution. 1998:950–956. doi: 10.1111/j.1558-5646.1998.tb01824.x. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




