Skip to main content
Heredity logoLink to Heredity
. 2021 Apr 22;126(6):929–941. doi: 10.1038/s41437-021-00422-z

Joint analysis of days to flowering reveals independent temperate adaptations in maize

Kelly Swarts 1,2,3,, Eva Bauer 4, Jeffrey C Glaubitz 5, Tiffany Ho 3, Lynn Johnson 5, Yongxiang Li 6, Yu Li 6, Zachary Miller 5, Cinta Romay 5, Chris-Carolin Schön 4, Tianyu Wang 6, Zhiwu Zhang 7, Edward S Buckler 3,8, Peter Bradbury 8
PMCID: PMC8178344  PMID: 33888874

Abstract

Domesticates are an excellent model for understanding biological consequences of rapid climate change. Maize (Zea mays ssp. mays) was domesticated from a tropical grass yet is widespread across temperate regions today. We investigate the biological basis of temperate adaptation in diverse structured nested association mapping (NAM) populations from China, Europe (Dent and Flint) and the United States as well as in the Ames inbred diversity panel, using days to flowering as a proxy. Using cross-population prediction, where high prediction accuracy derives from overall genomic relatedness, shared genetic architecture, and sufficient diversity in the training population, we identify patterns in predictive ability across the five populations. To identify the source of temperate adapted alleles in these populations, we predict top associated genome-wide association study (GWAS) identified loci in a Random Forest Classifier using independent temperate–tropical North American populations based on lines selected from Hapmap3 as predictors. We find that North American populations are well predicted (AUC equals 0.89 and 0.85 for Ames and USNAM, respectively), European populations somewhat well predicted (AUC equals 0.59 and 0.67 for the Dent and Flint panels, respectively) and that the Chinese population is not predicted well at all (AUC is 0.47), suggesting an independent adaptation process for early flowering in China. Multiple adaptations for the complex trait days to flowering in maize provide hope for similar natural systems under climate change.

Subject terms: Evolutionary genetics, Quantitative trait

Introduction

Domesticated plants and animals are an excellent model for understanding the genetic consequences of adaptation to rapidly changing environments, since humans first started to move domesticates from their native ranges starting 10,000 years ago. One of the easiest ways for a plant to adapt to a new environment is to modulate the time until flowering, avoiding obstacles to reproductive success. Maize (Zea mays ssp. mays) was domesticated from the tropical grass teosinte (Zea mays ssp. parviglumis) during the mid-Holocene maximum (9000–5000 BP) (Matsuoka et al. 2002; Piperno et al. 2009; van Heerwaarden et al. 2011), where annual temperatures were higher than any time before the industrial era (Metcalfe et al. 2000). After domestication in central Mexico, humans moved maize across the Americas, reaching the southern Andes by at least 3600 BP (Perry et al. 2006) and moving northward to southern Ontario, Canada by 1500 BP (Smith and Crawford 2002). After Spanish contact with the Americas in the late 1400s, maize was quickly established across the world (Ho 1955; Anderson 1988; Revilla et al. 2003; Rebourg et al. 2003; Dubreuil et al. 2006). Modern breeding in the last 100 years has further complicated these histories with the development of inbred lines, heterotic groups, and the intentional introgression of exotic germplasm into global maize, particularly from the historical US Dent germplasm (Barrière et al. 2006). The resulting global modern maize germplasm is highly diverse, and days to flowering in inbred lines varies from 35 to 120 days (Bennetzen and Hake 2008).

Changes in flowering lead to reproductive isolation, structuring populations so that subsequently selected loci are nested within the historical source population (Rebourg et al. 2001; Brown et al. 2011; Swarts et al. 2017). This study seeks to refine our understanding of the genetic architecture underlying maize flowering by jointly analyzing diverse populations genotyped using whole-genome resequencing data.

Flowering time traits have received much attention in maize due to the quantitative nature of inheritance, high heritabilities (up to 0.96 for days to silking in the USNAM population (Buckler et al. 2009)), and ease of scoring (Thornsberry et al. 2001; Salvi et al. 2002, 2007, 2009; Chardon et al. 2004, 2005; Buckler et al. 2009; Coles et al. 2010; Wang et al. 2010; Xu et al. 2012; Durand et al. 2012; Hung et al. 2012; Romay et al. 2013; Giraud et al. 2014). Only a handful of loci, mostly in the autonomous flowering pathway, have confirmed effects for modulating days to flowering either through mutagenesis or positional cloning, including Rap2.7 (Salvi et al. 2007) and the MITE insertion (Castelletti et al. 2014) at the Vgt1 locus (Ducrocq et al. 2008; Salvi et al. 2002), ZCN8 (Meng et al. 2011; Guo et al. 2018), ID1 (Kozaki et al. 2004; Colasanti et al. 2006), conz1 (Miller et al. 2008), df1 (Muszynski et al. 2006), zfl1 and zfl2 (Bomblies and Doebley 2006), ZmMADS69 (Liang et al. 2019) and the major photoperiod locus, ZmCCT (Hung et al. 2012; Huang et al. 2018). However, independent days to flowering quantitative trait loci (QTL) have been mapped in tens of biparental, recombinant inbred line (RIL), and other structured population designs that mitigate the effects of linked background population structure (Chardon et al. 2004; Salvi et al. 2009), including four NAM populations (Buckler et al. 2009; Lehermeier et al. 2014; Li et al. 2015). Within natural diversity populations (the largest being the Ames Diversity Panel (Romay et al. 2013)), only Vgt1, ZCN8, and ZmCCT of the confirmed loci have common standing variation and are routinely detected in mapping populations (Salvi et al. 2009; Buckler et al. 2009; Coles et al. 2010).

Previous studies (Chardon et al. 2004; Salvi et al. 2009; Buckler et al. 2009; Xu et al. 2012; Mace et al. 2013; Lehermeier et al. 2014; Li et al. 2016) have confirmed that days to flowering is quantitative and highly additive in maize, and that many of the effects target common loci, with allelic series present at many loci; a meta-analysis of 22 linkage mapping and ANOVA studies record a total of 313 QTL that collapsed into 62 consensus regions of the genome (Chardon et al. 2004). Synteny mapping to rice and Arabidopsis resulted in 19 overlapping associations (Chardon et al. 2004), and synteny mapping with sorghum revealed that 92.5% of QTL found in sorghum were <10 Mbp from a corresponding QTL in USNAM (Mace et al. 2013). A later meta-analysis including 29 studies (Salvi et al. 2009) found that 441 significant QTL could be collapsed into 59 genomic regions. Similarly, days to flowering in the first NAM population, USNAM, found over 50 genomic regions with significance, and additionally confirmed the existence of allelic series at common loci. A large multi-environment evaluation of maize flowering in the USNAM, Ames diversity panel, and a newly developed Chinese NAM (CNNAM) (Li et al. 2015) population using low-density genotyping-by-sequencing (GBS) markers identified 130 QTL from linkage mapping in USNAM and CNNAM, of which 40 overlapped between the two populations (Li et al. 2016). This suggests that genetic control for flowering is conserved, and allelic variation evolved early and is at least partially differentiated across populations.

Genomic prediction using Genotypic Best Linear Unbiased Predictors (GBLUP) predicts trait values based on relatedness to phenotyped individuals, and cross-population prediction is typically impossible without shared genetic ancestry (Meuwissen 2009). The EUNAM-Dent and EUNAM-Flint populations, which were developed from European parents selected to be from genetically divergent pools, have little overlap in QTL for flowering (Giraud et al. 2014) and evidence for selection on different pathways between the two panels (Unterseer et al. 2016). Cross heterotic pool predictive accuracies were correspondingly typically close to zero or even negative in most families (Lehermeier et al. 2014). Here, we capitalize on the linkage between selection for time to flowering and source population structure to investigate the genetic architecture underlying flowering in regionally distinct global maize populations.

Materials and methods

Datasets

Five publicly available maize datasets were reanalyzed for this analysis, four Nested Association Mapping (NAM) designs—US (USNAM) (Buckler et al. 2009; McMullen et al. 2009), Chinese (CNNAM) (Li et al. 2015), and two European panels based on the major European heterotic groups, Flint (EUNAM-Flint) and Dent (EUNAM-Dent) (Lehermeier et al. 2014; Giraud et al. 2014)—and one diversity panel (Ames) (Romay et al. 2013) (Table 1). Phenotypes—flowering Days to Anthesis (DTA) for male reproductive parts and Days to Silking (DTS) for female—used were genotypic estimates reported in the original studies; spatially corrected BLUPs from per se evaluations in the Ames, CNNAM, and USNAM populations and spatially corrected BLUEs from testcrosses between the EUNAM-Flint and EUNAM-Dent panels. All of the populations were genotyped at low density in their original study, which we used to anchor whole-genome projection for all individuals. The EUNAM panels were genotyped using an Illumina MaizeSNP50 BeadChip (Ganal et al. 2011), and the other panels were genotyped with genotyping-by-sequencing (GBS) (Elshire et al. 2011). Whole-genome genotypes from maize Hapmap 3.2.1 (Bukowski et al. 2016) were imputed using K-Nearest Neighbor imputation (KNNi) (Money et al. 2015) with an overall accuracy of 0.988 and a minor allele imputation accuracy of 0.94 for imputed genotypes, then haplotypes projected onto all populations. Hapmap 3.21 (Bukowski et al. 2016) was called on 1268 inbred genotypes from across the world, with highly variable depth of coverage, and paralogous sites were retained, as they provide signal in GWAS. Because paralogous sites were retained in Hapmap 3.21, we used KNNi (Money et al. 2015) to impute, which was robust to high error rates in genotype calling, but KNNi over-imputes missing data to the major allele.

Table 1.

Population information.

Population Type (N families) Filt. KNNi sites N taxa (DTA) N taxa (DTS) Trial locations and years
USNAM NAM (25) 39,302,149 4758 4758 Clayton, North Carolina, USA—(35.39°N, 78.27°W)—2006, 2007; Aurora, New York, USA—(42.45°N, 76.42°W)—2006, 2007; Columbia, Missouri, USA—(38.95°N, 92.33°W)—2006, 2007; and Urbana, Illinois, USA—(40.11°N, 88.21°W)—2006, 2007
Ames Diversity 34,967,601 2312 2279 Ames, Iowa, USA—(42.01°N, 93.37°W)—2010; Clayton, North Carolina, USA—(35.39°N, 78.27°W)—2010; Aurora, New York, USA—(42.45°N, 76.42°W)—2010
CNNAM NAM (11) 27,047,718 1928 1928 Beijing, Beijing, China—(39.48°N, 116.28°E)—2009, 2010; Xinxiang, Henan, China—(35.19°N, 113.53°E)—2009,2010; Urumqi, Xinjiang, China—(43.47°N, 87.39°E)—2009, 2010
EUNAM-Dent NAM (10) 23,766,799 841 841 Mons, France—(43.41°N, 6.42°E)—2011; Einbeck, Germany—(51.49°N, 9.52°E)—2011; Wadersloh, Germany—(51.44°N, 8.15°E)—2011; Roggenstein, Germany—(49.38°N, 12.15°E)—2011
EUNAM-Flint NAM (11) 25,388,592 811 811 Ploudaniel, France—(48.32°N, 4.18°E)—2011; Eckartsweier, Germany - (48.31°N, 7.51°E)—2011; Wadersloh, Germany—(51.44°N, 8.15°E)—2011; Roggenstein, Germany—(49.38°N, 12.15°E)—2011; La Coruña, Spain—(43.21°N, 8.42°E)—2011

GBS-genotyped populations were projected using FILLIN (Swarts et al. 2014), and EUNAM was projected using a custom implementation of FSFHap (Swarts et al. 2014). KNNi, FILLIN, and FSFHap all use implementations in TASSEL (Bradbury et al. 2007). For GBS populations, projection was anchored by 465,085 consensus sites—where the physical positions match and the major/minor alleles are shared—between Hapmap 3 and GBS. Projection accuracy as calculated by the correlation between masked known and imputed consensus SNPs was r = 0.99 overall between masked and subsequently imputed genotypes (0.96 for minor alleles). For EUNAM, the parental haplotype breakpoints were imputed for each of the progeny using FSFHap. TASSEL used those breakpoints to project the Hapmap 3.21 genotypes of the parents onto the progeny. While most of the parents of these NAM populations were completely inbred, there were a minority that had residual heterozygosity, which can produce families with three haplotypes segregating. Projected datasets were then filtered using appropriate parameters for the family structure of each population, to ensure a minimum of at least ten minor alleles for any given site in a population; NAM populations were filtered so that one family must have a minimum minor allele frequency (MAF) of 0.1 (which controls for the parental residual heterozygosity), giving a minimum MAF of 0.02 for the population as a whole, and Ames was filtered for minimum MAF of 0.015. Any sites with a maximum heterozygosity above 0.02 or coverage below 0.3 were removed. Before calculating kinships, any residual missing genotypes were assigned a homozygous genotype randomly drawn from the genotypic frequency distribution at each site, by family if appropriate.

Multidimensional scaling (MDS) of American landraces and NAM parents

It is often unclear how inbred lines fit within an adaptive evolutionary context, since breeding programs cross and select on progeny from unrelated individuals. We performed joint MDS analysis with American landraces from Takuno et al. (2015) spanning the two American temperate–tropical gradients with the parents of the four NAM populations to understand the distributions of the parents across American temperate adaptation. Each landrace individual was included ten times in the IBS distance matrix used to calculate the MDS coordinates so that the first two coordinates reflect the relatedness between landrace individuals. Landraces and parents for CNNAM were natively genotyped using GBS, and EUNAM and USNAM projected in Hapmap 3.21 coordinates. MDS was based on 465,085 consensus sites between Hapmap 3 and GBS (cmdscale() in R, using an IBS distance matrix generated in TASSEL)

Cross-population prediction

Cross-population prediction was performed to better understand how populations were related to each other with respect to shared genetic architecture for days to flowering. Cross-population prediction was performed using GBLUP as implemented in TASSEL (Bradbury et al. 2007). The training and test populations were combined in a single kinship (similarity) matrix, calculated using the Centered-IBS implementation in TASSEL (Endelman and Jannink 2012) and phenotypes for the test population masked so that the model was trained solely from the training population phenotypes. This method scales such that the mean diagonal elements estimate 1 + f, accommodating the divergence from Hardy–Weinberg equilibrium inherent in structured populations. Predictions from the resulting model for the test phenotypes were then correlated with the true phenotypes (the “predictive ability”) in R using the Pearson method in cor.test() in the stats package. We use predictive ability, rather than prediction accuracy, the predictive ability divided by the square of the heritability, because heritabilities for flowering traits are quite high (all greater than 0.8) and it is more conservative. Predictive ability based on genomic subsets—coding sequence, three and five prime regions, introns and intergenic—were calculated following Rodgers-Melnick et al. (2016).

Genome-wide Association Study (GWAS)

To determine associations between individual SNPs and flowering traits, we ran a series of genome-wide association models with increasing control for population structure, which is highly confounded with biological association for days to flowering. All models were conducted in TASSEL (Bradbury et al. 2007). Model A was a fixed effects model using the unmodified phenotypes from the original publications, uncontrolled for population structure, because flowering is not only correlated with population structure, but it acts to differentiate populations. Thus, many of the regions that differentiate populations may also control regions important for temperate adaptation and we captured these in the uncorrected model. We also lightly controlled for population structure in another fixed effects model, model B, by fitting five MDS coordinates calculated from an IBS distance matrix in TASSEL for the Ames panel, or fitting a family term for the structured populations in a TASSEL GLM for the NAM populations. Models C and D used a two-step modified mixed linear model framework for computational efficiency. We first calculated residuals in the TASSEL MLM from models incorporating (1) only a kinship matrix calculated in TASSEL using the Centered-IBS method (VanRaden 2008; Endelman and Jannink 2012) as a random effect (model C), or (2) both a kinship matrix as random and five MDS coordinates (for Ames) or a family term (for NAM populations) as fixed effects (model D). The residuals from these models were used as the response in a TASSEL GLM. We do not include the two-step GLM results in machine learning analyses because we found that, due to the high correlation between population structure and flowering time, fully controlling for population structure reduced power to detect even well-known flowering loci such as Vgt1 or ZmCCT. Models A and B were used for subsequent machine learning as the “no population-structure correction” GWAS (model A) and the “population-structure-corrected” GWAS (model B).

Machine learning analyses with GWAS results as the response

We employ the RandomForestClassifier method in Spark (Zaharia et al. 2010) in a Databricks environment, classifying additive p values from GWAS results for model training such that the smallest 1% are classified as 1, or true positives, and, to not overwhelm the true positives, only two million random results from the bottom 95% are classed as 0, or true negatives. We cross-validated by using the other nine chromosomes to predict the tenth. A classifier rather than a regression-based approach was chosen, because we are only interested in how the top regions rank relative to all the others, and a regression approach gives more weight to the insignificant results because they vastly outnumber the top hits. P values for SNPs not reaching filtering criteria in a given dataset were assigned to 1, before ranking results, because a SNP that does not meet MAF filtering criteria will not be highly significant, even if biologically causative. Accuracy is reported as area under the curve (AUC), the integration of the receiver operating characteristic (ROC) curve (for receiver operating characteristic). ROC curves display the false positive rate on the x-axis against the true positive rate on the y-axis for each unique raw predicted value reported by the classifier model, as the prediction for each classified instance is reported on a continuous random variable. An AUC of 0.5 is equivalent to guessing. Overall AUC is simply the average AUC for the ten chromosomal tests. Top predictors were reported as the average mean ranking across all ten tests. Within a given test, predictors are ranked by which splits they participate in for a given tree, averaged across all trees.

Results

This study combines five populations, four NAM designs, and a diversity panel, totaling over ten thousand individuals (Table 1). These populations were previously genotyped using GBS or an Illumina array and imputed with whole-genome resequencing using FILLIN and FSFHap, respectively. This resulted in 70 million segregating markers across all populations (Table 1). The populations have variable genetic overlap based on the first two MDS coordinates, where Ames, as expected of a diversity panel, shows the greatest genetic diversity and CNNAM and EUNAM-Flint are the most isolated (Fig. 1). Only half of the SNPs that survive filtering are shared between study populations (Fig. S1), and the allele frequencies at those loci are highly variable (Fig. S2).

Fig. 1. Relatedness between the five study populations.

Fig. 1

Metric multidimensional scaling (MDS) of study populations from 70 million projected Hapmap 3.21 segregating SNPs (generating with cmdscale function in R, using an isolation-by-state (IBS) distance matrix generated in TASSEL).

Evaluation of relatedness between populations by MDS analysis and phenotypic comparison

To better understand the global genetic relationships between the founders of the four NAM populations, Fig. 2 shows the genetic relatedness of the NAM-type population parents to a panel of GBS-genotyped landraces from across the Americas (Ames includes all of the USNAM parents). In this context, the NAM founders overwhelmingly cluster on the North American temperate–tropical gradient, rather than the South American gradient. CNNAM and EUNAM-Dent panel show a more restricted geographic origin for the founder germplasm, while the other populations contain parents with a greater mix of American tropical and temperate origins.

Fig. 2. MDS of GBS genotypes of American landraces from Takuno et al. (Takuno et al. 2015), replicated 10X each to drive the first two coordinates, and NAM population parents.

Fig. 2

The closest USNAM parents to EUNAM-Dent are the temperate US Dents, including B73, B97, Oh7B, Ky21, and MS71, while the nearest US inbreds to CNNAM parents are South African inbreds M162W, M37W, and the Texas US line, Tx303. All of these are more proximal to the tropical Mexican landraces. The EUNAM-Flint recurrent parent, the Iodent/Iowa Stiff Stalk Synthetic EUNAM-Dent line UH007, clusters with the majority of the EUNAM-Flint parents. The Spanish lines EP44, EZ5, and the French line F64 are the most tropically associated lines in the EUNAM-Dent population.

The five populations vary with respect to both the mean and the total variance for flowering (Fig. 3). This is the result of both genetic variance within the populations, and environmental effects of the evaluation regions. Because normalizing heat unit measures such as growing degree days are not available for all populations, we limit GWAS analyses to within population measures. For additive cross-population prediction, the differences across population are less relevant because accuracy is a function of the Pearson correlation between real and predicted values and are not scale dependent.

Fig. 3. Distribution of reported spatially corrected phenotypes for days to anthesis.

Fig. 3

The distributions are affected by the trial locations, e.g. EUNAM was grown in Europe which has less heat units and thus as a population takes longer to flower than it would if it were grown in China. Because of the lack of shared trial locations, the means and variances cannot be directly compared but ather serve to demonstrate that all of the populations have variation for days to flowering, and highlight that the highest diversityAmesInbred panel contains the most phenotypic variance, overlapping with the four NAM families.

Cross-population prediction of days to flowering

We performed cross-population prediction to better understand how the genetic basis for flowering is structured across these populations. Cross-population predictions were generated pairwise with GBLUP in TASSEL across all genomic SNPs (Fig. 4). GBLUP integrates over all of the SNPs in a covariance structure modeling relatedness between individuals to explain variation in the trait of interest (Meuwissen 2009). Thus, for high predictive ability, the phenotypic variation must be closely correlated with the relatedness between individuals. In addition, high predictability requires that the test contains a subset of the diversity represented in the training set.

Fig. 4. GBLUP cross-population predictive abilities for DTA using all 70 million segregating SNPs in Hapmap 3.21.

Fig. 4

Genetic similarity matrix generated using the Centered-IBS method in TASSEL. Predictive abilities are noted in gray text within plots.

Ames has the best cross-population predictive abilities (given throughout as the Pearson correlation between observed and predicted) overall, and unsurprisingly given the close relationship between Ames and USNAM, the best predictive ability is when the model is trained on Ames and predicts DTA in USNAM (r = 0.78; 0.67 for the reverse). Ames predicted all of the populations better than the others, with the exception of CNNAM, which might be expected since Ames is a diversity panel and suggests that Ames is a superset to everything but CNNAM (Chinese lines are present, but poorly represented in Ames (Romay et al. 2013)). CNNAM, uniquely, not only was predicted poorly by the other populations but also did not predict any population well. The EUNAM-Dent population also had poor cross-population predictive ability, but was well predicted by Ames. Both of these populations also show a marked non-linear correlation in predicting Ames, suggesting that a large subset of the Ames population is not represented in these panels. The EUNAM-Flint population is well predicted by Ames and USNAM, but especially when predicted by USNAM, the high overall correlation is especially driven by a relatively small set of lines in the top right of the plot descended from primarily three families descending from the Spanish lines EP44, EZ5, and the French line F64. These three parents are the most “tropical” of the EUNAM-Flint lines on the N. American gradient (Fig. 2).

Theoretically, including only causal polymorphisms should generate stable predictions independent of population specific linkage patterns if the test and training population share a genetic basis (Meuwissen 2009). To this end, we looked at predictions using only SNPs from the functionally annotated regions of the genome as defined in Rodgers-Melnick et al. (2016) and Wallace et al. (2014), and found that subsets can sometimes improve predictions if genomic predictive ability was low, but never if the accuracy was already high (Fig. S3). An exception where a well predicted population is further improved by a subset is when the EUNAM-Flint population predicts Ames or USNAM; in both of these cases predictive ability is already high, and genic or open chromatin subsets further increased accuracy.

Because predictive ability is a function of relatedness, we calculated FST statistics using the Weir and Cockerham method in vcftools (Danecek et al. 2011) for all pairs of populations. Figure 5 shows a correlation of −0.41 for FST and predictive ability for DTA across all markers, confirming a relationship between close relatedness and high predictive ability, but there are outliers that do not fit the expected pattern. Figure 5 shows that CNNAM and EUNAM-Dent generally have lower cross-population predictions than expected by population relatedness, with the exception that EUNAM-Dent is predicted similar to expectation by Ames. In contrast, EUNAM-Flint has higher than expected predictive ability with USNAM.

Fig. 5. Pairwise population differentiation, FST and cross-population predictive ability (r) for DTA.

Fig. 5

High predictive ability is significantly correlated (α < 0.05) with low population differentiation, but many population pairs have higher or lower predictive ability than expected based on relatedness.

Random forest predictions of GWAS for flowering loci across the genome by population

To maximize the influence of selection from background drift on prediction, we examine cross-population predictability using a Random Forest Classifier where only the top GWAS results are coded as “true” (SNPs not segregating in a given population were given a p value of 1 and coded as “false”). GWAS results were run for each population, and because changes in flowering time generate population structure (Swarts et al. 2017), we performed GWAS with no population-structure correction and minimal population-structure correction (a family term for NAM populations, and five MDS coordinates for Ames), which is designed to further enrich top GWAS loci for biological (rather than population-structure artifacts) loci underlying flowering (Figs. 6 and S4). Without population-structure control the results are similar to the cross-population predictions, where Ames and USNAM are well predicted, mostly by each other. EUNAM-Flint is predicted to a lesser extent by Ames, followed by USNAM, and EUNAM-Dent is predicted equally well, but by EUNAM-Flint. CNNAM is predicted less well than by chance. After accounting for population structure, no population is well predicted by the others, and only Ames has a positive predictability.

Fig. 6. Average area under the curve (AUC, in bold)—the false positive rate (FPR) to true positive rate (TPR) ratio—and predictor rankings across all chromosomes from Random Forest Classifier for GWAS results between populations to evaluate overlap in GWAS results between populations.

Fig. 6

The heavy line represents the mean of the chromosomal receiver operating characteristic curves, sampled at 0.01 intervals. Predictors are equivalent GWAS (with or without population-structure correction) for the other populations. Trained on nine chromosomes and tested on the 10th. Not all chromosomes for the population-structure-corrected results have top predictors, and are excluded from average AUC calculations.

To explicitly test if the North American early flowering adaptation (Swarts et al. 2017) was the source germplasm for early flowering, we also tested the GWAS results against measures of population differentiation between three different N. American germplasm pools, tropical, temperate, and Northern Flint; the temperate germplasm results from the admixture between the southern Dent and N. Flint pools (Figs. 7 and S5). Confirming the inference from the cross-population predictions, Ames is predicted very well (overall AUC of 0.89) by the tropical contrasts, followed closely by USNAM (0.85), with EUNAM-Flint well predicted at 0.67, EUNAM-Dent rather poorly predicted at 0.59 and CNNAM slightly negatively predicted with an AUC score of 0.47. Population-structure control with a family term for the NAMs and five MDS coordinates for Ames reduced predictive ability by population differentiation, but did not eliminate the effects, confirming that flowering time is closely tied to population structure.

Fig. 7. Average area under the curve (AUC, in bold)—the false positive rate (FPR) to true positive rate (TPR) ratio—and predictor rankings across all chromosomes from Random Forest Classifier with GWAS additive p values as the response, and N.

Fig. 7

American FSTs as the predictors. The heavy black line represents the mean of the chromosomal receiver operating characteristic curves, sampled at 0.01 intervals. Predictors are equivalent GWAS (with or without population-structure correction) for the other populations. Trained on nine chromosomes and tested on the 10th. Not all chromosomes for the population-structure-corrected results have top predictors, and are excluded from average AUC calculations.

Discussion

Complex traits, such as yield, are well-known to aggregate fitness effects across all pathways and systems of a plant, but this study highlights that this is also true for days to flowering (Li et al. 2016). When a plant is induced to flower depends on the alleles present at ZmCCT, ZCN8, and Vgt1, but it also depends on the pleiotropic response of hundreds of other loci responding to signals for heat, circadian signaling, light quality, moisture availability, starch accumulation, and others (Tian et al. 2011; Brown et al. 2011). Limited overlap in mapping results suggests that, as maize spread across the world during improvement and modern hybrid breeding, complex population dynamics may have led to differential selection for secondary pathways implicated in temperate adaptation.

These patterns are found in Arabidopsis thaliana as well. Fournier-Level et al. find evidence for selection on different, pleiotropic QTL in different environments across Europe in a common garden experiment (Fournier-Level et al. 2013). Additionally, Kover et al. (2009) find that adaptation to early flowering is partially conditional on other conditions in the selection environment. Finally, only two out of a total of 10 QTL were shared between Swedish and Italian RIL populations (Dittmar et al. 2014), similar to findings in maize.

That some populations, namely EUNAM-Flint and CNNAM, do not predict Ames as expected based on population differentiation suggests more complicated population dynamics than simple relatedness (Fig. 5). What better explains the patterning in predictive ability is the spread of the population founders on the North American temperate/tropical gradient (Fig. 2). The two populations with a localized geographic origin on this gradient, CNNAM and EUNAM-Dent, are the two populations with more limited cross-population predictive ability, and low predictive ability for GWAS results in machine learning. In contrast, EUNAM-Flint, which has higher than expected cross-population predictive ability when predicting USNAM, has parents spanning the tropical/temperate North American gradient, and FSTs between Northern Flints and tropical American germplasm from Hapmap 3.21 are top predictors in machine learning against uncorrected GWAS results. Lower than expected predictive ability relative to population differentiation could be interpreted in two ways; a narrow germplasm base or, if the population has high variance for flowering time, that these populations contain novel temperate adaptation not captured on either American temperate/tropical axis.

The history of germplasm introduction can shed light on the discrepancies between predictive ability and population differentiation especially for the EUNAM-Dent and CNNAM. The earliest germplasm in Europe was Caribbean in origin, and subjected to selection for early flowering upon entry into Spain (Rebourg et al. 2003). However, it was only after the introduction of the Northern Flint landraces in the mid-1600s from the northeast of the modern US that maize agriculture spread to European climates north of the Pyrenees (Revilla et al. 2003; Rebourg et al. 2003; Dubreuil et al. 2006). Genetic evidence from European maize suggests that most of the early flowering adaptation was acquired from the North American Northern Flint germplasm introduced in the 1600s (Rebourg et al. 2003; Dubreuil et al. 2006), but also that there are unique rare alleles in the Southern European (Spanish and Portuguese) germplasm (Revilla et al. 2003). Finally, in the past century, the development of the European heterotic groups introduced primarily American Iodent, Stiff Stalk, Lancaster, and Minnesota germplasm into Europe, captured primarily in the EUNAM-Dent panel (Dubreuil et al. 2006; Troyer and Hendrickson 2007).

The recurrent parent of the EUNAM-Dent population is representative of the agronomically important Iodent germplasm (derived from an Iodent/Iowa Stiff Stalk Synthetic cross), and the additional lines in the EUNAM-Dent panel derive from the US Stiff Stalk, US Lancaster, and Hohenheim Dent populations (Doebley et al. 1988; Bauer et al. 2013). The Hohenheim Dents were bred from US temperate germplasm, and especially the early flowering Minnesota lines (Technow et al. 2014), which result from crosses between the US Southern Dents and Northern Flints, followed by selection for extreme temperate adaptation, with early flowering contributed by the Northern Flints (Troyer and Hendrickson 2007).

EUNAM-Dent is well predicted by Ames, but not USNAM; Ames is enriched for temperate US germplasm, including Iodent and Minnesota lines (Romay et al. 2013), while USNAM is not (McMullen et al. 2009). It is not surprising that Ames then predicts the EUNAM-Dent panel, as it is a superset, but it is perhaps surprising that the Ames GWAS results predict the EUNAM-Dent germplasm so poorly in machine learning, given that the pedigree suggests that temperate adaptation in the EUNAM-Dent panel is Northern Flint in origin. EUNAM-Dent has low narrow-sense heritabilities for flowering relative to Ames (0.92 for DTA (Romay et al. 2013)) or USNAM (0.94 for DTA and DTS (Buckler et al. 2009)), 0.7 (DTS) and 0.61 (DTA), which can result from fixation of alleles in a population and may explain this discrepancy. In addition, Giraud et al. (2014) found fewer QTL in the EUNAM-Dent relative to the EUNAM-Flint panel, and these QTL explain less variance. In our results, the machine learning using both cross-population and FST results find no significant loci on many chromosomes, especially in EUNAM-Dent after population-structure correction (Supplementary Figs. S4 and S5), suggesting that many of the flowering loci are fixed in the population. Alternately, poor overlap and machine learning prediction could be a result of recent selection for temperate adaptation in the development of the Minnesota germplasm and subsequent selection in Europe, which is not sufficiently represented in the other datasets.

Like early European germplasm, the first maize introduced into China was tropical in origin. The most likely first point of entry for maize to China was by Spanish and Portuguese traders through the Port of Macau, and these are reported to have been humid, tropical adapted varieties (Ho 1955; Anderson 1988). A regular trading route between Acapulco and Manila, which started in CE 1565, could have introduced western lowland Mexican maize to Asia (Schurz 1939). Although Chinese maize now contains both early and late flowering varieties, unlike the European germplasm, there is no record of later introduction of American temperate adapted material until the past century.

Almost all of the CNNAM parents derive from Chinese sources across heterotic groups, and the recurrent parent is a derivative of the temperate Chinese landrace TangPiSingTou (Lu et al. 2009). While most of the CNNAM parents are temperate adapted (Li et al. 2016), they are significant at the major photoperiod locus ZmCCT, confirming that the population contains tropical alleles. In addition, broad sense heritabilities for CNNAM were high (0.91 and 0.90 for DTA and DTS, respectively), so the genetic basis for days to flowering is not narrow. High heritability and broad phenotypic variability suggest that the CNNAM population does not suffer from a narrow germplasm base. Thus, poor cross-population prediction across all populations indicates that CNNAM contains novel alleles for temperate adaptation. Although novel targets are consistent with these data, Li et al. (2016) find that 40 out of 130 independently identified QTL overlap between between CNNAM and USNAM, suggesting that many of these novel variants are allelic variants at common loci, consistent with the “common gene” mode of adaptation inferred from USNAM alone (Buckler et al. 2009).

As the CNNAM panel is dominated by lines whose origins likely predate modern hybrids of the 20th century, these results suggest an indigenous temperate adaptation to China of independent origin relative to the other four populations. As CNNAM germplasm is not extensively utilized outside of China, this study highlights a potential source of novel alleles for breeding programs. Moreover, maize suggests a roadmap for understanding the characteristics important for resiliency in natural species subject to changing climate. Per these results, there is evidence for independent adaptation in maize in the Americas and China, and partial adaptation in Europe (Rebourg et al. 2003) but why was maize able to accomplish these feats? Maize is historically highly outcrossing resulting in large effective population sizes, with high levels of genetic diversity (Hufford et al. 2012), which theoretically allows for efficient selection (Bürger and Lande 1994). To better understand the impacts of climate change on a diversity of natural systems, understanding past adaptations in other domesticates will provide a clearer picture of the population parameters underlying resiliency.

Data archiving

Phenotypes and imputed genotypes will be made available upon request.

Supplementary information

Acknowledgements

The authors thank Robert Bukowski from BRC Bioinformatics Facility, Institute of Biotechnology at Cornell University for early access to the KNNi imputed HapMap 3 dataset, and Jeffrey Ross-Ibarra and Matt Hufford for early access to the American landraces. We also thank Laura Morales for discussions and reviewing earlier drafts of the manuscript. This work was supported by National Science Foundation Grants IOS-0922493 and IOS-1238014 and the US Department of Agriculture–Agricultural Research Service.

Compliance with ethical standards

Conflict of interest

The authors declare no competing interests.

Footnotes

Associate editor: Yuan-Ming Zhang

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1038/s41437-021-00422-z.

References

  1. Anderson EN (1988) The food of China. Yale University Press
  2. Barrière Y, Alber D, Dolstra O, Lapierre C, Motto M, Ordás Pérez A, et al. Past and prospects of forage maize breeding in Europe. II. History, germplasm evolution and correlative agronomic changes. Maydica. 2006;51:435–449. [Google Scholar]
  3. Bauer E, Falque M, Walter H, Bauland C, Camisan C, Campo L, et al. Intraspecific variation of recombination rate in maize. Genome Biol. 2013;14:R103. doi: 10.1186/gb-2013-14-9-r103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bennetzen JL, Hake SC (2008) Handbook of maize: its biology. Springer Science & Business Media
  5. Bomblies K, Doebley JF. Pleiotropic effects of the duplicate maize FLORICAULA/LEAFY genes zfl1 and zfl2 on traits under selection during maize domestication. Genetics. 2006;172:519–531. doi: 10.1534/genetics.105.048595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23:2633–2635. doi: 10.1093/bioinformatics/btm308. [DOI] [PubMed] [Google Scholar]
  7. Brown PJ, Upadyayula N, Mahone GS, Tian F, Bradbury PJ, Myles S, et al. Distinct genetic architectures for male and female inflorescence traits of maize. PLoS Genet. 2011;7:11. doi: 10.1371/journal.pgen.1002383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Buckler ES, Holland JB, Bradbury PJ, Acharya CB, Brown PJ, Browne C, et al. The genetic architecture of maize flowering time. Science. 2009;325:714–718. doi: 10.1126/science.1174276. [DOI] [PubMed] [Google Scholar]
  9. Bukowski R, Guo X, Lu Y, Zou C, He B, Rong Z et al. (2016) Construction of the third generation Zea mays haplotype map. bioRxiv: 026963. [DOI] [PMC free article] [PubMed]
  10. Bürger R, Lande R. On the distribution of the mean and variance of a quantitative trait under mutation-selection-drift balance. Genetics. 1994;138:901–912. doi: 10.1093/genetics/138.3.901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Castelletti S, Tuberosa R, Pindo M, Salvi S. A MITE transposon insertion is associated with differential methylation at the maize flowering time QTL Vgt1. G3. 2014;4:805–812. doi: 10.1534/g3.114.010686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chardon F, Hourcade D, Combes V, Charcosset A. Mapping of a spontaneous mutation for early flowering time in maize highlights contrasting allelic series at two-linked QTL on chromosome 8. Theor Appl Genet. 2005;112:1–11. doi: 10.1007/s00122-005-0050-z. [DOI] [PubMed] [Google Scholar]
  13. Chardon F, Virlon B, Moreau L, Falque M, Joets J, Decousset L, et al. Genetic architecture of flowering time in maize as inferred from quantitative trait loci meta-analysis and synteny conservation with the rice genome. Genetics. 2004;168:2169–2185. doi: 10.1534/genetics.104.032375. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Colasanti J, Tremblay R, Wong AYM, Coneva V, Kozaki A, Mable BK. The maize INDETERMINATE1 flowering time regulator defines a highly conserved zinc finger protein family in higher plants. BMC Genom. 2006;7:158. doi: 10.1186/1471-2164-7-158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Coles ND, McMullen MD, Balint-Kurti PJ, Pratt RC, Holland JB. Genetic control of photoperiod sensitivity in maize revealed by joint multiple population analysis. Genetics. 2010;184:799–812. doi: 10.1534/genetics.109.110304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–2158. doi: 10.1093/bioinformatics/btr330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dittmar EL, Oakley CG, Ågren J, Schemske DW. Flowering time QTL in natural populations of Arabidopsis thaliana and implications for their adaptive value. Mol Ecol. 2014;23:4291–4303. doi: 10.1111/mec.12857. [DOI] [PubMed] [Google Scholar]
  18. Doebley J, Wendel JD, Smith JSC, Stuber CW, Goodman M. The origin of cornbelt maize: the isozyme evidence. Econ Bot. 1988;42:120–131. [Google Scholar]
  19. Dubreuil P, Warburton M, Chastanet M, Hoisington D, Charcosset A. More on the introduction of temperate maize into Europe: large-scale bulk SSR genotyping and new historical elements. Maydica. 2006;51:281–291. [Google Scholar]
  20. Ducrocq S, Madur D, Veyrieras J-B, Camus-Kulandaivelu L, Kloiber-Maitz M, Presterl T, et al. Key Impact of Vgt1 on flowering time adaptation in maize: evidence from association mapping and ecogeographical information. Genetics. 2008;178:2433–2437. doi: 10.1534/genetics.107.084830. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Durand E, Bouchet S, Bertin P, Ressayre A, Jamin P, Charcosset A, et al. Flowering time in maize: linkage and epistasis at a major effect locus. Genetics. 2012;190:1547–1562. doi: 10.1534/genetics.111.136903. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE. 2011;6:e19379. doi: 10.1371/journal.pone.0019379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Endelman JB, Jannink J-L. Shrinkage estimation of the realized relationship matrix. G3. 2012;2:1405–1413. doi: 10.1534/g3.112.004259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fournier-Level A, Wilczek AM, Cooper MD, Roe JL, Anderson J, Eaton D, et al. Paths to selection on life history loci in different natural environments across the native range of Arabidopsis thaliana. Mol Ecol. 2013;22:3552–3566. doi: 10.1111/mec.12285. [DOI] [PubMed] [Google Scholar]
  25. Ganal MW, Durstewitz G, Polley A, Bérard A, Buckler ES, Charcosset A, et al. A large maize (Zea mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome. PLoS ONE. 2011;6:e28334. doi: 10.1371/journal.pone.0028334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Giraud H, Lehermeier C, Bauer E, Falque M, Segura V, Bauland C, et al. Linkage disequilibrium with linkage analysis of multiline crosses reveals different multiallelic QTL for hybrid performance in the flint and dent heterotic groups of maize. Genetics. 2014;198:1717–1734. doi: 10.1534/genetics.114.169367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Guo L, Wang X, Zhao M, Huang C, Li C, Li D, et al. Stepwise cis-regulatory changes in ZCN8 contribute to maize flowering-time adaptation. Curr Biol. 2018;28:3005–3015.e4. doi: 10.1016/j.cub.2018.07.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. van Heerwaarden J, Doebley J, Briggs WH, Glaubitz JC, Goodman MM, de Jesus Sanchez Gonzalez J, et al. Genetic signals of origin, spread, and introgression in a large sample of maize landraces. Proc Natl Acad Sci USA. 2011;108:1088–1092. doi: 10.1073/pnas.1013011108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ho P-T. The introduction of American Food Plants into China. Am Anthropol. 1955;57:191–201. [Google Scholar]
  30. Huang C, Sun H, Xu D, Chen Q, Liang Y, Wang X et al. (2018) ZmCCT9 enhances maize adaptation to higher latitudes. Proc Natl Acad Sci 115:E334–E341 [DOI] [PMC free article] [PubMed]
  31. Hufford MB, Xu X, van Heerwaarden J, Pyhäjärvi T, Chia J-M, Cartwright RA, et al. Comparative population genomics of maize domestication and improvement. Nat Genet. 2012;44:808–811. doi: 10.1038/ng.2309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Hung H-Y, Shannon LM, Tian F, Bradbury PJ, Chen C, Flint-Garcia SA, et al. ZmCCT and the genetic basis of day-length adaptation underlying the postdomestication spread of maize. Proc Natl Acad Sci USA. 2012;109:11068–11069. doi: 10.1073/pnas.1203189109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kover PX, Rowntree JK, Scarcelli N, Savriama Y, Eldridge T, Schaal BA. Pleiotropic effects of environment-specific adaptation in Arabidopsis thaliana. N Phytol. 2009;183:816–825. doi: 10.1111/j.1469-8137.2009.02943.x. [DOI] [PubMed] [Google Scholar]
  34. Kozaki A, Hake S, Colasanti J. The maize ID1 flowering time regulator is a zinc finger protein with novel DNA binding properties. Nucleic Acids Res. 2004;32:1710–1720. doi: 10.1093/nar/gkh337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Lehermeier C, Krämer N, Bauer E, Bauland C, Camisan C, Campo L, et al. Usefulness of multiparental populations of maize (Zea mays L.) for genome-based prediction. Genetics. 2014;198:3–16. doi: 10.1534/genetics.114.161943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Li Y-X, Li C, Bradbury PJ, Liu X, Lu F, Romay CM, et al. Identification of genetic variants associated with maize flowering time using an extremely large multi-genetic background population. Plant J. 2016;86:391–402. doi: 10.1111/tpj.13174. [DOI] [PubMed] [Google Scholar]
  37. Li C, Li Y, Bradbury PJ, Wu X, Shi Y, Song Y, et al. Construction of high-quality recombination maps with low-coverage genomic sequencing for joint linkage analysis in maize. BMC Biol. 2015;13:78. doi: 10.1186/s12915-015-0187-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Liang Y, Liu Q, Wang X, Huang C, Xu G, Hey S, et al. Zm MADS 69 functions as a flowering activator through the ZmRap2. 7-ZCN 8 regulatory module and contributes to maize flowering time adaptation. N Phytol. 2019;221:2335–2347. doi: 10.1111/nph.15512. [DOI] [PubMed] [Google Scholar]
  39. Lu Y, Yan J, Guimarães CT, Taba S, Hao Z, Gao S, et al. Molecular characterization of global maize breeding germplasm based on genome-wide single nucleotide polymorphisms. Theor Appl Genet. 2009;120:93–115. doi: 10.1007/s00122-009-1162-7. [DOI] [PubMed] [Google Scholar]
  40. Mace ES, Hunt CH, Jordan DR. Supermodels: sorghum and maize provide mutual insight into the genetics of flowering time. Theor Appl Genet. 2013;126:1377–1395. doi: 10.1007/s00122-013-2059-z. [DOI] [PubMed] [Google Scholar]
  41. Matsuoka Y, Vigouroux Y, Goodman MM, Sanchez GJ, Buckler E, Doebley J. A single domestication for maize shown by multilocus microsatellite genotyping. Proc Natl Acad Sci USA. 2002;99:6080–6084. doi: 10.1073/pnas.052125199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. McMullen MD, Kresovich S, Villeda HS, Bradbury P, Li H, Sun Q, et al. Genetic properties of the maize nested association mapping population. Science. 2009;325:737–740. doi: 10.1126/science.1174320. [DOI] [PubMed] [Google Scholar]
  43. Meng X, Muszynski MG, Danilevskaya ON. The FT-like ZCN8 gene functions as a floral activator and Is involved in photoperiod sensitivity in maize. Plant Cell. 2011;23:942–960. doi: 10.1105/tpc.110.081406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Metcalfe SE, O’Hara SL, Caballero M, Davies SJ. Records of Late Pleistocene-Holocene climatic change in Mexico—a review. Quat Sci Rev. 2000;19:699–721. [Google Scholar]
  45. Meuwissen THE. Accuracy of breeding values of “unrelated” individuals predicted by dense SNP genotyping. Genet Sel Evol. 2009;41:35. doi: 10.1186/1297-9686-41-35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Miller TA, Muslin EH, Dorweiler JE. A maize CONSTANS-like gene, conz1, exhibits distinct diurnal expression patterns in varied photoperiods. Planta. 2008;227:1377–1388. doi: 10.1007/s00425-008-0709-1. [DOI] [PubMed] [Google Scholar]
  47. Money D, Gardner K, Migicovsky Z, Schwaninger H, Zhong G-Y, Myles S. LinkImpute: fast and accurate genotype imputation for nonmodel organisms. G3. 2015;5:2383–2390. doi: 10.1534/g3.115.021667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Muszynski MG, Dam T, Li B, Shirbroun DM, Hou Z, Bruggemann E, et al. delayed flowering1 Encodes a basic leucine zipper protein that mediates floral inductive signals at the shoot apex in maize. Plant Physiol. 2006;142:1523–1536. doi: 10.1104/pp.106.088815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Perry L, Sandweiss DH, Piperno DR, Rademaker K, Malpass MA, Umire A, et al. Early maize agriculture and interzonal interaction in southern Peru. Nature. 2006;440:76–79. doi: 10.1038/nature04294. [DOI] [PubMed] [Google Scholar]
  50. Piperno DR, Ranere AJ, Holst I, Iriarte J, Dickau R. Starch grain and phytolith evidence for early ninth millennium B.P. maize from the Central Balsas River Valley, Mexico. Proc Natl Acad Sci USA. 2009;106:5019–5024. doi: 10.1073/pnas.0812525106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Rebourg C, Chastanet M, Gouesnard B, Welcker C, Dubreuil P, Charcosset A. Maize introduction into Europe: the history reviewed in the light of molecular data. Theor Appl Genet. 2003;106:895–903. doi: 10.1007/s00122-002-1140-9. [DOI] [PubMed] [Google Scholar]
  52. Rebourg C, Gouesnard B, Charcosset A. Large scale molecular analysis of traditional European maize populations. Relationships with morphological variation. Heredity. 2001;86:574–587. doi: 10.1046/j.1365-2540.2001.00869.x. [DOI] [PubMed] [Google Scholar]
  53. Revilla P, Soengas P, Cartea ME, Malvar RA, Ordas A. Isozyme variability among European maize populations and the introduction of maize in Europe. Maydica. 2003;48:141–152. [Google Scholar]
  54. Rodgers-Melnick E, Vera DL, Bass HW, Buckler ES. Open chromatin reveals the functional maize genome. Proc Natl Acad Sci USA. 2016;113:E3177–E3184. doi: 10.1073/pnas.1525244113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Romay MC, Millard MJ, Glaubitz JC, Peiffer JA, Swarts KL, Casstevens TM, et al. Comprehensive genotyping of the USA national maize inbred seed bank. Genome Biol. 2013;14:R55. doi: 10.1186/gb-2013-14-6-r55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Salvi S, Castelletti S, Tuberosa R. An updated consensus map for flowering time QTLS in maize. Maydica. 2009;54:501–512. [Google Scholar]
  57. Salvi S, Sponza G, Morgante M, Tomes D, Niu X, Fengler KA, et al. Conserved noncoding genomic sequences associated with a flowering-time quantitative trait locus in maize. Proc Natl Acad Sci USA. 2007;104:11376–11381. doi: 10.1073/pnas.0704145104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Salvi S, Tuberosa R, Chiapparino E, Maccaferri M, Veillet S, van Beuningen L, et al. Toward positional cloning of Vgt1, a QTL controlling the transition from the vegetative to the reproductive phase in maize. Plant Mol Biol. 2002;48:601–613. doi: 10.1023/a:1014838024509. [DOI] [PubMed] [Google Scholar]
  59. Schurz WL. The Manila galleon. New York, NY: E.P. Dutton & Company, Inc.; 1939. [Google Scholar]
  60. Smith DG, Crawford GW. Recent developments in the archaeology of the Princess Point Complex in Southern Ontario. In: Hart JP, Rieth CB, editors. Northeast subsistence-settlement change, A.D. 700-A.D. 1300. Albany: New York State Museum bulletin, New York State Museum/New York State Education Department; 2002. pp. 3–19. [Google Scholar]
  61. Swarts K, Gutaker RM, Benz B, Blake M, Bukowski R, Holland J, et al. Genomic estimation of complex traits reveals ancient maize adaptation to temperate North America. Science. 2017;357:512–515. doi: 10.1126/science.aam9425. [DOI] [PubMed] [Google Scholar]
  62. Swarts K, Li H, Romero Navarro JA, An D, Romay MC, Hearne S, et al. Novel methods to optimize genotypic imputation for low-coverage, next-generation sequence data in crop plants. Plant Genome. 2014;7:3. [Google Scholar]
  63. Takuno S, Ralph P, Swarts K, Elshire RJ, Glaubitz JC, Buckler ES, et al. Independent molecular basis of convergent highland adaptation in maize. Genetics. 2015;200:1297–1312. doi: 10.1534/genetics.115.178327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Technow F, Schrag TA, Schipprack W, Melchinger AE. Identification of key ancestors of modern germplasm in a breeding program of maize. Theor Appl Genet. 2014;127:2545–2553. doi: 10.1007/s00122-014-2396-6. [DOI] [PubMed] [Google Scholar]
  65. Thornsberry JM, Goodman MM, Doebley J, Kresovich S, Nielsen D, Buckler ES. Dwarf8 polymorphisms associate with variation in flowering time. Nat Genet. 2001;28:286. doi: 10.1038/90135. [DOI] [PubMed] [Google Scholar]
  66. Tian F, Bradbury PJ, Brown PJ, Hung H, Sun Q, Flint-Garcia S, et al. Genome-wide association study of leaf architecture in the maize nested association mapping population. Nat Genet. 2011;43:159–162. doi: 10.1038/ng.746. [DOI] [PubMed] [Google Scholar]
  67. Troyer AF, Hendrickson LG. Background and Importance of ‘Minnesota 13’ Corn. Crop Sci. 2007;47:905. [Google Scholar]
  68. Unterseer S, Pophaly SD, Peis R, Westermeier P, Mayer M, Seidel MA, et al. A comprehensive study of the genomic differentiation between temperate Dent and Flint maize. Genome Biol. 2016;17:137. doi: 10.1186/s13059-016-1009-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. VanRaden PM. Efficient methods to compute genomic predictions. J Dairy Sci. 2008;91:4414–4423. doi: 10.3168/jds.2007-0980. [DOI] [PubMed] [Google Scholar]
  70. Wallace JG, Bradbury PJ, Zhang N, Gibon Y, Stitt M, Buckler ES. Association mapping across numerous traits reveals patterns of functional variation in maize. PLoS Genet. 2014;10:e1004845. doi: 10.1371/journal.pgen.1004845. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Wang C, Chen Y, Ku L, Wang T, Sun Z, Cheng F, et al. Mapping QTL associated with photoperiod sensitivity and assessing the importance of QTL×environment interaction for flowering time in maize. PLoS ONE. 2010;5:e14068. doi: 10.1371/journal.pone.0014068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Xu J, Liu Y, Liu J, Cao M, Wang J, Lan H, et al. The genetic architecture of flowering time and photoperiod sensitivity in maize as revealed by qtl review and meta analysis. J Integr Plant Biol. 2012;54:358–373. doi: 10.1111/j.1744-7909.2012.01128.x. [DOI] [PubMed] [Google Scholar]
  73. Zaharia M, Chowdhury M, Franklin MJ, Shenker S, Stoica I (2010) Spark: cluster computing with working sets. In: Proceedings of the 2Nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud’10. USENIX Association, Berkeley, CA, USA, p 10

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

Phenotypes and imputed genotypes will be made available upon request.


Articles from Heredity are provided here courtesy of Nature Publishing Group

RESOURCES