Abstract
The study of domestication contributes to our knowledge of evolution and crop genetic resources. Human selection has shaped wild Brassica rapa into diverse turnip, leafy, and oilseed crops. Despite its worldwide economic importance and potential as a model for understanding diversification under domestication, insights into the number of domestication events and initial crop(s) domesticated in B. rapa have been limited due to a lack of clarity about the wild or feral status of conspecific noncrop relatives. To address this gap and reconstruct the domestication history of B. rapa, we analyzed 68,468 genotyping-by-sequencing-derived single nucleotide polymorphisms for 416 samples in the largest diversity panel of domesticated and weedy B. rapa to date. To further understand the center of origin, we modeled the potential range of wild B. rapa during the mid-Holocene. Our analyses of genetic diversity across B. rapa morphotypes suggest that noncrop samples from the Caucasus, Siberia, and Italy may be truly wild, whereas those occurring in the Americas and much of Europe are feral. Clustering, tree-based analyses, and parameterized demographic inference further indicate that turnips were likely the first crop type domesticated, from which leafy types in East Asia and Europe were selected from distinct lineages. These findings clarify the domestication history and nature of wild crop genetic resources for B. rapa, which provides the first step toward investigating cases of possible parallel selection, the domestication and feralization syndrome, and novel germplasm for Brassica crop improvement.
Keywords: artificial selection, turnip, napa cabbage, genotyping-by-sequencing, dedomestication, oilseeds
Introduction
Domestication is a process of adaptation to agroecological niches and human preferences (Larson et al. 2014) driven by a complex mix of ecological, biological, and cultural factors (Price et al. 2011; Gepts et al. 2012). The study of domestication provides insight into the nature of contemporary crop genetic resources (Gepts et al. 2012; Zeder 2015) and evolutionary processes in general (Darwin 1868; Fuller et al. 2014). The identity of the wild relatives of a crop is a key piece of information in disentangling the often-complex domestication history, allows the direct comparison of wild to domesticated forms (Page et al. 2019), and can contribute useful alleles responsible for agronomically important traits (Zhang et al. 2017). However, the identification of truly wild populations is often complicated by the presence of conspecific feral populations that are common for many crops (e.g., Wang et al. 2017). Unlike wild populations, feral plants derive from domesticated crops that have either escaped from cultivation on their own (endoferal) or through crossing with wild populations (exoferal; Gressel 2005). Additionally, complex histories of hybridization and introgression among wild, feral, and domesticated forms within and across species make identification of truly wild populations difficult (Beebe et al. 1997; Wang et al. 2017; Allaby et al. 2008).
The morphologically diverse crops in the genus Brassica (e.g., broccoli, cabbage, bok choy, and canola) provide powerful study systems to understand domestication and artificial selection (Gómez-Campo and Prakash 1999; Bird et al. 2017; Qi et al. 2017; An et al. 2019). Crops in the genus Brassica are also nutritionally and economically important worldwide, valuing around 14 billion dollars in 2014 (FAOSTAT 2014). One of the most widespread crop species, Brassica rapa L. (Brassicaceae: 2n = 20), includes turnips, leafy greens, such as bok choy, napa cabbage, mizuna, tatsoi, rapini, grelos, and choy sum, and oilseed crops, such as turnip rape, toria, and yellow sarsons as well as weedy forms (B. rapa ssp. sylvestris) which may be wild or feral (table 1; Gómez-Campo and Prakash 1999; Prakash et al. 2011). The diverse forms of B. rapa vary in both self-compatibility and annual/biennial habit. With the exception of the Central Asian oilseed crop, yellow sarsons, the species is largely reported as self-incompatible (Nasrallah and Nasrallah 1989). Many of the crops and weedy forms are annual, but turnips, some turnip rape cultivars, and some napa cabbage are biennial and require vernalization to flower (Zhao et al. 2007). Propagation of B. rapa crops and weeds is through seeds and pollination of B. rapa is through insects and, to a lesser extent, wind and physical contact (Pertl et al. 2002). Despite the global economic importance of B. rapa as a crop species and its potential as a model system for artificial selection, many aspects of its domestication are contested. Outstanding questions include when, where, and how many times it was domesticated into various morphotypes, distinguishing the wild and feral types, what the first crop type was, how other crop types emerged from the initially domesticated crop type, and to what degree different domesticated and feral forms have evolved convergently.
Table 1.
Geography | Common Name | Latin Binomial | Plant Organ(s) Used |
---|---|---|---|
C. Asia | Yellow sarson | ssp. trilocularis (Roxb.) Hanelt | Seeds/leaves |
C. Asia | Toria/Brown sarsona | ssp. dichotoma (Roxb.) Hanelt | Seeds/leaves |
E. Asia | Komatsuna | ssp. perviridis Bailey | Leaves/inflorescences |
E. Asia | Bok Choy | ssp. chinensis (L.) Hanelt | Leaves |
E. Asia | Napa cabbage | ssp. pekinensis (Lour.) Hanelt | Leaves |
E. Asia | Choy sum | var. parachinensis (L.H. Bailey) Hanelt | Leaves/inflorescences |
E. Asia | Mizuna | ssp. nipposinica (L.H. Bailey) Hanelt | Leaves |
E. Asia | Tatsoi | ssp. narinosa (L.H. Bailey) Hanelt | Leaves |
E. Asia | Taicai | ssp. chinensis Makino var. tai-tsai Hort | Leaves |
E. Asia | Zitaicai | var. purpuraria (L.H. Bailey) Kitam | Leaves/inflorescences |
Europe | Rapini | ssp. sylvestris L. Janch. var. esculenta Hort | Leaves/inflorescences |
Eurasia | Turnip rape | ssp. oleifera (DC.) Metzg. | Seeds |
Eurasia | Turnip | ssp. Rapa | Root-hypocotyl/leaves |
Eurasia | Weedy (wild/feral)b | ssp. sylvestris (L.) Janch. | Leaves/inflorescences |
Note.—Classification under the species level in B. rapa is currently contested (McAlvay et al. 2018), for example, the Iberian leafy crop known as grelos or nabiza are typically considered in the same subspecies as Italian rapini. See Discussion for taxonomic implications of our results.
For simplicity brown sarsons and toria will be considered together as “toria.”
For the purposes of this study, we define “weedy” B. rapa as wild or feral populations. We chose the term “weedy” over “wild” to include the possibility of ferality, and reflect the preference of spontaneously occurring B. rapa for disturbed areas (Weis and Kossler 2004).
Past studies of B. rapa have recovered conflicting morphologic, genetic, and geographic patterns. Whereas some studies show that morphologically similar crops (e.g., oilseeds) are closely related to each other despite being geographically distant (Zhao et al. 2005; Del Carpio et al. 2011; Tanhuanpää et al. 2016), others found that geographically proximal crops were closely related despite strong morphological differences (Song et al. 1988, 1990; Zhao et al. 2005; Takuno et al. 2007; Del Carpio et al. 2011; Annisa et al. 2013; Guo et al. 2014). This has led some researchers to propose multiple domestication events (Song et al. 1988; McGrath and Quiros 1992; Zhao et al. 2005), whereas others support a single domestication (Burkill and Haniff 1930; Ignatov et al. 2008; Qi et al. 2017).
The proposed region(s) and timing of domestication are similarly varied. Eurasia harbors a number of centers of crop domestication including Southeast, East, South, and Central Asia, with the former two regions originating as early as the Early Holocene (Larson et al. 2014). Prior to the establishment of Silk Road networks linking all of these putative domestication centers, there was widespread early diffusion of crops across Europe and Asia. For example, certain near-Eastern domesticates like barley are thought to have spread throughout Eurasia as early as 4,000 years ago (Lister et al 2018). Proposed centers of origin for B. rapa include Europe (Song et al. 1988, 1990; Zhao et al. 2005), West Asia (Harberd 1972), Central Asia (Ignatov et al. 2008; Qi et al. 2017), and East Asia (Song et al. 1988; Zhao et al. 2005). The timing of domestication has been difficult to assess due in part to the relative difficulty of finding and identifying Brassica seeds at archaeological sites (but see Allchin 1969 and Hyams 1971), but literary and linguistic evidence have provided some insights (supplementary table 1, Supplementary Material online; Qi et al. 2017).
The initial crop type selected, its diffusion, and the subsequent selection for different crops is also debated. Although some hypothesize that turnips were the first domesticated B. rapa crop type (McGrath and Quiros 1992; Siemonsma and Piluek 1993), early literary and linguistic evidence suggest that oilseed types are also ancient (supplementary table 1, Supplementary Material online). Some genetic studies have variously suggested that napa cabbage was selected from bok choy (Song et al. 1990; Zhao et al. 2005; Takuno et al. 2007), turnips (McGrath and Quiros 1992), or a cross between bok choy and turnips (Li 1981; Song et al. 1988; Ren et al. 1995; Qi et al. 2017). Furthermore, several B. rapa leafy crops exist in northwestern Spain and Portugal (Francisco et al. 2009), but their origins have not been investigated. Although the leaves and petioles of turnip crops are used for food in many cultures and leafy types without enlarged root-hypocotyls are sometimes referred to as “turnip tops” or “turnip greens” (Martínez et al. 2013), the crops referred to in this study as leafy types (e.g., rapini, grelos, bok choy, napa cabbage) do not have swollen below-ground storage organs. Uncertainty also remains surrounding the B. rapa oilseed crops, as there are apparent references to yellow sarsons (B. rapa ssp. trilocularis) as far back as 2,900 years ago in India, whereas turnip rape (B. rapa ssp. oleifera) may have originated relatively recently in Europe and China from turnips and leafy vegetables, respectively (Gómez-Campo and Prakash 1999). Although an apparent multiple origin of turnip rape is widely supported by molecular data, the specific progenitors of the different lineages are unclear (Bird et al. 2017; Qi et al. 2017).
A major limitation to insights into the domestication of B. rapa has been a lack of knowledge about weedy forms (B. rapa ssp. sylvestris) which could be either truly wild or feral (McAlvay et al. 2018). Wild crop relatives can be a key to clarifying the location and timing of domestication events (Meyer et al. 2012). The identification of teosinte as the progenitor of maize led to key insights into the morphological and genomic transitions of maize domestication and made substantial contributions to breeding efforts (Wilkes 1967; Cruz-Cárdenas et al. 2019). Although B. rapa ssp. sylvestris occur spontaneously on roadsides, waste areas, farmlands, and riversides in many temperate areas worldwide (Andersen et al. 2009), it is unclear whether these populations are feral crop escapes or truly wild forms (Crouch et al. 1995; Andersen et al. 2009). Feral plants—derived fully or partially from domesticated crops—may or may not resemble wild forms and may harbor different levels of diversity than wild conspecifics (Gressel 2005). Definitively distinguishing wild and feral populations can be challenging, but evidence for ferality may include crops emerging as sister to weedy populations in phylogenies or introgression between crop and weedy populations (Page et al. 2019). Disentangling feral and truly wild B. rapa could position the species as a powerful model system for studying both evolution under domestication and ferality given its close relationship to the genetic model Arabidopsis thaliana, rapid life cycle, and well-annotated reference genome.
To clarify the wild or feral status of weedy B. rapa populations and reconstruct the domestication history of crop forms, we performed genotyping-by-sequencing (GBS) (Elshire et al. 2011) on an unprecedentedly broad panel of B. rapa crops and weedy samples to assess genetic structure, diversity, introgression, and demographic history. We also modeled the distribution of weedy B. rapa in Eurasia and North Africa during recent and mid-Holocene conditions to project areas of suitable habitat during the putative time of domestication and identify potential modern distribution. We investigated the following questions: 1) are weedy B. rapa populations wild or feral? 2) when, where, and how many times was B. rapa domesticated and what was or were the first domesticated crop type(s)? and 3) were the crop morphotypes (leafy, turnip, oilseed) selected one or multiple times from the initial domesticated morphotype?
Results
Genotyping-by-Sequencing
SNP Calling, SNP Filtering, and Sample Filtering
Sequencing produced 656,335,837 raw reads (average 218,778,612 per lane) that were combined with 823,954,356 raw reads from Bird et al. (2017). A total of 372,182 single nucleotide polymorphisms (SNPs) were called using the Tassel pipeline (Glaubitz et al. 2014). After downstream SNP and sample filtering, we retained 416 samples (supplementary table 2, Supplementary Material online) and 68,468 SNPs at a minimum minor allele frequency of 0.05.
Diversity
Nucleotide diversity (supplementary table 3, Supplementary Material online) within populations of B. rapa showed higher diversity in turnip samples from Central and Western Asia, intermediate levels of diversity in European crops and weeds, and lowest diversity in East Asian crops and South Asian oilseeds. The analysis of molecular variance analysis (supplementary table 4, Supplementary Material online) revealed that most of the variation present is within populations (52.39%), followed by geographical groups (28.54%), and among populations within geographical groups (19.07%).
Genetic Structure and Maximum Likelihood Trees
To investigate the genetic structure of crop and weedy B. rapa, we carried out fastSTRUCTURE analysis across K values of 1 through 20. A K value of five maximized marginal likelihood in fastSTRUCTURE (supplementary fig. S1, Supplementary Material online) with K = 8 best explaining the structure in the data using a variational inference algorithm. The resulting plots from K = 5 to K = 8 are shown in figure 1. At K = 5, napa cabbage, bok choy, and yellow sarsons formed relatively discrete cluster, with a composite cluster consisting of toria and East Asian turnips/Japanese greens. Toria was unique from East Asian turnips/Japanese greens in that it has a small amount of shared ancestry with yellow sarsons, whereas East Asian turnips/Japanese greens share a small amount of ancestry with napa cabbage and bok choy. Central Asian turnips, weedy populations from Caucasus, Siberia, and Italy (together referred to as “weedy CSI” below), and Turkish turnips show shared ancestry with European crops and weeds, toria, the East Asian turnips/Japanese greens cluster, napa cabbage, and bok choy. At K = 6 differentiation emerges between different subclusters of napa cabbage with the purple cluster predominantly from South Korea and the red cluster largely from China and other countries. At K = 7, East Asian turnips and Japanese greens emerge in their own cluster. At K = 8, some of the Central Asian turnips (from Afghanistan) emerge as a separate cluster with some signatures of admixture with this Afghani cluster showing up in Turkish turnips, other Central Asian turnips, and weedy Caucasus, Siberia, and Italy.
To investigate the relationships between individuals, we built maximum likelihood trees in RAxML and SNPhylo as well as a distance-based neighbor-joining (NJ) tree. In the RAxML tree (fig. 1), the weedy CSI samples are recovered as sister to the remaining lineages with strong bootstrap support (BS; 93%). Weedy European and American populations did not cluster with the weedy populations from CSI, instead emerging within the European crop clade. European turnips were sister to the weedy European and American populations with low BS (35%). Central Asian turnips emerged as sister to a clade of North African and Turkish turnips, European crops and European weedy populations. The leafy European crops (rapini and grelos) emerged in separate clades with North African turnips sister to rapini (97% BS) and Spanish turnips sister to grelos (56% BS). Toria emerged in a grade leading up to a clade of self-compatible yellow sarsons with short branches. The yellow sarson and toria were together sister to the East Asian crops. The East Asian Turnips and Japanese greens clade was sister to clade containing napa cabbage and bok choy (100% BS). The SNPhylo tree (supplementary fig. S2, Supplementary Material online) was broadly congruent with the RAxML tree except weedy CSI, Central Asian turnips, and Turkish turnips emerged as sister to East and Central Asian crops rather than with weedy CSI sister to all crops and weeds and Central Asian and Turkish turnips as sister to all European crops and weeds. The NJ tree (supplementary fig. S3, Supplementary Material online) largely corroborated the RAxML results with minor differences. The principal difference was that in the NJ tree, North African turnips came out as sister to weeds from the Americas and Europe rather than to rapini.
The principal component analysis (PCA) generally separated samples based on geography and crop type (fig. 2). PC1 (18.3% of variance explained) separated Central Asian oilseeds from the remaining crops and weeds. PC2 (10.4% of variance explained) separated East Asian crops from Central Asian turnips, European turnips, greens, and weeds. The weedy CSI samples clustered in between Central Asian turnips, toria, and East Asian turnips/Japanese greens, whereas weedy forms from Europe and the Americas were closely affiliated with European turnips and Mediterranean leafy types (Italian rapini and Spanish grelos). The European weeds and turnips are not well differentiated from each other using the first several PC axes (supplementary fig. S4A, Supplementary Material online). Among East Asian crops, turnips emerged as most closely associated with Central Asian crops and weeds, followed by bok choy, and napa cabbage. Subsets of the samples are shown in supplementary figure S4B–F, Supplementary Material online.
F ST values were significantly different among populations and ranged from 0.046 between European weeds and European turnips to 0.899 between yellow sarsons and North African turnips (supplementary table S5, Supplementary Material online). Grelos were least differentiated from Spanish turnips and rapini were least differentiated from other European turnips. Yellow sarsons were largely differentiated from all other morphotypes (all values >0.5). Weeds from the Americas showed lowest differentiation from Spanish turnips (0.076) and European weeds (0.071). Weedy populations from CSI were least differentiated from Central Asian turnips (0.126), European turnips (0.118), and East Asian turnips (0.119).
TreeMix and f4-Statistics
To investigate reticulation in the evolutionary history of B. rapa crops and weeds, we used TreeMix (Pickrell and Pritchard 2012) and four population (f4) tests of treeness (Reich et al. 2009). TreeMix generates phylogenetic networks by first creating a dendrogram based on genetic drift and then comparing the covariance structure of the dendrogram to covariance across populations to estimate admixture. f4-statistics test the support for admixture in tested populations by detecting whether allele frequency correlations are compatible with treelike evolution. Overall, the relationships recovered are largely congruent with the RAxML phylogeny of Eastern and Western clades. However, the major difference is the placement of weedy CSI as sister to the Asian or Eastern clade rather than as sister to all other cultivars as in the RAxML (fig. 1) and NJ tree (supplementary fig. S3, Supplementary Material online). The tree model with no migration explains 96.4% of the variation in relatedness between the populations, however, sequentially adding migration events results in four migration events representing 98.5% the variation (supplementary fig. S5, Supplementary Material online). Two migration events are supported by four-population (f4) tests for treeness (fig. 3): between toria and Central Asian turnips (f4 = −0.003, Z = −8.03) and between weedy CSI and weedy European populations (f4 = −0.003, Z = −5.98).
Linkage Disequilibrium Decay, Effective Population Sizes, and Demographic Modeling
In order to investigate population branching structure, population size changes, and migration between populations, we constructed and tested demographic models using “moments” v1.0.1 (Jouganous et al. 2017), which uses the joint allele frequency spectrum (AFS) to fit parameterized demographic models for the populations depicted in supplementary table S9, Supplementary Material online. To estimate Ne and visualize patterns of linkage disequilibrium (LD), we also calculated and graphed LD for populations. Our LD analysis revealed that yellow sarsons have strongly elevated long-range LD, whereas toria and East Asian turnips also show elevated long-range LD in comparison with other populations (supplementary fig. S6, Supplementary Material online). Using LD between unlinked SNPs, those populations also were also found to have the smallest recent population sizes with Ne ≅ 5–10 (supplementary table S6, Supplementary Material online). Yellow sarsons are self-compatible (Sethi et al. 1970), which can dramatically increase long-range LD and result in reduced estimates of Ne (Ragsdale and Gravel 2020), whereas high levels of inbreeding or a recent strong bottleneck may also increase long-range LD and lead to small estimates of Ne.
For the demographic models comparing toria, turnips, and weedy CSI, we found that models that allowed for migration always fit far better than those that did not allow for migration, that the inferred migration rates are very large, and in general we see that the admixture models for weedy CSI are not better supported than divergence with gene flow. However, we cannot confidently infer the relative order of splits between toria, weedy CSI, and turnip crops, likely because the very high migration rates that extend from the time of divergence to the present obscure any signal of a “clean” split between populations (supplementary table S7, Supplementary Material online).
We inferred demographic models for sets of two to four populations using the joint SFS with moments (Jouganous et al. 2017). We first focused on populations in central Eurasia to try to disentangle the deeper population structure and relative order of divergences between toria, turnips, and weedy CSI (supplementary table S7, Supplementary Material online). In general, our best-fit models show that there were high levels of gene flow between toria, turnips, and weedy CSI, with only European crops having more moderate gene flow with Central and East Asian populations. Because of the large gene flow between toria, turnips, and weedy populations, we were unable to definitively resolve the relative timing of splits between these populations. Our best-fit models do not support the scenario that weedy CSI arose recently through admixture, but instead that its existence predates the more recent radiation of turnip and leafy crops in Europe and East Asia.
We then turned to inferring demographic histories of turnip and leafy crops (supplementary table S7 and fig. S7, Supplementary Material online). Across all tested demographic models, we inferred that crop forms split from weedy B. rapa from the Caucasus, Italy, and Siberia around 3,430–5,930 years before present, assuming on average one generation per year. The European and Asian lineages diverged roughly 1,930–2,430 years ago, with subsequent diversification over the last one to two thousand years. However, there is large uncertainty surrounding these inferred dates. Not only are the per-base mutation rate and long-term generation times uncertain, the very high levels of inferred migration confound the inferred split times.
For the more recent history, we observe multiple origins of leafy green crops in both the Western and Eastern regions. Grelos split from Spanish turnips roughly 730–930 years before present, with subsequent high levels of gene flow between those two lineages, whereas rapini split from the Turkish turnip lineage around the same time, with subsequent gene flow between rapini and European turnips. In the Eastern lineage, we observe an early diversification of napa cabbage from turnips, and a more recent emergence of bok choy, with contributions from both East Asian turnips and napa cabbage.
We inferred high levels of regional migration between nearby crops and reduced or zero recent gene flow between more distant lineages. For example, we inferred high levels of gene flow between Spanish and European turnips, and between European and Turkish turnips, but far less direct gene flow between Spanish turnips and turnips from Turkey or farther east. Similarly, we inferred migration between East and Central Asian turnips but not between East Asian turnips and crops in Europe.
Species Distribution Modeling
Species distribution modeling or environmental niche modeling uses species occurrence data based on herbarium specimens, observations, and other occurrence data combined with environmental data to predict suitable habitat in the past, present, and/or future (Kozak et al. 2008; Nakazato et al. 2010). In order to narrow the potential center of domestication and identify conservation priority areas for wild relatives, we generated species distribution models to infer the fundamental ecological niche of B. rapa under current and mid-Holocene climate conditions. The omission rate on training samples was close to predicted omission and the receiver operating characteristic curve (DeLong et al. 1988) showed an area under curve of 0.957, suggesting that the model fit the data well (values above 0.75 are considered informative) (Phillips and Dudik 2008). The following three bioclimatic variables were most important for describing the niche: mean temperature of driest quarter (28.7%), mean temperature of coldest quarter (26.7%), and precipitation of coldest quarter (10.8%). The model predicting B. rapa distribution under a mid-Holocene climate scenario (fig. 4A) indicated suitable habitat in a nearly contiguous band from Iberia, North Africa, and the British Isles to coastal China and Japan. This strip of habitat mainly followed highland formations, except in Europe, where coastal areas were also of high suitability. A potential gap in suitable habitat was present in the present-day borderlands of Afghanistan and Iran. The contemporary model (fig. 4B) was generally similar to the mid-Holocene model except for a general pattern of greater suitability in northern areas and lesser suitability in southern areas.
Discussion
Insights into the Identities of Wild and Feral Populations
The identification of crop wild relatives allows for direct comparisons between wild and crop forms and provides an important source of genetic material for crops (Honnay et al. 2012; Dempewolf et al. 2014), which often have relatively limited genepools due to diversity bottlenecks from selection (Olsen and Gross 2008). Feral relatives can be morphologically indistinguishable from wild material (Wang et al. 2017) and the wild or feral nature of B. rapa populations have been contested (Andersen et al. 2009) hindering domestication research, breeding, and conservation in the species. In our analyses, weedy B. rapa samples from the Caucasus, Siberia, and Italy (“weedy CSI”) consistently segregate in a lineage distinct from weedy samples from the Americas and Europe outside of Italy, a lineage that has not been recovered in previous studies. With the exception of a single accession from Abruzzo in central Italy, and an accession from Tomsk in south-central Siberia, the weedy CSI clade consists of samples in or close to the Caucasus mountains in Georgia and northeastern Turkey. Both mid-Holocene and contemporary niche models suggested high habitat suitability in the Caucasus and Italy, but generally low suitability across Siberia except for in the extreme southern region. The lack of suitability in the latter case could be due to a lack of occurrence data from Siberia used in the niche model or erroneous passport data for the Siberian accession. Our analyses present the possibility that this lineage represents either truly wild relatives of some or all B. rapa crops or populations deriving from an early feralization event.
The hypothesis that the weedy CSI clade represents truly wild populations is consistent with Sinskaya’s (1969) suggestion that wild forms still exist in the Caucasus mountains and Siberia and our niche models indicate that habitat for B. rapa in the Caucasus, Siberia, and Italy would be suitable under a mid-Holocene climate model (fig. 4A). The position of weedy CSI samples as sister to all crops and weeds in the RAxML and NJ analyses, and high nucleotide diversity support this interpretation. This lineage is also located at the juncture of all the crop subspecies in the PCA and contains ancestry common to each of the major crop clades in fastSTRUCTURE, which may indicate that the other crops as weeds were subsampled from the diversity present in weedy populations in the CSI range. The placement of weedy CSI in the SNPhylo and TreeMix trees as sister to Central and East Asian crops, suggests the possibility that weedy CSI is truly wild, but an independent domestication event gave rise to the European crops.
An alternative interpretation would be that weedy CSI represents a highly admixed feral lineage. Many crops are known to exist in wild-domesticated-feral hybrid swarms (e.g., Beebe et al. 1997; Wang et al. 2017; Allaby et al. 2008). As mentioned above, the diverse ancestry for these individuals recovered by fastSTRUCTURE and the central position in the PCA could be explained by admixture between crop and/or weedy types. Several patterns traditionally associated with wild progenitors in analyses may instead indicate admixed feral populations (Wang et al. 2017). For example, high levels of admixture could also have inflated nucleotide diversity and caused the clade to be placed at an artificially deep position in phylogenetic analyses. Demographic models tested in moments were inconclusive regarding the early splits of the major lineages, but the models that performed best allowed for extensive early gene flow, supporting this hypothesis. The TreeMix recovered a migration edges supported by f4-statistics that would suggest gene flow between CSI and toria, and the TreeMix and SNPhylo trees are consistent with weedy CSI populations deriving from an ancient feral lineage that branched off from an ancestor of Central Asian oilseeds and East Asian crops.
Our analyses recovered strong evidence for the feral nature of the weedy populations outside of the CSI clade. Non-CSI European and American weedy samples emerged in association with European crops in PCA, fastSTRUCTURE, RAxML (99% BS), SNPhylo, and NJ analyses and had relatively low differentiation from European crops in the FST analysis. Despite lower levels of diversity among the feral samples, their local adaptation to biotic and abiotic stresses could make them a valuable source of breeding material as in feral rice (Li et al., 2017).
Due to the geographic disjuncture of the weedy CSI samples, it is likely that there are related populations in the intervening areas. Several possibilities could explain the geographic disjuncture of the samples from Italy and the Caucasus in the CSI clade. Weedy B. rapa from the intervening area (e.g., Slovakia, Austria, and Serbia) were associated with other European weeds and crops in the analyses, but the TreeMix analysis suggested that introgression had occurred between the CSI clade and the European weeds, consistent with an exoferal origin of most of the European weedy accessions. Truly wild populations could have been more widespread in Europe previously, but may have been largely replaced by successful feral populations that are preadapted to agroecosystems. Alternatively, the sole Italian sample could have been introduced relatively recently as a seed contaminant in imported crops. Weedy populations of B. rapa have recently been reported in Southwest China (Dong et al. 2018) and Japan (Aono 2011), despite previous reports of absence from East Asia (De Candolle 1886; McGrath and Quiros 1992) as well as in Algeria (Aissiou et al. 2018). Sampling these populations could further clarify the evolutionary history of B. rapa.
Domestication Center, Timing, and Initial Crop Type
The location and timing of domestication as well as the initial domesticated form of B. rapa that was subsequently selected for other crop types has been debated. Our analyses support an initial domestication of turnips and/or oilseeds in Central Asia 3,430–5,930 years before present (YBP) with subsequent diffusion to Europe and East Asia.
Turnips from Central Asia appear to be most closely associated with the putatively wild CSI populations in our FST analysis and had the highest levels of nucleotide diversity of any population. This is consistent with the findings of Qi et al. (2017), who found Central Asian and European turnips to have relatively higher values than East Asian vegetables, and much higher values than yellow sarsons and toria. Given our use of different reduced representation markers, the absolute comparison of values is not possible. Also as in Qi et al. (2017), we estimated highest Ne for turnips, intermediate values for East Asian vegetables, and the lowest values for yellow sarsons and toria. The turnip samples from the Hindu Kush mountains of Afghanistan, Pakistan, and Tajikistan are sister to the rest of the B. rapa samples from West Asia and Europe in our TreeMix, RAxML, and NJ trees and demographic model and are located in an area of very high habitat suitability in the Mid-Holocene species distribution model.
East Asian turnips are also associated with the putatively wild samples and Central Asian turnips as seen in the PCA, fastSTRUCTURE, and FST analyses. East Asian turnips also emerge as sister to other East Asian crops (with the exception of the Japanese leafy crops which are intermingled with East Asian turnips) in our tree-based analyses. An early Central Asian origin is supported by linguistic evidence for turnips in the Pontic Steppe north of the Caucasus mountains approximately between 6430-4278 YBP and in S.W. Asia nearly 3,000 YBP (supplementary table 1, Supplementary Material online).
It has been hypothesized that Central Asian oilseeds arose from an independent domestication event (Zhao et al. 2005; Warwick et al. 2008) or from European oilseeds (Song et al. 1988). Although our TreeMix, SNPhylo, NJ, and RAxML trees show Central Asian oilseeds from the mountains of Afghanistan, Pakistan, and India as sister to East Asian crops, they are associated with West, Central, West, and East Asian turnips in fastSTRUCTURE. These results support either an independent domestication for oilseeds or selection on Central Asian turnips before the diversification of other crop forms across Europe and Asia. Literary evidence for oilseed crops in Northern India almost 3,000 YBP supports their antiquity in the area (supplementary table 1, Supplementary Material online). Apparent introgression between the Central Asian oilseed toria and Central Asian turnips as recovered by TreeMix and f4-statistics is consistent with the geographic proximity of these crops. In our tree-based analyses, toria is consistently sister to yellow sarsons.
In the demographic models we tested with weedy CSI as sister to the remaining crops and weeds, the split was estimated at between 3,430–5,930 YBP (years before present). If the weedy CSI populations are truly wild, this provides a rough temporal estimate of the domestication of B. rapa. Although this prediction is sensitive to our estimates of initial effective population size and generation time, this finding is consistent with archaeological and linguistic evidence (supplementary table 1, Supplementary Material online).
Convergent Phenotypes in Leafy and Oilseed Crops Deriving from Turnip Crops
Our inclusion of diverse leafy crop types from Europe and Asia allowed us to identify distinct evolutionary lineages for at least three different types of leafy B. rapa crops—grelos, rapini, and East Asian greens—all with likely origins in different turnip populations. None of our tree-based analyses recovered these three leafy crops together as a common monophyletic clade and they do not cluster with one another in the PCA. Instead, turnip crops from geographically similar regions emerged as sister to each one.
North African turnips emerged as sister to the Italian leafy crop rapini in TreeMix, RAxML (97% BS), SNPhylo, NJ analyses, and Turkish turnips were sister to rapini in the moments demographic model. Our findings differ from Qi et al. (2017) who found rapini to be sister to the Central Asian oilseeds possibly because the majority of the rapini samples used in Qi et al. (2017) were later suspected to be polyploids in Bird et al. (2017) or had Central Asian provenance. Our findings suggest a possible trans-Mediterranean introduction of turnips to Italy followed by selection for leafy forms or domestication for a leafy type in North Africa where turnip leaves are also eaten (Hammer and Perrino 1985). A different Mediterranean turnip population, Spanish turnips, however, emerged as sister to the Spanish leafy crop grelos in our RAxML, SNPhylo, and NJ analysis. PCA results also indicate an association of the two Mediterranean leafy crops with distinct turnip populations. This provides the first evidence that grelos were independently selected from local turnip crops instead of sharing a common origin with rapini (Francisco et al. 2009).
A similar pattern was recovered in East Asia, where leafy crops may have an origin in a distinct population of turnips. East Asian turnips emerged as sister to East Asian leafy crops in the TreeMix, RAxML, SNPhylo, and NJ trees consistent with the results from Bird et al. (2017) and Cheng et al. (2016). The origin of East Asian leafy types in turnips is also consistent with Takuno et al.’s (2007) hypothesis that an unimproved proto-crop from Europe or Central Asia was introduced to East Asia where it was subsequently selected into diverse leafy morphotypes. The origins of napa cabbage as a result of intermixing of bok choy and East Asian turnips is supported in our demographic model as in Qi et al. (2017) but not recovered in the TreeMix migration edges supported by four-population tests. Most Japanese leafy greens are also clustered with the East Asian turnips suggesting that they too may share a common origin in turnips and may have been selected from an early introduction of turnips or a proto-vegetable type. Although the East Asian turnips sampled had elevated LD compared with bok choy and napa cabbage, Cheng et al. (2016) found the opposite, perhaps as a result of their broader sampling.
Although these patterns could suggest parallel selection for leafy types out of distinct turnip populations, they may also be explained by a single origin of the leafy morphotype with subsequent introgression resulting in the leafy phenotype arising in other turnip lineages, although the latter possibility is not supported by our TreeMix and f4-statistic analyses. The enlarged root-hypocotyl of turnips is controlled by relatively few genes and may have therefore been lost multiple times through human selection and/or ferality (McGrath and Quiros 1992). The pattern of selection for leafy crops out of conspecific crops with swollen stems or taproots does not seem to be common in analogous crops. A similar trend has been observed, however, in B. juncea as it appears that root mustard (B. juncea ssp. napiformis) was the initially selected domesticated, followed by leafy, stem, and oilseed types (Yang et al. 2018). This differs from B. oleracea, B. napus, and Beta vulgaris, where leafy or oilseed types appear to predate the swollen stem/root types (Zohary and Hopf, 2000; Maggioni 2015; An et al. 2019).
Our limited sampling of turnip rape oilseeds from East Asia and Europe restrict our inferences, but like Bird et al. (2017) and Qi et al. (2017), turnip rape did not form a monophyletic clade. Our data support an origin of European turnip rape from European turnips, and East Asian turnip rape from turnips, and/or bok choy consistent with Reiner et al. (1995).
Taxonomic Implications
Our findings indicate that an infraspecific taxonomic recircumscription of B. rapa is warranted to avoid confusion. Several classification systems are currently in use with a combination of subspecies, varieties, and cultivar groups as infraspecific designations (McAlvay et al. 2018). Our findings highlight several paraphyletic lineages that are typically considered to be the same taxon including turnip rape (B. rapa ssp. oleifera), weedy forms (B. rapa ssp. sylvestris), and rapini/grelos (B. rapa ssp. sylvestris var. esculenta). In the case of the latter name the “ssp. sylvestris” epithet is misleading given its likely (multiple) origins in turnips. Furthermore, our results indicate that turnips may also have multiple origins or a single origin with reversions to turnip types, with the former scenario suggesting the need for a reexamination of nomenclature. A phylogenetically and morphologically informed recircumscription of B. rapa to create an international standard for delineating these lineages would be a valuable future contribution to Brassica research and breeding.
Conclusions
Our broad sampling of crops and weedy populations of Brassica rapa allowed us to identify feral and potentially wild relatives, identify a putative center of domestication, and reconstruct the diversification history of crop morphotypes. These insights enhance Brassica rapa as a model system to study artificial selection and ferality as well as contribute to conservation of crop genetic resources in this species.
We found evidence for a distinct lineage of B. rapa from the Caucasus, Siberia, and Italy which may represent the first documented wild B. rapa populations or the descendants of an ancient feral lineage, and we determined that weedy plants in a large portion of the European and American distribution are feral populations resulting from introgression between relatives of weedy CSI and European turnips. Our findings provide evidence of the complex interplay of wild, crop, and feral populations that may take place throughout the domestication process and raise the possibility that the evolutionary relationships between these lineages is not treelike, a pattern also observed rice (Wang et al. 2017), common bean (Beebe et al. 1997), and other crops. If the weedy CSI lineage is truly wild, it provides an unprecedented opportunity to directly compare wild, domesticated, and feral plants to determine functional genomic differences and morphological changes under human selection and feralization (Page et al. 2019). This enhances B. rapa, which has a well-characterized genome, short generation time, and diversity of crop types, as a powerful model system for understanding domestication and ferality. Distinguishing these gene pools contributes to the prioritization of B. rapa populations for conservation and germplasm collection (Brozynska et al. 2016) that may prove important in the future to cope with changing environmental conditions (Guarino and Lobell 2011). This is especially urgent, as potentially wild populations of B. rapa that are seen as weeds could be harmed by increased agricultural weed control in its native range (Crouch et al. 1995).
We found evidence for a common origin of B. rapa crops in turnips or oilseeds domesticated in the mountains of Central Asia between 3,430 and 5,930 years ago with subsequent selection on turnips for leafy vegetables and oilseeds one or multiple times in the Mediterranean and East Asia. Our findings highlight the importance of Central Asia for germplasm collection and in situ agrobiodiversity conservation and indicate, for the first time, that the Spanish leafy crop grelos has a distinct origin from Italian rapini. The appearance of various geographically disparate but morphologically convergent leafy, oilseed, and tuber-forming types suggests that crops in other species with multiple morphotypes such as other Brassica species and Beta vulgaris (beets, chard) may have been selected for particular suites of traits on multiple occasions. The nutritional characteristics, culinary uses, and storability of these organs in each of these species is relatively different, suggesting that selection for multiple morphotypes may have been a common strategy to fill different food systems niches across Eurasia.
Our results raise new questions about the nature of the weedy CSI lineage, potential parallel selection, as well as domestication and feralization syndromes in B. rapa. Additional sampling of B. rapa populations in the Caucasus, Siberia, and Italy as well as known weedy populations in western China (Dong et al., 2018), Japan (Aono 2011), and North Africa (Aissiou et al. 2018) would provide further clarity regarding wild and feral gene pools. Our finding of convergent leafy and oilseed phenotypes from distinct turnip lineages may suggest that parallel selection may have occurred, a hypothesis that could be tested explicitly with functional genomic approaches. Finally, our study opens up the possibility of understanding the genetic and phenotypic basis of domestication and feralization in B. rapa with direct comparisons possible between weedy CSI, turnip, and feral populations.
Materials and Methods
Genetic Analyses
Sampling
We obtained 289 B. rapa and outgroup samples through seed banks (e.g., United States Department of Agriculture [USDA]and The Leibniz Institute of Plant Genetics and Crop Plant Research-Gatersleben) and directly from researchers (supplementary material 1, Supplementary Material online). Three seeds from each accession as well as three seeds from each of the 333 USDA seed bank accessions used in the final GBS data set by Bird et al. (2017) were planted in 6” square pots using Promix High Porosity medium (Premier Tech, Rivière-du-Loup, Québec) at the Walnut Street Greenhouses at University of Wisconsin-Madison. Accessions were characterized morphologically using select standard descriptors developed by the International Board for Plant Genetic Resources (IBPGR 1987). Subspecies identity was evaluated (supplementary table S2, Supplementary Material online) using the following criteria: semiheading and heading with wrinkled leaves and swollen petioles—napa cabbage (B. rapa ssp. pekinensis), swollen petioles, and smooth spatulate leaves—bok choy (B. rapa ssp. chinensis), parted leaf dissection—mizuna (B. rapa ssp. nipposinica), flattened rosette of many smooth spatulate leaves (B. rapa ssp. narinosa), long swollen petioles with smooth orbicular leaves—komatsuna (Brassica rapa ssp. perviridis), enlarged root-hypocotyl and dissected lyrate leaves—turnip (B. rapa ssp. rapa), compact and enlarged inflorescence with dissected lyrate leaves—rapini and grelos (B. rapa ssp. sylvestris var. esculenta), yellow seeds—yellow sarsons (B. rapa ssp. trilocularis), lacking the traits previously mentioned but possessing dissected lyrate leaves—toria or brown sarson (B. rapa ssp. dichotoma), turnip rape (B. rapa ssp. oleifera) or weedy (B. rapa ssp. sylvestris) based on collection passport data (e.g. country, cultivated/uncultivated context).
GBS data from these samples were combined with a previously generated GBS diversity data set (Bird et al. 2017), providing a total of 653 samples before sample filtering. This diversity panel incorporates samples from six continents and 12 crop subspecies. Our panel augments Bird et al.’s (2017) sampling of East Asian leafy crops, South and Central Asian oilseeds, and European turnips by adding turnips from West Asia, South Central Asia (“Central Asia,” defined as Nepal, Northern India, Pakistan, and Afghanistan, Tajikistan, Kyrgyzstan, and Western China), and Europe, Mediterranean leafy crops (Spanish grelos and Italian rapini), and weedy occurring samples from Europe, the Caucasus, Siberia, the Americas, and New Zealand. We also included a Brassica oleracea sample (NGB 162411) from Denmark to serve as an outgroup for our tree-based analyses. We did not include three East Asian vegetable crops (caixin, zicaitai, taicai), which are thought to be selected from bok choy (Cheng et al. 2016). Choy sum also appears to be selected from bok choy and as a result is included with bok choy in our analyses.
DNA Extraction and Sequencing
DNA was extracted from young leaf material using the CTAB method (Doyle J and Doyle JL 1987) at the University of Wisconsin-Madison Biotechnology Center (UWBC). The restriction enzyme ApeKI was used to construct GBS libraries. Samples were 96-plexed in each of three lanes of Illumina HiSeq 2000 (Illumina Inc., San Diego, CA), and one well in each plate was left as a negative control. Raw sequence data were combined with that of Bird et al. (2017), who also used the ApeKI enzyme, and processed using the GBS 2 pipeline in Tassel 5 (Glaubitz et al. 2014). Parameters used for the pipeline can be found in supplementary table S8, Supplementary Material online. Reads were aligned to B. rapa ssp. pekinensis v1.5 reference genome (Wang et al. 2011) using Burrows–Wheeler alignment (Li and Durbin 2009). Raw reads are publicly available on Data Dryad at https://doi.org/10.5061/dryad.0gb5mkm0m (last accessed May 5, 2021).
SNP and Sample Filtering
The SNPs resulting from the Tassel 5 pipeline were filtered using VCFtools (Danecek et al. 2011) based on read depth (minimum mean depth = 3), number of alleles (only biallelic loci used), minimum number of genotypes scored per site (100% for PCA, 90% for all other analyses), and a minimum minor allele frequency of 5% for all analyses except demographic modeling (supplementary table S8, Supplementary Material online). In order to test the effects of different SNP filtering criteria, we ran the PCA, RAxML, and fastSTRUCTURE analyses with SNP data sets that had been filtered with minimum mean read depths of 3, 6, 9, and 12. We found minimal differences in the patterns recovered by the three analyses and so retained the minimum mean read depth of 3. Tassel 5 was used to filter SNP sites for a maximum heterozygosity of 50% and remove taxa that had <50% of scored loci. Samples identified by Bird et al. (2017) to be polyploids or otherwise problematic were removed (see Bird et al. 2017 for list of samples).
Diversity and Genetic Structure
To determine patterns of diversity within populations of B. rapa, we evaluated nucleotide diversity (π; Nei and Li 1979) in Tassel 5 (Glaubitz et al. 2014) and variance within and across populations and geographical groups using the AMOVA (Excoffier et al. 1992) in Arlequin 3.5 (Excoffier et al. 2005). Given that the data are GBS-derived SNPs that inherently optimize variability across individuals by targeting segregating sites, the ability to compare these nucleotide diversity results with those in other species or π calculated using other types of data are limited. For the AMOVA analysis, groups were defined geographically as follows: East Asia, Central Asia, Europe, and Americas. Six turnips of unknown provenance as they would not be informative. With the exception of the TreeMix analysis, which used populations delineated based on monophyletic clades through SNPhylo (supplementary fig. S2, Supplementary Material online), populations were delineated for all analyses based on clades recovered from the RAxML tree in conjunction with phenotypic data and provenance information. The polyphyletic weedy B. rapa populations were delineated based on provenance (weedy Europe, weedy Americas) except for the Weedy CSI (Caucasus, Siberia, Italy) clade which consistently formed a discrete clade distinct from these other lineages.
To determine the genetic structure of our samples, we used fastSTRUCTURE 1.0 (Raj et al. 2014). The Brassica oleracea outgroup sample was omitted from this analysis. We ran fastSTRUCTURE at group (K) values between 1 and 20 and used the ChooseK.py script included in the fastSTRUCTURE package to assess the K value that maximized marginal likelihood. FastSTRUCTURE plots were generated using the online utility STRUCTURE PLOT 2.0 (Ramasamy et al. 2014). To further investigate the genetic structure of the samples, we used PCA in Plink 1.07 (Purcell et al. 2007), running analyses that include all populations at once as well subsets of populations focusing on crops within geographic regions. To evaluate genetic differentiation among populations, we calculated FST (Weir and Cockerham 1984) between each pair of populations defined by subspecies and geography using Arlequin 3.5 (Excoffier et al. 2005).
Tree-Based Analyses
In order to visualize the hierarchical structure of the species, we generated an NJ tree (Saitou and Nei 1987) using Nei’s genetic distance and 100 bootstrap replicates in PAUP 4.0 (Swofford 2003), a maximum likelihood tree with RAxML 8.0 (Stamatakis 2014) on the CIPRES platform (Miller et al. 2012) using rapid bootstrapping (-fa and -x flags), 100 bootstraps, a GTR + Γ substitution model, and sample NGB 162411 (B. oleracea from Denmark) as an outgroup, and SNPhylo v. 20160204 (Lee et al. 2014) was run using 0.1 for LD threshold, 0.01 for minor allele threshold, 0.1 for missing rate, 1,000 bootstrap replicates, and again rooted with B. oleracea.
TreeMix and f4-Statistics
To test species tree relationships and introgression we used TreeMix v. 1.13 (Pickrell and Pritchard 2012). To reduce signals of recent introgression, samples were pruned to include only those recovered as monophyletic clades in the SNPhylo analysis (supplementary fig. 2, Supplementary Material online) and analyses were run for 0–8 migration edges with the following parameters: -root B. oleracea, -k 300. To provide additional support for inferred migration edges four-population (f4) tests (Reich et al. 2009) for treeness were used as implemented in TreeMix.
Demographic Analysis
In order to investigate population branching structure, population size changes, and migration between populations, we constructed and tested demographic models using moments v1.0.1 (Jouganous et al. 2017), which uses the joint AFS to fit parameterized demographic models. For this analysis, we did not filter by minimum minor allele frequency as it can artificially skew the AFS and bias inferences. Because numerous SNPs might not be called in every individual in each population, we projected sample sizes down to include more SNPs in each analysis (Marth et al. 2004). Total sample sizes and projection sizes are given in supplementary table S6, Supplementary Material online. For all models, we ran parameter inference using the folded AFS (using the minor allele frequency of all biallelic SNPs) to avoid biases caused by misidentification of the ancestral allele.
We tested demographic models using between two and four populations. To reduce the risk of model overparameterization, population sizes were allowed to vary between branches but were constant along each branch, and we allowed no more than two symmetric migration rates per model. We tested models for a total of 13 unique sets of populations (supplementary table S9, Supplementary Material online). For inferences involving four populations, we fixed best-fit parameters from simpler model inferences and only fit parameters for the additional populations. All models are illustrated in supplementary figure S7, Supplementary Material online, with parameter descriptions in supplementary table S7, Supplementary Material online. Best-fit parameters were inferred using the built-in composite likelihood-based optimization functions in moments, and confidence intervals were computed using the Godambe information matrix, which corrects test statistics to account for linkage between SNPs (Coffman et al. 2016).
To clarify the relative order of population splits and the mode of divergence (e.g., with or without ongoing gene flow or via admixture events) in the evolutionary history of turnips, toria, and the newly recovered clade of weedy samples from the Caucasus, Siberia, and Italy, we tested additional demographic models. These populations were selected because the PCA, fastStructure, and tree-based analyses suggested a potential key role in the early origins of B. rapa crops. To try to resolve these deeper splits, we tested the following parameterized demographic models using sets of three populations including the weedy CSI lineage: 1) Weedy CSI, Central Asian turnips, and East Asian turnips; 2) Weedy CSI, toria, and Central Asian turnips; 3) Weedy CSI, toria, and European turnips; and 4) Weedy CSI, toria, and East Asian turnips. For each set of three populations, we tested models with each of the three populations as the earlier diverging branch, as well as a model in which the weedy CSI were formed through an admixture event between the other two populations which had earlier diverged. For each of these four settings, we tested models that both allow and disallow migration, so that we fit eight models for each set of three populations.
Because computing the AFS for more than four populations is computationally burdensome, we compiled the demographic model for ten populations across the Eastern and Western clades by parsimoniously combining inferred models for subsets of these populations. This combined model should be treated with caution, as parameters were not refitted over the ten-population AFS. However, the topology and approximate split times are largely consistent with the inferred models for the 13 population subsets in supplementary table S9, Supplementary Material online. Moments returns parameter estimates in genetic units, scaled by 2Ne, where Ne is the ancestral effective population size. We used an estimate of Ne ≅ 40,000 from Qi et al. (2017), which assumed a mutation rate of 9e−9 (Kagale et al. 2014). To convert from generations to years, we assumed one year per generation. The parameters in all inferred demographic models using moments are in genetic units, that is, scaled by the ancestral Ne, which is set to 1. Thus, all population sizes nu_pop are relative sizes in comparison with the ancestral reference Ne. Times of splits and size changes are given in units of 2Ne generations, and migration rates are given in units of 2Nemper-gen, where mper-gen is the per-generation probability that a lineage in one population had its parent in another source population. In all models, theta = 4NeL is a free parameter that accounts for the linear scaling of the total number of variants in the data AFS, where Ne is the per-generation, per-site mutation rate and L is the effective number of sequenced sites.
We computed pairwise LD as , where p and q are the allele frequencies at the two loci, D is the standard covariance measure of LD, and expectations are estimated as averages of those statistics across pairs of loci (Ohta and Kimura 1969). This measure of LD is closely related to the familiar but allows for unbiased comparison of LD between cohorts with varying sample size (Ragsdale and Gravel 2020). We also computed average LD between all pairs of variants on separate chromosomes within each population, that is, average LD for unlinked loci within each population. LD between unlinked loci provides an estimate of effective population sizes over the last few generations, since expected for unlinked loci in randomly mating outbred populations (Waples 2006; Ragsdale and Gravel 2020), and rearranging this equation provides an estimate for Ne. Long-range LD is not only sensitive to Ne but can also be elevated due to recent admixture between diverged populations. However, LD between unlinked loci decays within just a handful of generations, so that Ne based on unlinked LD is largely insensitive to admixture that occurred more than 2–4 generations ago (Waples 2006).
Species Distribution Modeling
To obtain georeferenced B. rapa occurrence data, we drew from the Global Biodiversity Information Facility (GBIF, http://www.gbif.org, last accessed May 5, 2021). We removed duplicate records, records without location data, records without vouchers, and records from botanical gardens, and to ensure that the occurrence data included only noncrop B. rapa, we included only samples with location data suggesting uncultivated status. Occurrences were included that referenced roadsides, ditches, waste areas, abandoned fields, railroad tracks, parklands, or weediness in their metadata or were originally determined as the weedy ssp. campestris or ssp. sylvestris. Our initial GBIF search for B. rapa returned 34,361 occurrences. After filtering using the above criteria, 2,263 occurrences remained. To reduce bias in sampling that may arise from collecting in accessible areas close to population centers, we removed samples occurring within 100 km of each other using the spThin R package (Aiello‐Lammens et al. 2015). Spatial thinning resulted in a final data set of 638 occurrences. We obtained geographic information system rasters for 19 bioclimatic variables (supplementary table S10, Supplementary Material online) with 2.5-min spatial resolution from WorldClim Version 1.4 (Hijmans et al. 2005). We modeled the niche of weedy B. rapa and predicted compatible habitat given the CCSM4 mid-Holocene climate model (Meehl et al. 2012) and “contemporary” WorldClim data collected between 1960 and 1990 in MaxEnt (Phillips et al. 2006). For both contemporary and mid-Holocene models, all 19 variables were retained as recent simulations indicate that removing collinear variables does not significantly impact model performance with maximum entropy modeling (Feng et al. 2019). Model performance was evaluated with the adjusted area under receiver operating characteristic curve (DeLong et al. 1988). The linear, quadratic, product, and hinge features were used and jackknifes were used to measure variable importance.
Supplementary Material
Supplementary data are available at Molecular Biology and Evolution online.
Supplementary Material
Acknowledgments
This work was supported by a National Science Foundation Doctoral Dissertation Improvement Grant (grant number DEB-1601430 to A.C.M. and E.E.) and the funding from the University of Wisconsin-Madison (UW-Madison) Department of Botany. J.C.P., K.B., H.A., and M.E.M. were supported by the US Department of Energy (DOE HDTRA 1-16-1-0048), the US Department of Agriculture, and the US National Science Foundation (NSF IOS 1339156). We would like to thank the staff of UW-Madison Biotechnology Center and Bioinformatics Resource Center for their support with sequencing and analysis and the Walnut Street Greenhouse at UW-Madison for assistance with growing the plants for phenotyping.
Data Availability
The data for this article have been deposited in Data Dryad repository (https://datadryad.org/stash/dashboard, last accessed May 5, 2021) and are available at the following link: https://doi.org/10.5061/dryad.0gb5mkm0m, last accessed May 5, 2021.
References
- Aiello-Lammens ME, Boria RA, Radosavljevic A, Vilela B, Anderson RP.. 2015. spThin: an R package for spatial thinning of species occurrence records for use in ecological niche models. Ecography 38(5):541–545. [Google Scholar]
- Aissiou F, Laperche A, Falentin C, Lodé M, Deniot G, Boutet G, Régnier F, Trotoux G, Huteau V, Coriton O, et al. 2018. A novel Brassica rapa L. genetic diversity found in Algeria. Euphytica 214(12):241. [Google Scholar]
- Allaby RG, Fuller DQ, Brown TA.. 2008. The genetic expectations of a protracted model for the origins of domesticated crops. Proc Natl Acad Sci U S A. 105(37):13982–13986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- An H, Qi X, Gaynor ML, Hao Y, Gebken SC, Mabry ME, McAlvay AC, Teakle GR, Conant GC, Barker MS, et al. 2019. Transcriptome and organellar sequencing highlights the complex origin and diversification of allotetraploid Brassica napus. Nat Commun. 10(1):1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Andersen NS, Poulsen G, Andersen BA, Kiær LP, D’Hertefeldt T, Wilkinson MJ, Jørgensen RB.. 2009. Processes affecting genetic structure and conservation: a case study of wild and cultivated Brassica rapa. Genet Resour Crop Evol. 56(2):189–200. [Google Scholar]
- Annisa Chen S, Cowling WA.. 2013. Global genetic diversity in oilseed Brassica rapa. Crop Pasture Sci. 64:993–1007. [Google Scholar]
- Aono M, Wakiyama S, Nagatsu M, Kaneko Y, Nishizawa T, Nakajima N, Tamaoki M, Kubo A, Saji H.. 2011. Seeds of a possible natural hybrid between herbicide-resistant Brassica napus and Brassica rapa detected on a riverbank in Japan. GM Crops 2(3):201–210. [DOI] [PubMed] [Google Scholar]
- Beebe S, Ch OT, Gonza AV, Chaco MI, Debouck DG.. 1997. Wild-weed-crop complexes of common bean (Phaseolus vulgaris L., Fabaceae) in the Andes of Peru and Colombia, and their implications for conservation and breeding. Genet Resour Crop Evol. 44(1):73–91. [Google Scholar]
- Bird KA, An H, Gazave E, Gore MA, Pires JC, Robertson LD, Labate JA.. 2017. Population structure and phylogenetic relationships in a diverse panel of Brassica rapa L. Front Plant Sci. 8:321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brozynska M, Furtado A, Henry RJ.. 2016. Genomics of crop wild relatives: expanding the gene pool for crop improvement. Plant Biotechnol J. 14(4):1070–1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burkill IH, Haniff M.. 1930. Malay village medicine: prescriptions collected. Gardens Bull Strait Settlements 6:176–177. [Google Scholar]
- Cheng F, Sun R, Hou X, Zheng H, Zhang F, Zhang Y, Liu B, Liang J, Zhuang M, Liu Y, et al. 2016. Subgenome parallel selection is associated with morphotype diversification and convergent crop domestication in Brassica rapa and Brassica oleracea. Nat Genet. 48(10):1218–1224. [DOI] [PubMed] [Google Scholar]
- Coffman AJ, Hsieh PH, Gravel S, Gutenkunst RN.. 2016. Computationally efficient composite likelihood statistics for demographic inference. Mol Biol Evol. 33(2):591–593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crouch JH, Lewis BG, Lydiate DJ, Mithen R.. 1995. Genetic diversity of wild, weedy and cultivated forms of Brassica rapa. Heredity 74(5):491–496. [Google Scholar]
- Cruz-Cárdenas CI, Cortés-Cruz M, Gardner CA, Costich DE.. 2019. Wild relatives of maize. In: North American crop wild relatives. Vol. 2. Cham: Springer. p. 3–39. [Google Scholar]
- Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, et al. ; 1000 Genomes Project Analysis Group. 2011. The variant call format and VCFtools. Bioinformatics 27(15):2156–2158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Darwin C. 1868. The variation of animals and plants under domestication. Vol. 2. London: John Murray.
- De Candolle AD. 1886. Origin of cultivated plants. New York: Appleton. [Google Scholar]
- DeLong ER, DeLong DM, Clarke-Pearson DL.. 1988. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44(3):837–845. [PubMed] [Google Scholar]
- Del Carpio DP, Basnet RK, De Vos RC, Maliepaard C, Paulo MJ, Bonnema G.. 2011. Comparative methods for association studies: a case study on metabolite variation in a Brassica rapa core collection. PLoS One 6(5):e19624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dempewolf H, Eastwood RJ, Guarino L, Khoury CK, Müller JV, Toll J.. 2014. Adapting agriculture to climate change: a global initiative to collect, conserve, and use crop wild relatives. Agroecol Sustain Food Syst. 38(4):369–377. [Google Scholar]
- Diederichsen A. 2001. Mansfelds encyclopedia of agricultural and horticultural crops (except ornamentals). In: Hanelt P, editor. Brassica rapa group. Berlin: Springer. p. 1446–1453. [Google Scholar]
- Dong JJ, Zhang MG, Wei W, Ma KP, Wang YH.. 2018. GIS assessment of the risk of gene flow from Brassica napus to its wild relatives in China. Environ Monit Assess. 190(7):405. [DOI] [PubMed] [Google Scholar]
- Doyle J, Doyle JL.. 1987. Genomic plant DNA preparation from fresh tissue-CTAB method. Phytochem Bull. 19(11):11–15. [Google Scholar]
- Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE.. 2011. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One 6(5):e19379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Excoffier L, Smouse PE, Quattro JM.. 1992. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131(2):479–491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Excoffier L, Laval G, Schneider S.. 2005. Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol Bioinform Online. 1:117693430500100. [PMC free article] [PubMed] [Google Scholar]
- FAOSTAT. 2014. [cited 2016 Jun 16]. Available from: http://www.fao.org/faostat/en/. Accessed May 5, 2021.
- Feng X, Park DS, Liang Y, Pandey R, Papeş M.. 2019. Collinearity in ecological niche modeling: confusions and challenges. Ecol Evol. 9(18):10365–10376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Francisco M, Velasco P, Romero Á, Vázquez L, Cartea ME.. 2009. Sensory quality of turnip greens and turnip tops grown in northwestern Spain. Eur Food Res Technol. 230(2):281–290. [Google Scholar]
- Fuller DQ, Denham T, Arroyo-Kalin M, Lucas L, Stevens CJ, Qin L, Allaby RG, Purugganan MD.. 2014. Convergent evolution and parallelism in plant domestication revealed by an expanding archaeological record. Proc Natl Acad Sci U S A. 111(17):6147–6152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gepts P, Famula TR, Bettinger RL, Brush SB, Damania AB, McGuire PE, Qualset CO.. 2012. Biodiversity in agriculture: domestication, evolution, and sustainability. Vol. 2156. Cambridge: Cambridge University Press. [Google Scholar]
- Glaubitz JC, Casstevens TM, Lu F, Harriman J, Elshire RJ, Sun Q, Buckler ES.. 2014. TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS One 9(2):e90346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gómez-Campo C, Prakash S.. 1999. Origin and domestication. In: Gómez-Campo C, editor. Developments in plant genetics and breeding. Vol. 4. Amsterdam: Elsevier. p. 33–58. [Google Scholar]
- Gressel J, editor. 2005. Introduction: the challenges of ferality. Crop ferality and volunteerism. Boca Raton (FL: ): CRC Press. p. 1–7. [Google Scholar]
- Guarino L, Lobell DB.. 2011. A walk on the wild side. Nat Climate Change. 1(8):374–375. [Google Scholar]
- Guo Y, Chen S, Li Z, Cowling WA.. 2014. Center of origin and centers of diversity in an ancient crop, Brassica rapa (turnip rape). J Heredity. 105(4):555–565. [DOI] [PubMed] [Google Scholar]
- Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A.. 2005. Very high resolution interpolated climate surfaces for global land areas. Int J Climatol. 25(15):1965–1978. [Google Scholar]
- Honnay O, Jacquemyn H, Aerts R.. 2012. Crop wild relatives: more common ground for breeders and ecologists. Front Ecol Environ. 10(3):121. [Google Scholar]
- Hyams E. 1971. Plants in the service of man. 10,000 years of domestication. London: JM Dent & Sons Ltd. [Google Scholar]
- IBPGR. 1987. Descriptors for Brassica campestris L. Rome: International Board for Plant Genetic Resources.
- Ignatov AN, Artemyeva AM, Hida K.. 2008. Origin and expansion of cultivated Brassica rapa in Eurasia: linguistic facts. ISHS Acta Horticulturae 867: 81–88. [Google Scholar]
- Jouganous J, Long W, Ragsdale AP, Gravel S.. 2017. Inferring the joint demographic history of multiple populations: beyond the diffusion approximation. Genetics 206(3):1549–1567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozak KH, Graham CH, Wiens JJ.. 2008. Integrating GIS-based environmental data into evolutionary biology. Trends Ecol Evol. 23(3):141–148. [DOI] [PubMed] [Google Scholar]
- Larson G, Piperno DR, Allaby RG, Purugganan MD, Andersson L, Arroyo-Kalin M, Barton L, Climer Vigueira C, Denham T, Dobney K, et al. 2014. Current perspectives and the future of domestication studies. Proc Natl Acad Sci U S A. 111(17):6139–6146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li CW. 1981. The origin, evolution, taxonomy and hybridization of Chinese cabbage. Chinese Cabbage 1:1–10. [Google Scholar]
- Li H, Durbin R.. 2009. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25(14):1754–1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li LF, Li YL, Jia Y, Caicedo AL, Olsen KM.. 2017. Signatures of adaptation in the weedy rice genome. Nat Genet. 49(5):811–814. [DOI] [PubMed] [Google Scholar]
- Lister DL, Jones H, Oliveira HR, Petrie CA, Liu X, Cockram J, Kneale CJ, Kovaleva O, Jones MK.. 2018. Barley heads east: genetic analyses reveal routes of spread through diverse Eurasian landscapes. PLoS One 13(7):e0196652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martínez S, Pérez N, Carballo J, Franco I.. 2013. Effect of blanching methods and frozen storage on some quality parameters of turnip greens (“grelos”). LWT-Food Sci Technol. 51(1):383–392. [Google Scholar]
- McAlvay AC, Bird K, Poulsen G, Pires JC, Emshwiller E.. 2018. Barriers and prospects for wild crop relative research in Brassica rapa. Acta Horticult. 1202:165–177. [Google Scholar]
- McGrath JM, Quiros CF.. 1992. Genetic diversity at isozyme and RFLP loci in Brassica campestris as related to crop type and geographical origin. Theor Appl Genet. 83(6–7):783–790. [DOI] [PubMed] [Google Scholar]
- Meyer RS, DuVal AE, Jensen HR.. 2012. Patterns and processes in crop domestication: an historical review and quantitative analysis of 203 global food crops. New Phytol. 196(1):29–48. [DOI] [PubMed] [Google Scholar]
- Meehl GA, Washington WM, Arblaster JM, Hu A, Teng H, Tebaldi C, Sanderson BN, Lamarque JF, Conley A, Strand WG, et al. 2012. Climate system response to external forcings and climate change projections in CCSM4. J Climate. 25(11):3661–3683. [Google Scholar]
- Miller MA, Pfeiffer W, Schwartz T.. 2012. The CIPRES science gateway: enabling high-impact science for phylogenetics researchers with limited resources. In: Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment: Bridging from the eXtreme to the campus and beyond. ACM. 39 pp.
- Maggioni L. 2015. Domestication of Brassica oleracea L. [PhD thesis]. [Alnarp (Sweden)]: Swedish University of Agricultural Sciences.
- Nakazato T, Warren DL, Moyle LC.. 2010. Ecological and geographic modes of species divergence in wild tomatoes. Am J Bot. 97(4):680–693. [DOI] [PubMed] [Google Scholar]
- Nei M, Li WH.. 1979. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc Natl Acad Sci U S A. 76(10):5269–5273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ohta T, Kimura M.. 1969. Linkage disequilibrium at steady state determined by random genetic drift and recurrent mutation. Genetics 63(1):229–238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olsen KM, Gross BL.. 2008. Detecting multiple origins of domesticated crops. Proc Natl Acad Sci U S A. 105(37):13701–13702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Page A, Gibson J, Meyer RS, Chapman MA.. 2019. Eggplant domestication: pervasive gene flow, feralization, and transcriptomic divergence. Mol Biol Evol. 36(7):1359–1372. [DOI] [PubMed] [Google Scholar]
- Pertl M, Hauser TP, Damgaard C, Jørgensen RB.. 2002. Male fitness of oilseed rape (Brassica napus), weedy B. rapa and their F1 hybrids when pollinating B. rapa seeds. Heredity 89(3):212–218. [DOI] [PubMed] [Google Scholar]
- Phillips SJ, Anderson RP, Schapire RE.. 2006. Maximum entropy modeling of species geographic distributions. Ecol Model. 190(3–4):231–259. [Google Scholar]
- Phillips SJ, Dudik M.. 2008. Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation. Ecography 31(2):161–175. [Google Scholar]
- Pickrell J, Pritchard J.. 2012. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 8(11):e1002967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prakash S, Wu XM, Bhat SR.. 2011. History, evolution, and domestication of Brassica crops. Plant Breed Rev. 35:19–84. [Google Scholar]
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, De Bakker PI, Daly MJ, et al. 2007. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 81(3):559–575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qi X, An H, Ragsdale AP, Hall TE, Gutenkunst RN, Chris Pires J, Barker MS.. 2017. Genomic inferences of domestication events are corroborated by written records in Brassica rapa. Mol Ecol. 26(13):3373–3388. [DOI] [PubMed] [Google Scholar]
- Ragsdale AP, Gravel S.. 2020. Unbiased estimation of linkage disequilibrium from unphased data. Mol Biol Evol. 37(3):923–932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raj A, Stephens M, Pritchard JK.. 2014. fastSTRUCTURE: variational inference of population structure in large SNP data sets. Genetics 197(2):573–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramasamy RK, Ramasamy S, Bindroo BB, Naik VG.. 2014. STRUCTURE PLOT: a program for drawing elegant STRUCTURE bar plots in user friendly interface. Springerplus 3(1):431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- *Reiner H, Holzner W, Ebermann R.. 1995. The development of turnip-type and oilseed-type Brassica rapa crops from the wild-type in Europe: an overview of botanical, historical and linguistic facts. In: Rapeseed Today and Tomorrow. Vol. 4. Cambridge: Organising Committee of the Ninth International Rapeseed Congress. p. 1066–1069. [Google Scholar]
- Ren J, McFerson JR, Li R, Kresovich S, Lamboy WF.. 1995. Identities and relationships among Chinese vegetable Brassicas as determined by random amplified polymorphic DNA markers. J Am Soc Horticult Sci. 120(3):548–555. [Google Scholar]
- Saitou N, Nei M.. 1987. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 4(4):406–425. [DOI] [PubMed] [Google Scholar]
- Sethi SC, Chowdhury JB, Sareen PK.. 1970. Studies on Intervarietal Crosses in Brassica campestris L. Cytologia 35(1):50–56. [Google Scholar]
- Siemonsma JS, Piluek K.. 1993. Plant resources of south-east Asia. Vol. 8. Vegetables. Wageningen (Netherlands): Pudoc. p. 121–134. [Google Scholar]
- Sinskaya EN. 1969. Historical geography of cultivated floras (at the dawn of agriculture). Leningrad (USSR: ): Kolos. [Google Scholar]
- Song KM, Osborn TC, Williams PH.. 1988. Brassica taxonomy based on nuclear restriction fragment length polymorphisms (RFLPs). Theor Appl Genet. 75(5):784–794. [DOI] [PubMed] [Google Scholar]
- Song K, Osborn TC, Williams PH.. 1990. Brassica taxonomy based on nuclear restriction fragment length polymorphisms (RFLPs). Theor Appl Genet. 79(4):497–506. [DOI] [PubMed] [Google Scholar]
- Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30(9):1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Swofford DL. 2003. PAUP: phylogenetic analysis using parsimony, version 4.0 b10. Cambridge: Cambridge University Press.
- Takuno S, Fujimoto R, Sugimura T, Sato K, Okamoto S, Zhang SL, Nishio T.. 2007. Effects of recombination on hitchhiking diversity in the Brassica self-incompatibility locus complex. Genetics 177(2):949–958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tanhuanpää P, Erkkilä M, Tenhola-Roininen T, Tanskanen J, Manninen O.. 2016. SNP diversity within and among Brassica rapa accessions reveals no geographic differentiation. Genome 59(1):11–21. [DOI] [PubMed] [Google Scholar]
- Wang X, Wang H, Wang J, Sun R, Wu J, Liu S, Bai Y, Mun J-H, Bancroft I, Cheng F, et al. 2011. The genome of the mesopolyploid crop species Brassica rapa. Nat Genet. 43(10):1035–1039. [DOI] [PubMed] [Google Scholar]
- Wang H, Vieira FG, Crawford JE, Chu C, Nielsen R.. 2017. Asian wild rice is a hybrid swarm with extensive gene flow and feralization from domesticated rice. Genome Res. 27:1029–1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waples RS. 2006. A bias correction for estimates of effective population size based on linkage disequilibrium at unlinked gene loci. Conserv Genet. 7(2):167–184. [Google Scholar]
- Warwick SI, Legere A, SIMARD MJ, James T.. 2008. Do escaped transgenes persist in nature? The case of an herbicide resistance transgene in a weedy Brassica rapa population. Mol Ecol. 17(5):1387–1395. [DOI] [PubMed] [Google Scholar]
- Weir BS, Cockerham CC.. 1984. Estimating F‐statistics for the analysis of population structure. Evolution 38(6):1358–1370. [DOI] [PubMed] [Google Scholar]
- Weis AE, Kossler TM.. 2004. Genetic variation in flowering time induces phenological assortative mating: quantitative genetic methods applied to Brassica rapa. Am J Bot. 91(6):825–836. [DOI] [PubMed] [Google Scholar]
- Wilkes HG. 1967. Teosinte: the closest relative of maize. Cambridge: Bussey Institution of Harvard University.
- Yang J, Zhang C, Zhao N, Zhang L, Hu Z, Chen S, Zhang M.. 2018. Chinese root-type mustard provides phylogenomic insights into the evolution of the multi-use diversified allopolyploid Brassica juncea. Mol Plant. 11(3):512–514. [DOI] [PubMed] [Google Scholar]
- Zeder MA. 2015. Core questions in domestication research. Proc Natl Acad Sci U S A. 112(11):3191–3198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang H, Mittal N, Leamy LJ, Barazani O, Song BH.. 2017. Back into the wild: apply untapped genetic diversity of wild relatives for crop improvement. Evol Appl. 10(1):5–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao J, Wang X, Deng B, Lou P, Wu J, Sun R, Xu Z, Vromans J, Koornneef M, Bonnema G.. 2005. Genetic relationships within Brassica rapa as inferred from AFLP fingerprints. Theor Appl Genet. 110(7):1301–1314. [DOI] [PubMed] [Google Scholar]
- Zhao J, Paulo MJ, Jamar D, Lou P, Van Eeuwijk F, Bonnema G, Vreugdenhil D, Koornneef M.. 2007. Association mapping of leaf traits, flowering time, and phytate content in Brassica rapa. Genome 50(10):963–973. [DOI] [PubMed] [Google Scholar]
- Zohary D, Hopf M.. 2000. Domestication of plants in the Old World: the origin and spread of cultivated plants in West Asia, Europe and the Nile Valley. Oxford: Oxford University Press. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data for this article have been deposited in Data Dryad repository (https://datadryad.org/stash/dashboard, last accessed May 5, 2021) and are available at the following link: https://doi.org/10.5061/dryad.0gb5mkm0m, last accessed May 5, 2021.