Skip to main content
Ecology and Evolution logoLink to Ecology and Evolution
. 2021 Jul 28;11(16):10926–10935. doi: 10.1002/ece3.7870

Population structure, seasonal genotypic differentiation, and clonal diversity of weedy dandelions in three Boston area populations (Taraxacum sp.)

Lisa Mazumder 1,, Rick Kesseli 1
PMCID: PMC8366855  PMID: 34429891

Abstract

Weedy dandelions have a worldwide distribution and thrive in urban environments despite a lack of sexual reproduction throughout most of its range. North American dandelions, introduced from Eurasia, are believed to be primarily, if not exclusively, apomictic triploids. In some European populations, apomicts co‐occur with diploid sexual individuals and hybridizations can create genetically unique apomicts, which may subsequently disperse and establish new populations globally. Using six nuclear microsatellite markers and a cpDNA intergenic spacer, we investigate the impact of this unusual natural history on population structure and diversity in three urban Boston area dandelion populations. Our results show high levels of genetic diversity within populations, spatial population structure, and seasonal genotypic differentiation in flowering times. We find evidence that sexual reproduction and recombination, presumably in Europe, and extensive gene flow drive these patterns of diversity and create the appearance of panmixia despite the lack of evidence for local sexual reproduction.

Keywords: apomixis, clonal diversity, cpDNA, dandelion, linkage disequilibrium, microsatellites


This study investigates population genetics, clonal diversity, and genotypic differences in flowering time of weedy dandelions in an urban area. We find population structure among sites and that specific genotypes are flowering during certain seasons. Populations exhibit high genetic diversity, are asexual, and mimic panmixia as a result of mass dispersal from European source populations.

graphic file with name ECE3-11-10926-g004.jpg

1. INTRODUCTION

Weedy dandelion, Taraxacum officinale Weber ex. Wiggers, is a highly successful, cosmopolitan, short‐lived perennial species that thrives in disturbed environments. Dandelion likely originated 30 million years ago in the western Himalayas, and its recent history and center of diversity is in western Asia and Europe (Solbrig, 1971). The now notorious weed, once revered for its nutritional and medicinal value, was carried both deliberately and passively to new regions by European settlers (Richards, 1973; Solbrig, 1971). More recently, air travel and agricultural transfer have likely increased dispersal frequency and extent for many plant propagules (Hodkinson & Thompson, 1997; Mack & Lonsdale, 2001). In addition to this human‐aided dispersal, the success of the dandelion has likely been facilitated by recently discovered novel airflow vortex behaviors associated with the pappus, resulting in dispersal distances reported to be as high as 150 km or more (Cummins et al., 2018 and citations there in).

Dandelion also has an unusual apomictic reproductive system, resulting in hundreds of genetically identical clonal seeds being produced from single triploid parents (Asker & Jerling, 1992). Though apomixis confers a highly efficient mode of reproduction, purely apomictic populations should suffer due to low genetic diversity and an accumulation of deleterious mutations (Felsenstein, 1974; Müller, 1964). However, diploid sexual and triploid apomictic dandelions can hybridize in the native European range where they coexist, thereby creating new, genetically unique apomictic lineages. These new apomictic lines then presumably disperse to establish new populations (Menken et al., 1995; Richards, 1973; Van Der Hulst et al., 2003). This mixed reproductive strategy allows the dandelion to generate new genetic diversity via sexual reproduction while taking advantage of the efficiency of apomictic reproduction. Though purely sexual populations are rare even in Europe (Menken et al., 1995; Van Der Hulst et al., 2003), cryptic sexual reproduction is certainly possible outside of Europe among pollen‐producing apomicts or with hidden sexual diploids and this would provide an evolutionary advantage, especially in the face of unexpected environmental stochasticity (Williams, 1975).

In the introduced range of North America, where only triploid apomictic dandelions have been reported, prior studies have uncovered higher than expected genetic diversity for an asexual species (King, 1993; Lyman & Ellstrand, 1984; Rogstad et al., 2001), with multiple clonal genotypes existing in the same area. Prior studies assessing genetic diversity, as well as a more recent study assessing genomic size variation in dandelions, have not identified any sexual dandelions in North America (Iaffaldano et al., 2017; King, 1993; Lyman & Ellstrand, 1984). The studies of Lyman and Ellstrand (1984) and King (1993) sampled a wide geographic range of North America with relatively low sample sizes per population and concluded that the observed genetic diversity was arising primarily via sexual recombination in European mixed apomictic–sexual populations and new apomictic lineages were dispersing over to North America. Rogstad et al. (2001) studied a more localized region with deeper sampling of three populations in Kentucky and Ohio using variable number tandem repeat (VNTR) loci. While this study demonstrated a strong role of sexual recombination in the observed genetic diversity, the researchers also suggested that somatic mutation may be playing a greater role in diversification in clonal lineages than previously thought, leading to slightly differentiated “clonal families.” The higher level of diversity in Rogstad et al. (2001) study probably reflects the nature of the VNTR molecular markers, which are highly polymorphic and have high mutation rates.

The occurrence of genetically distinct clones in populations begs the question about if and how these clonal lineages evolve and adapt to local conditions. This question is particularly intriguing in the face of urbanization and global climate change, as urban populations must have wide environmental tolerances and be able to survive extreme abiotic stresses (Johnson et al., 2015). Life history and flowering time differences among clonal genotypes may allow genetically diverse dandelions to coexist in the same locations. Seasonal variability across different geographic regions of North America affects the exact months during which dandelions experience flowering (Collier & Rogstad, 2004; Vellend et al., 2009). In a controlled greenhouse experiment, a significant difference in flowering time among clones was uncovered (Collier & Rogstad, 2004). Common garden studies also showed ecological differences in genotypes, including flowering time (Vellend et al., 2009). Shifting flowering times in the assessment of seasonality, particularly in urban environments where true seasonality is affected by urban structures, has recently become an important and new area of study. Studies show that the flowering seasons of many plant species have been shifting and extending, with both earlier and longer durations than ever before (Primack et al., 2004).

In this study, we investigate the local population genetics of Boston‐area, urban dandelions. Our research objectives are as follows: (1) to characterize clonal diversity in three populations using six polymorphic microsatellite markers and the trnH‐psbA intergenic spacer region of chloroplast DNA, (2) to assess spatial structure and flowering time patterns as an indicator of life history diversity, and (3) to test for linkage disequilibrium, the signature of clonal reproduction, and panmixia to reveal the relative importance of sexual reproduction, dispersal and somatic mutation in the establishment of local population structure.

2. MATERIALS AND METHODS

2.1. Sample collection

Sampling for this study was performed over two years on a monthly basis April through November in 2017 and 2018; plants flowering well beyond this period are found but flowering is sporadic and their numbers are low. We collected 568 dandelion samples from three stable, undisturbed (other than occasional mowing) sites covering approximately a 20 km range from north to the south of Boston, MA: Prospect Hill Monument, Somerville, MA; Cedar Grove Cemetery, Boston, MA; and 12 Moon Farm in Milton, MA, respectively. Leaf tissue was collected only from individuals in flower, separated at least four meters apart, and placed in separate Ziploc bags. Samples were kept fresh or stored in −20°C until the time of extraction. Leaf tissue was typically not stored in the freezer for longer than one month before DNA extraction.

2.2. DNA extraction, PCR, sequencing and fragment analysis

Approximately 100 mg of leaf tissue was used for DNA extraction using the DNeasy Plant Mini Kit (Qiagen, Valencia, CA) following manufacturer's protocol with slight modifications. Frozen leaf tissue was measured out and manually crushed using sterilized ceramic mortar and pestles. Lysation buffer (200 µl) was added directly onto the leaf tissue to aid in manual crushing. The elution buffer was heated at 65°C for 10 min prior to the final DNA elution step to increase DNA yield. Whole genomic DNA was quantified and checked for purity on a NanoDrop 2000C. DNA dilutions on samples were performed to a level of 0.5–0.7 ng. PCR was performed using the diluted DNA as the template. PCR for the trnH‐psbA intergenic spacer region was performed according to primers and PCR specifications from Hamilton (1999). PCRs for the cpDNA were performed in a total volume of 25 µl per reaction with 8.8 µl autoclaved MilliQ water, 5 µl of 5X PCR green flexi buffer, 0.5 µl of 10 mM dNTPs, 2.5 µl of 25 mM MgCl2, 1.5 µl of 10 µM of forward and reverse primers, 0.2 µl of GoTaq DNA polymerase (Promega), and 5 µl of diluted DNA. The thermal cycler conditions for the trnH‐psbA region were as follows: 95°C for 2 min; 30 cycles of 95°C for 1 min, 52°C for 1 min, 72°C for 1 min; ending with 72°C for 5 min, and a final 4°C hold. Amplicons for the trnH‐psbA region were confirmed on 1% ethidium bromide agarose gels, and positive PCR products were prepared and sent for PCR cleanup and Sanger sequencing in the forward direction to Eton Biosciences, Inc. Multiplex PCR was performed for six microsatellites developed specifically for dandelions (Falque et al., 1998; Vasut et al., 2004) in two multiplexed sets of three loci each using the fluorescent dyes 6‐FAM, VIC, AND NED (Thermo Fisher Scientific, Inc.). The fluorescent tag was directly applied to the 5′ end of the forward primer, while the reverse primer was kept untagged. The loci (6‐FAM)‐MSTA31, (VIC)‐MSTA67, and (NED)‐MSTA78 were multiplexed together; while the loci (6‐FAM)‐MSTA101, (VIC)‐MSTA102, and (NED)‐MSTA143 were multiplexed together. PCRs for the microsatellite markers were performed in a total volume of 25 µl per reaction with 9.8 µl autoclaved MilliQ water, 5 µl of 5X PCR green flexi buffer, 0.5 µl of 10 mM dNTPs, 2.5 µl of 25 mM MgCl2, 0.33 µl of 10 µM of forward and reverse primers (a total of 3 multiplexed pairs), 0.2 µl of GoTaq DNA polymerase (Promega), and 5 µl of diluted DNA. The thermal cycler conditions for touchdown PCR as described in Zavada et al. (2017) were followed for the amplification of the microsatellite markers. Microsatellite amplicons were confirmed with 2% ethidium bromide agarose gels. Multiplexed PCR products of the microsatellite markers were sent for fragment analysis to the Roy J. Carver Biotechnology center at the University of Illinois, Urbana‐Champaign where samples were run on an Applied Biosystems 3730xl DNA Analyzer.

2.3. Data processing and analysis

For the cpDNA, a randomized subset of 96 individuals spanning the three sites and all collection time intervals was sequenced in the forward direction for the trnH‐psbA marker; 93 high‐quality sequences were obtained from the sequencing effort. Sanger sequences of the cpDNA were edited for artifacts, and multiple alignment of sequences was constructed using the CLUSTALW algorithm performed using Geneious version 10.0.5 (Kearse et al., 2012). A BLAST was performed against the NCBI GenBank® database to confirm that sequences were from dandelion DNA. A FASTA file of the alignment was exported for use in downstream haplotype analysis in R version 3.4.4 (R Core Team). Haplotype networks were constructed using the R package pegas version 0.11 (Paradis, 2010).

For the microsatellite markers, 568 samples were assayed for fragment sizes. Peak profiles were visualized and manually scored using the microsatellite plug‐in in Geneious version 10.0.5 and stored in a csv file format. Allele bins were determined by observing common peak patterns and seasonal categories of spring (April–June), summer (July–August), and fall (September–November) were assigned based on the month of collection. The R package poppr version 2.8.3 (Kamvar et al., 2014, 2015) was used for calculation of population genetics statistics, linkage disequilibrium (LD), and gene flow analysis with k‐means hierarchical clustering. For LD, the index of association, r¯ d, and p‐value were calculated for each of the three populations. Calculations were performed for both the entire dataset as well as for clone‐corrected data in which duplicate individuals with identical alleles for all loci were removed. The null hypothesis for each test was that there would be no LD between markers. The R package adegenet version 2.1.1 (Jombart, 2008; Jombart & Ahmed, 2011) was used to perform Discriminant Analysis of Principal Components (DAPC) to infer population structure; by determining genetic clusters spatially by site, and temporally by season. The R package ape version 5.1 (Paradis & Schliep, 2019) was used to create and visualize the dendrograms in the gene flow analysis; genotypes with missing data were removed before generating the dendrograms or k‐means clusters.

An analysis of molecular variance (AMOVA) was performed in the R package poppr version 2.8.0 (Kamvar et al., 2014, 2015) with significance testing of the variance matrices via random permutations (Excoffier et al., 1992) performed with R package ade4 version 1.7.11 (Bougeard & Dray, 2018; Chessel et al., 2004; Dray & Dufour, 2007; Dray et al., 2007; Thioulouse et al., 2018). AMOVA was performed for both non‐clone‐corrected and clone‐corrected data. Pairwise Gst values (Hedrick, 2005) were calculated for the three populations and the three seasons; as well as separate populations and seasons using the R package poppr version 2.8.0 (Kamvar et al., 2014, 2015). For the assessment of clonal diversity in the populations, a customized R script was used to bin alleles when the number of alleles was identical for each of the respective loci but differed in length by ±4 base pairs. Based on allele pattern recognition, it was assumed that those differing in length by a few base pairs but containing exactly the same number of alleles at all the positions for each of the six loci are likely the same clone and the slight base pair difference can be attributed to scoring bias or software error.

A Bayesian clustering approach, STRUCTURE version 2.3.4 (Pritchard et al., 2000; Falush et al., 2003, Falush et al., 2007; and Hubisz et al., 2009), was applied to the microsatellite data. Analyses were run for the total dataset of n = 551 for the three sites set as “populations” as well as the three seasons set as the “populations.” Two additional analyses were performed: (1) with the cpDNA haplotype groupings set as “populations” for n = 91 samples in order to visualize nuclear–cytoplasmic recombinant groupings established at each population site and (2) with the clone‐corrected dataset where all multilocus genotypes were represented once and any samples with missing data removed for n = 270. For all STRUCTURE analyses, admixture model and correlation between alleles were assumed. For the total dataset of n = 551, an initial simulation was performed for K = 2 through K = 20 for burn‐in period of 200,000 with an additional 200,000 Markov Chain Monte Carlo (MCMC) replicates over five iterations in order to implement post hoc tests of optimal K clusters. For both the haplotype subset of n = 91 and clone‐corrected subset of n = 270, an initial simulation of K = 2 through K = 15 for burn‐in period of 200,000 with an additional 200,000 MCMC replicates over five iterations was performed. These initial simulations were run through Structure Harvester (http://taylor0.biology.ucla.edu/structureHarvester/; Earl & vonHoldt, 2012) where optimal number of clusters, K, were determined from “Delta K’’ computed based on rate of change of the mean of the standardized log‐likelihood (Evanno et al., 2005). For the total dataset of n = 551, the optimal was K = 11; in the haplotype subset of n = 91, the optimal was K = 3; and in the clone‐corrected subset, the optimal was K = 12. Using these optimal K values, a subsequent simulation was performed for each dataset for burn‐in period 200,000 and 1,000,000 MCMC replicates.

3. RESULTS

3.1. cpDNA analysis and haplotype networks

We obtained 93 high‐quality sequences from the sequencing effort. Sequence length of the multiple alignment is 410 bp. Haplotype network analysis in the R package pegas (Paradis, 2010) resulted in five total haplotypes, with 92.5% of samples falling into three main haplotypes. HAP1 and HAP2 each have 33 and 32 individuals respectively, while HAP3 has 21 individuals. Two uncommon haplotypes had one sample each. The three common haplotypes were detected in all sites, while one uncommon haplotype was from Cedar Grove Cemetery and one from Prospect Hill Monument (Figure 1).

FIGURE 1.

FIGURE 1

Haplotype network of the trnH‐psbA cpDNA marker of 93 samples shows three major haplotypes with 21–33 individuals; and two uncommon haplotypes with one individual each. Hatch marks indicate number of SNPs or indels between haplotypes

3.2. Allelic diversity of nuclear markers, population and seasonal structure

The six microsatellite markers showed substantial variability with 14–39 alleles, high levels of genetic diversity (Hexp = 0.77–0.94), and evenness (E5 = 0.59–0.73) making these markers very useful for our study (Table 1). Both Nei's genetic diversity (Hexp) and Simpson index, 1949 (1‐D), were calculated and both produced equal results and we are only reporting Hexp herein. Furthermore, there are an enormous number of different multilocus genotypes detected. Of the 551 individuals genotyped, 330 were different and between 95 and 124 genotypes were identified in each site (Table 2). A calculation of pairwise Gst between the three populations showed the least differentiation between PH and CG (Gst = 0.018) and more differentiation between MF and PH (Gst = 0.049) and MF and CG (Gst = 0.049). Pairwise Gst calculated between the seasons showed the least differentiation between spring and summer genotypes (Gst = 0.004); followed by Summer and Fall (Gst = 0.010); the most differentiation between spring and fall genotypes (Gst = 0.046).

TABLE 1.

Statistics for each of the six microsatellite loci utilized in this study

Locus # observed alleles Nei’s, 1978 gene diversity (Hexp) Evenness (E5)
MSTA31 29.00 0.89 0.71
MSTA67 39.00 0.94 0.73
MSTA78 26.00 0.81 0.62
MSTA101 22.00 0.82 0.61
MSTA102 23.00 0.78 0.59
MSTA143 14.00 0.77 0.69
Mean 25.50 0.83 0.66

TABLE 2.

Population statistics

Pop N MLG Nc MLGc eMLG SE H G lambda E.5 Hexp
PH 192 124 156 90 96.9 2.92 4.44 46.4 0.978 0.541 0.829
CG 216 148 184 116 104.1 3.28 4.71 72.4 0.986 0.649 0.834
MF 143 95 116 69 95.0 0.00 4.14 28.3 0.965 0.443 0.820
Total 551 330 456 256 105.8 4.59 5.28 79.2 0.987 0.401 0.834

N is number of samples; MLG is number of observed multilocus genotypes of the total dataset of n = 551. Nc and MLGc refer to the subset of samples with complete six‐locus genotypes and no missing data. All subsequent statistics in the table are calculated only for n = 551, where eMLG is expected number of multilocus genotypes at the smallest sample size ≥10 based on rarefraction; SE is standard error based on eMLG; H is Shannon–Wiener (1948) on MLG diversity; G is Stoddart & Taylor’s, 1988 index of MLG diversity; lambda is Simpson’s, 1949 index; E.5 is evenness; Hexp is Nei’s, 1978 unbiased gene diversity.

Bayesian STRUCTURE analyses and AMOVA all detected widespread diversity and population structure among the three sites. While STRUCTURE analyses may not be the most appropriate tool for clonal populations (Grünwald et al., 2017), it did clearly show that all populations had essentially pure genotypes from all K groups regardless of the K value (K = 3–12, with K = 11 being selected as the optimal value for the total dataset of n = 551) chosen, suggesting substantial seed flow of clones into sites. Some multilocus genotypes (clones) are found in all three populations but many are specific to populations, and the frequency of the genetic groups varied among sites (Figure 2). The DAPC analyses (Figure 3a) graphically shows this structure among sites. Seasonal clustering is also evident in the DAPC with genetically similar individuals flowering at similar times of year (Figure 3b). The seasonal analyses conducted for each of the three sites separately showed that true seasonal patterns were not confounded by location (Figure 3c–e). Finally, the AMOVA using clone‐corrected data confirmed many of these patterns, with observed Φ values falling outside the distributions expected from the permutations at all levels of analysis with substantial diversity partitioned among populations (p < .05) as well as among seasons within populations (p < .001).

FIGURE 2.

FIGURE 2

STRUCTURE assignment plot of the total dataset of n = 551 based on the six SSR markers with K = 11 showing some population‐level genotypic specificity in the three sites (PH, Prospect Hill; CG, Cedar Grove; MF, Moon Farm), but also evidence of admixture

FIGURE 3.

FIGURE 3

(a) Discriminant analysis of principal components (DAPC) of the six SSR markers by population site show population structure with some overlap between sites. (b) DAPC shown by season for all populations (FA, fall; SU, summer; SP, spring) shows strong separation between genotypes flowering in fall and spring with some separation, but more overlap of genotypes flowering in the summer. (c–e) DAPC by season for each population show a consistent separation of genotypes flowering in spring, summer, and fall

We used a K‐means hierarchical clustering approach that has been used to investigate and differentiate the structure of clonal and random mating populations (https://grunwaldlab.github.io/Population_Genetics_in_R/Pop_Structure.html), to assess the fit of our dandelion data to these models (Figure 4). The bootstrapped dendrograms constructed from Bruvo's distances showed long branches generally indicative of more random mating populations. While we only show the dendrogram for a randomized subset of the Cedar Grove population for visualization purposes, this pattern of deep branching and mismatches is seen in the full datasets of each population as well. The assigned colored k‐means clusters (K = 8) were often misaligned with dendrogram branches in all analyses which is the expectation for more random mating and panmictic populations (Figure 4). Panmixia seems more pronounced in the Cedar Grove and Prospect Hill populations, and much less so in the Moon Farm population with most k‐means clusters grouping together with the bootstrapped dendrogram groupings.

FIGURE 4.

FIGURE 4

A randomized subset of 88 Cedar Grove samples shown in bootstrapped dendrogram using Bruvo's distances, a genetic distance metric for microsatellite markers. Colored leaves on the tree indicate eight k‐means clusters which do not correspond perfectly with the dendrogram clusters indicating panmixia and gene flow from other sources. The full datasets of Cedar Grove, as well as Prospect Hill and Moon Farm (not pictured), show similar results

3.3. Linkage disequilibrium

In the tests for each of the populations of the six nuclear microsatellite loci, for both the full and the clone‐corrected data, the null hypotheses of no LD were rejected (Table 3). The null hypotheses were also rejected for the combined analysis of all populations. As the null hypothesis of no linkage between is rejected for all three populations, it is suggestive of asexual reproduction as the prevalent mode of reproduction in these sites. Surprisingly however, the haplotypes identified from the chloroplast data are not associated with specific nuclear genomes as might be expected if asexual reproduction is the only mode of reproduction in these populations.

TABLE 3.

Linkage disequilibrium (LD) summary for each of the populations for both non‐clone‐corrected (Non‐CC) and clone‐corrected (CC) data

Population Clone correction N
r¯
d
p‐value
CG Non‐CC 216 0.152 .001
CG CC 176 0.132 .002
MF Non‐CC 143 0.258 .001
MF CC 105 0.153 .001
PH Non‐CC 192 0.205 .001
PH CC 144 0.156 .001

3.4. Clonal diversity and differentiation

Samples with missing data at one or more of the microsatellite loci were removed before clone counting, with 456 out of the 551 samples included in the final clonal diversity assessment. Clone counting was performed for the total dataset (All pops) as well as for each of the three populations separately. There were 256 different genotypes (56%) among the 456 samples, 202 being detected once only and 54 genotypes occurring more than once; the most frequent genotype was detected 32 times with 21 individuals in Moon Farm, 8 in Cedar Grove, and 3 in Prospect Hill. The proportion of different genotypes among the samples was similar for each site; 90 in 156 samples for Prospect Hill, 116 in 184 for Cedar Grove, and 69 in 116 for Moon Farm (Figure 5).

FIGURE 5.

FIGURE 5

Histogram showing genotype frequencies summarizing number of total samples, number of different genotypes, number of genotypes found more than once, and number of clones present in the most common genotype found in all three populations (All pops) and for the three separate populations of Cedar Grove, Moon Farm, and Prospect Hill

The four largest clones with 17–32 individuals sampled throughout the study were detected in all three populations. They each had distinct flowering time patterns, however. The largest clone flowered primarily in Fall of both years (66%), while the second and third largest clones with 28 and 26 samples, respectively, both flowered primarily in the spring of both years (86% and 69% respectively). The fourth largest clone with 17 individuals flowered about equally in fall and spring of 2017, but was not detected in any population in 2018.

Many of the 256 genotypes share common alleles in some loci but have different alleles at other loci, suggesting recombination as the source of genetic diversity. Indeed, in the entire dataset, there is only one pair of different clones that differs by a single allele at a single locus. All other clones differ from each other by multiple alleles at multiple loci, a pattern of variation not expected by local somatic mutation.

Further, a careful analysis of the STRUCTURE output showed that regardless of the assumed number of genetic groups, approximately 10% of the individuals in the entire population are the result of admixture (arbitrarily defined as individuals with less than 80% of their genome coming from one source). For the clone‐corrected analyses, 22%–32% of the multilocus genotypes are defined as having mixed ancestry. This reflects the role of sexual reproduction and recombination in the history of these individuals, though the timing, in Europe or in North America, of this is unknown. We addressed this timing issue, by examining the level of mixed ancestry as determined from the STRUCTURE analysis, with the prevalence of the various clones. From the clone‐corrected data in which each multilocus genotype is represented once in the analysis, none of the 15 largest clones represented by 5–32 individuals/clone in the full population have mixed ancestry at K = 12. These large clones are defined as >90% pure in the analysis. However, 32% of the genotypes are identified as having mixed ancestry (<80% of their genome coming from a single source) possibly suggesting that recombinants are more recent arrivals or evolved locally.

4. DISCUSSION

Apomixis is a rare and specialized form of asexual reproduction in plants that bypasses the need for fertilization and produces clonal seeds. Theoretically, genetic diversity in apomictic species, as with asexuality in general, should be very low and subject to the accumulation of deleterious mutations (Müller, 1964). This inability to generate new genetic diversity through sexual recombination may be unfavorable in stochastic and unstable environmental conditions (Williams, 1975). Remarkably, most apomictic plant species retain some mixture of sexual and asexual members, which can override the problem of low genetic diversity while also taking advantage of the efficient asexual reproductive strategy (Hojsgaard & Hörandl, 2015). As is the case in weedy Taraxacum species, closely related sexual members coexist in certain populations mainly in some areas of Europe and gene flow between apomictic and sexual types is possible, leading to newly formed and genetically unique apomicts (Hughes & Richards, 1988; Menken et al., 1995; Van Der Hulst et al., 2003). These genetically diverse apomictic dandelion lines then disperse and establish new populations elsewhere, such as throughout the introduced range of North America. In this interesting dichotomy, the rewards of both asexual and sexual reproduction are reaped: efficient proliferation and spread of genetic varieties as given by apomixis, and the continual introduction of new genetic varieties formed by sexual reproduction.

Apomictic species often have populations composed of many multilocus genotypes (Ellstrand & Roose, 1987), and Taraxacum follows this pattern as well. Remarkable, however, is that this truly cosmopolitan taxon seems to be sexual only in restricted regions of Europe. Prior studies encompassing the continent of North America, a region where only triploid apomicts are known to occur, Lyman and Ellstrand (1984) found between 1 and 13 different genotypes co‐occurring in the same field; King (1993) found 46% of her samples collected from across the continent were different genotypes using cpDNA and ribosomal DNA restriction enzyme analysis. Rogstad et al. (2001) conducted a study using highly polymorphic variable number tandem repeat (VNTR) markers on a more localized scale in three sites in Ohio and Kentucky and found a range of 19% clonal in the long transect to 34% clonal in the short transect, suggesting greater diversity than the studies of Lyman and Ellstrand (1984) and King (1993) even when conducted on a more localized scale. Our study contributes to this body of work by taking a deep sampling approach over an extended temporal scale in three local areas. Given the asexual breeding system of this species, previous studies probably assumed with good reason that dense local sampling might not reveal much additional variation. Our study did, however, identify very high levels of genetic diversity and also distinct population structure and genotypic changes associated with flowering time.

We detected three major cpDNA haplotype lineages, consistent with King (1993) in her North American study, but we also recorded high levels of nuclear diversity, identifying hundreds of unique genotypes and no cases of single genotypes for any sampling date at any site as reported in some studies. Several features of our study may have aided our efforts to reveal this extensive diversity, population structure, and life history (flowering time) differences. First, we chose “old,” well‐established sites that have not had major habitat disturbances in recent times (other than regular mowing). The populations of dandelions have probably been in these areas for decades if not longer and not recently established by a few founders. Second, our sampling took place over two years and across many months in each year. Third, our sites are within the greater metropolitan area surrounding Boston, which could enhance human‐aided dispersal. This experiment was not designed to look at different types of populations, but it is noteworthy that Moon Farm, the one private site and with infrequent mowing, was the most genetically distinct site (pairwise Gst) and showed a less distinct signature of panmixia than the other two populations. These sampling strategies allowed us to assess the full array of genotypes in the area and the cumulative effects of population‐level processes.

While our study and others, together suggest that populations of Taraxacum in North America have high diversity and high numbers of different genotypes, the source of that variation is not as clear. Rogstad et al. (2001) did show a substantial number of unique clones within sites as our study did, but VNTR markers used in that study are more difficult to interpret. They generally concluded that much of the variation was probably caused by somatic mutations in clonal lineages. This is not true in our study. Our study using the SSRs suggests that the origin of the variation is recombination. Our loci are defined by multilocus and multiallelic banding patterns and clones possessed different combinations of these multiallelic loci, not single allele changes, a phenomenon also noted by Van Der Hulst et al., 2003 in their study of northern European apomictic populations. As noted previously, within the entire study that identified 256 different clones, only one pair of different clones differed by a single allele at a single locus. This one pair could conceivably be derived from a local somatic mutation (the pair of clones were both found in the Prospect Hill population). All other clones differed by multiple alleles at multiple loci. The pattern is much more akin to reticulate evolution resulting from hybridization (via sexual reproduction). King (1993) does note the lack of complete linkage disequilibrium between the nuclear and cpDNA genomes, and states that this must be from sex and hybridization, likely among lineages in Europe, which then disperse to North America. Our study shows this more clearly, with decoupled chloroplast and nuclear genomes as well as mismatches between the hierarchical genetic assignments and dendrogram relationships of genotypes giving populations a panmictic appearance (Figure 4). As expected however, LD is still substantial in all populations. Clearly, this high level of genetic diversity would require extraordinarily high seed dispersal if sexual reproduction is confined to Europe. Maybe not surprisingly, it appears that dandelions have just such an extraordinary flight mechanism as recently described by Cummins et al. (2018). Van Der Hulst et al. (2003) came to a similar conclusion that their northern European apomictic population was largely composed of genetic recombinants. However, it is likely that the source populations containing sexual members are much closer to their population than for our Boston area populations, again highlighting role of strong dispersal in mimicking the appearance of panmixia.

We also detected population structure and genotypic differences in flowering time within sites suggesting some local adaptation of clonal apomicts, though genetic drift despite the high dispersal may also contribute to the observed patterns. The spatial structure (3A) seemed surprising given the proximity of these sites, the prevalence of dandelions throughout the region, and the enormous level of gene flow via seed needed to generate the levels of diversity seen. The genetic structure in flowering times (Figure 3b) suggests that clones have diverse life history strategies and may be partitioning the available resources in sites.

While the prevailing view is that all genetic diversity is generated in Europe and dispersed to North American, we cannot discount the possibility of some cryptic local sexual recombination and its possible contribution to the genotypic diversity found in these populations. One interesting finding was that the 15 largest, most frequently occurring clones with 5–32 sampled individuals per genet, were never recombinant or of mixed ancestry based on STRUCTURE assignment analyses, suggesting that frequency may be correlated with time since establishment. This may indicate that recombinants are being generated in, and continuously dispersed from, Europe. Alternatively, this pattern might be achieved by dispersal and establishment of older clones from Europe in the past, followed by local cryptic recombination. Regardless, the propensity for dandelions to thrive in disturbed habitats created by humans makes this study system ripe for exploring urbanization, global climate change and local adaptation or selection of lineages. Further investigations of environmental conditions within each of the sites may reveal the factors that drive the establishment of population structure.

CONFLICT OF INTERESTS

The authors report no competing interests.

AUTHOR CONTRIBUTION

Lisa Mazumder: Conceptualization (equal); Data curation (lead); Formal analysis (lead); Investigation (equal); Methodology (equal); Validation (equal); Visualization (lead); Writing‐original draft (lead); Writing‐review & editing (equal). Rick Kesseli: Conceptualization (equal); Data curation (supporting); Formal analysis (supporting); Funding acquisition (lead); Investigation (equal); Methodology (equal); Project administration (lead); Resources (lead); Supervision (lead); Validation (equal); Visualization (supporting); Writing‐original draft (supporting); Writing‐review & editing (equal).

5.

DATA AVAILABILITY STATEMENT

Chloroplast DNA sequences of the trnH‐psbA intergenic spacer are available in GenBank® (accessions MW451112MW451204). Microsatellite data as well as additional figures not pictured herein are available at 10.6084/m9.figshare.14544672.

ACKNOWLEDGMENTS

The authors would like to thank the following undergraduate researchers at UMass‐Boston for help with DNA extraction for this project: Michael Schmelz, Jennifer Mace, Joulane Elyoumni, Maddy Gallay, Elton Kane, and Yara Alkhars. The authors thank Klaus Schliep (Graz University of Technology) for help with haplotype analysis and Rahul Mazumder (MIT) for assisting with the customized R script for clone counting. The work was supported in part by grants from National Science Foundation, Plant Genome Program (DBI‐0820451), and Research Experience for Undergraduate Program (DBI‐1659347) as well as the NIH Initiative to Maximize Student Development (R25GM076321).

Mazumder, L., & Kesseli, R. (2021). Population structure, seasonal genotypic differentiation, and clonal diversity of weedy dandelions in three Boston area populations (Taraxacum sp.). Ecology and Evolution, 11, 10926–10935. 10.1002/ece3.7870

REFERENCES

  1. Asker, S. E., & Jerling, L. (1992). Apomixis in plants. CRC Press. [Google Scholar]
  2. Bougeard, S., & Dray, S. (2018). Supervised Multiblock Analysis in R with the ade4 Package. Journal of Statistical Software, 86(1), 1–17. 10.18637/jss.v086.i01 [DOI] [Google Scholar]
  3. Chessel, D., Dufour, A., & Thioulouse, J. (2004). The ade4 package – I: One‐table methods. R News, 4(1), 5–10. [Google Scholar]
  4. Collier, M. H., & Rogstad, S. H. (2004). Clonal variation in floral stage timing in the common dandelion Taraxacum officinale (Asteraceae). American Journal of Botany, 91(11), 1828–1833. 10.3732/ajb.91.11.1828 [DOI] [PubMed] [Google Scholar]
  5. Cummins, C., Seale, M., Macente, A., Certini, D., Mastropaolo, E., Viola, I. M., & Nakayama, N. (2018). A separated vortex ring underlies the flight of the dandelion. Nature, 562(7727), 414–418. 10.1038/s41586-018-0604-2 [DOI] [PubMed] [Google Scholar]
  6. Dray, S., & Dufour, A.‐B. (2007). The ade4 package: Implementing the duality diagram for ecologists. Journal of Statistical Software, 22(4). 1–20. 10.18637/jss.v022.i04 [DOI] [Google Scholar]
  7. Dray, S., Dufour, A., & Chessel, D. (2007). The ade4 Package – II: Two‐Table and K‐Table Methods. 7(2), 47–52. [Google Scholar]
  8. Earl, D. A., & vonHoldt, B. M. (2012). STRUCTURE HARVESTER: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation Genetics Resources, 4(2), 359–361. 10.1007/s12686-011-9548-7 [DOI] [Google Scholar]
  9. Ellstrand, N. C., & Roose, M. L. (1987). Patterns of genotypic diversity in clonal plant species. American Journal of Botany, 74(1), 123–131. 10.1002/j.1537-2197.1987.tb08586.x [DOI] [Google Scholar]
  10. Evanno, G., Regnaut, S., & Goudet, J. (2005). Detecting the number of clusters of individuals using the software structure: A simulation study. Molecular Ecology, 14(8), 2611–2620. 10.1111/j.1365-294X.2005.02553.x [DOI] [PubMed] [Google Scholar]
  11. Excoffier, L., Smouse, P. E., & Quattro, J. M. (1992). Analysis of molecular variance inferred from metric distances among DNA haplotypes: Application to human mitochondrial DNA restriction data. Genetics, 131(2), 479–491. 10.1093/genetics/131.2.479 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Falque, M., Keurentjes, J., Bakx‐Schotman, J. M. T., & van Dijk, P. J. (1998). Development and characterization of microsatellite markers in the sexual‐apomictic complex Taraxacum officinale (dandelion). Theoretical and Applied Genetics, 97(1–2), 283–292. 10.1007/s001220050897 [DOI] [Google Scholar]
  13. Falush, D., Stephens, M., & Pritchard, J. K. (2003). Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics, 164(4), 1567–1587. 10.1093/genetics/164.4.1567 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Falush, D., Stephens, M., & Pritchard, J. K. (2007). Inference of population structure using multilocus genotype data: Dominant markers and null alleles: TECHNICAL ARTICLE. Molecular Ecology Notes, 7(4), 574–578. 10.1111/j.1471-8286.2007.01758.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Felsenstein, J. (1974). The evolutionary advantage of recombination. Genetics, 78(2), 737–756. 10.1093/genetics/78.2.737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Grünwald, N. J., Everhart, S. E., Knaus, B. J., & Kamvar, Z. N. (2017). Best practices for population genetic analyses. Phytopathology®, 107(9), 1000–1010. 10.1094/PHYTO-12-16-0425-RVW [DOI] [PubMed] [Google Scholar]
  17. Hamilton, M. (1999). Four primer pairs for the amplification of chloroplast intergenic regions with intraspecific variation. Molecular Ecology, 8(3), 521–523. [PubMed] [Google Scholar]
  18. Hedrick, P. W. (2005). A standardized genetic differentiation measure. Evolution, 59(8), 1633–1638. 10.1111/j.0014-3820.2005.tb01814.x [DOI] [PubMed] [Google Scholar]
  19. Hodkinson, D. J., & Thompson, K. (1997). Plant Dispersal: The Role of Man. The Journal of Applied Ecology, 34(6), 1484. 10.2307/2405264 [DOI] [Google Scholar]
  20. Hojsgaard, D., & Hörandl, E. (2015). A little bit of sex matters for genome evolution in asexual plants. Frontiers in Plant Science, 6, 10.3389/fpls.2015.00082 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hubisz, M. J., Falush, D., Stephens, M., & Pritchard, J. K. (2009). Inferring weak population structure with the assistance of sample group information. Molecular Ecology Resources, 9(5), 1322–1332. 10.1111/j.1755-0998.2009.02591.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hughes, J., & Richards, A. J. (1988). The genetic structure of populations of sexual and asexual Taraxacum (dandelions). Heredity, 60(2), 161–171. 10.1038/hdy.1988.28 [DOI] [Google Scholar]
  23. Iaffaldano, B. J., Zhang, Y., Cardina, J., & Cornish, K. (2017). Genome size variation among common dandelion accessions informs their mode of reproduction and suggests the absence of sexual diploids in North America. Plant Systematics and Evolution, 303(6), 719–725. 10.1007/s00606-017-1402-2 [DOI] [Google Scholar]
  24. Johnson, M. T. J., Thompson, K. A., & Saini, H. S. (2015). Plant evolution in the urban jungle. American Journal of Botany, 102(12), 1951–1953. 10.3732/ajb.1500386 [DOI] [PubMed] [Google Scholar]
  25. Jombart, T. (2008). adegenet: A R package for the multivariate analysis of genetic markers. Bioinformatics, 24(11), 1403–1405. 10.1093/bioinformatics/btn129 [DOI] [PubMed] [Google Scholar]
  26. Jombart, T., & Ahmed, I. (2011). adegenet 1.3‐1: New tools for the analysis of genome‐wide SNP data. Bioinformatics, 27(21), 3070–3071. 10.1093/bioinformatics/btr521 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kamvar, Z. N., Brooks, J. C., & Grünwald, N. J. (2015). Novel R tools for analysis of genome‐wide population genetic data with emphasis on clonality. Frontiers in Genetics, 6. 1–10. 10.3389/fgene.2015.00208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kamvar, Z. N., Tabima, J. F., & Grünwald, N. J. (2014). Poppr: An R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ, 2, e281. 10.7717/peerj.281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kearse, M., Moir, R., Wilson, A., Stones‐Havas, S., Cheung, M., Sturrock, S., Buxton, S., Cooper, A., Markowitz, S., Duran, C., Thierer, T., Ashton, B., Meintjes, P., & Drummond, A. (2012). Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics (Oxford, England), 28(12), 1647–1649. 10.1093/bioinformatics/bts199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. King, L. M. (1993). Origins of genotypic diversity in North American dandelions inferred from ribosomal DNA and chloroplast DNA restriction enzyme analysis. Evolution, 47(1), 136–151. 10.1111/j.1558-5646.1993.tb01205.x [DOI] [PubMed] [Google Scholar]
  31. Lyman, J. C., & Ellstrand, N. C. (1984). Clonal diversity in Taraxacum officinale (compositae), an apomict. Heredity, 53(1), 1–10. 10.1038/hdy.1984.58 [DOI] [Google Scholar]
  32. Mack, R. N., & Lonsdale, W. M. (2001). Humans as global plant dispersers: Getting more than we bargained for. BioScience, 51(2), 95. [Google Scholar]
  33. Menken, S. B. J., Smit, E., & Nijs, H. J. C. M. D. (1995). Genetical population structure in plants: Gene flow between diploid sexual and triploid asexual dandelions (Taraxacum section Ruderalia). Evolution; International Journal of Organic Evolution, 49(6), 1108–1118. 10.1111/j.1558-5646.1995.tb04437.x [DOI] [PubMed] [Google Scholar]
  34. Müller, H. J. (1964). The relation of recombination to mutational advance. Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis, 1(1), 2–9. 10.1016/0027-5107(64)90047-8 [DOI] [PubMed] [Google Scholar]
  35. Nei, M. (1978). Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics, 89(3), 583–590. 10.1093/genetics/89.3.583 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Paradis, E. (2010). pegas: An R package for population genetics with an integrated‐modular approach. Bioinformatics, 26(3), 419–420. 10.1093/bioinformatics/btp696 [DOI] [PubMed] [Google Scholar]
  37. Paradis, E., & Schliep, K. (2019). ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. Edited by R. Schwartz, 35(3), 526–528. 10.1093/bioinformatics/bty633 [DOI] [PubMed] [Google Scholar]
  38. Primack, D., Imbres, C., Primack, R. B., Miller‐Rushing, A. J., & Del Tredici, P. (2004). Herbarium specimens demonstrate earlier flowering times in response to warming in Boston. American Journal of Botany, 91(8), 1260–1264. 10.3732/ajb.91.8.1260 [DOI] [PubMed] [Google Scholar]
  39. Pritchard, J. K., Stephens, M., & Donnelly, P. (2000). Inference of population structure using multilocus genotype data. Genetics, 155, 945–959. 10.1093/genetics/155.2.945 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. R Core Team (2020). A language and environment for statistical computing. R Foundation for Statistical Computing. Available at: https://www.R‐project.org/ [Google Scholar]
  41. Richards, A. J. (1973). The origin of Taraxacum agamospecies. Botanical Journal of the Linnean Society, 66(3), 189–211. 10.1111/j.1095-8339.1973.tb02169.x [DOI] [Google Scholar]
  42. Rogstad, S. H., Keane, B., & Beresh, J. (2001). Genetic variation across VNTR loci in central North American Taraxacum surveyed at different spatial scales. Plant Ecology, 161(1), 111–121. 10.1023/A:1020301011283 [DOI] [Google Scholar]
  43. Shannon, C. E. (1948). A mathematical theory of communication. Bell System Technical Journal, 27(4), 623–656. 10.1002/j.1538-7305.1948.tb00917.x [DOI] [Google Scholar]
  44. Simpson, E. H. (1949). Measurement of diversity. Nature, 163(4148), 688. 10.1038/163688a0 [DOI] [Google Scholar]
  45. Solbrig, O. T. (1971). The population biology of dandelions: These common weeds provide experimental evidence for a new model to explain the distribution of plants. American Scientist, 59(6), 686–694. [Google Scholar]
  46. Stoddart, J. A., & Taylor, J. F. (1988). Genotypic diversity: estimation and prediction in samples. Genetics, 118(4), 705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Thioulouse, J., Dray, S., Dufour, A.‐B., Siberchicot, A., Jombart, T., & Pavoine, S. (2018). Multivariate Analysis of Ecological Data with ade4. Springer New York. 10.1007/978-1-4939-8850-1 [DOI] [Google Scholar]
  48. Van Der Hulst, R. G. M. et al. (2003). Genetic structure of a population sample of apomictic dandelions. Heredity, 90(4), 326–335. 10.1038/sj.hdy.6800248 [DOI] [PubMed] [Google Scholar]
  49. Vasut, R. J., Van dijk, P. J., Falque, M., Travnicek, B., & De jong, J. H. (2004). Development and characterization of nine new microsatellite markers in Taraxacum (Asteraceae). Molecular Ecology Notes, 4(4), 645–648. 10.1111/j.1471-8286.2004.00760.x [DOI] [Google Scholar]
  50. Vellend, M., Drummond, E. B. M., & Muir, J. L. (2009). Ecological differentiation among genotypes of dandelions (Taraxacum officinale). Weed Science, 57(4), 410–416. 10.1614/WS-09-004.1 [DOI] [Google Scholar]
  51. Williams, G. C. (1975). Sex and evolution. Princeton University Press (Monographs in population biology, 8). [PubMed] [Google Scholar]
  52. Závada, T., Malik, R. J., & Kesseli, R. V. (2017). Population structure in chicory (Cichorium intybus): A successful U.S. weed since the American revolutionary war. Ecology and Evolution, 7(12), 4209–4219. 10.1002/ece3.2994 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Chloroplast DNA sequences of the trnH‐psbA intergenic spacer are available in GenBank® (accessions MW451112MW451204). Microsatellite data as well as additional figures not pictured herein are available at 10.6084/m9.figshare.14544672.


Articles from Ecology and Evolution are provided here courtesy of Wiley

RESOURCES