Skip to main content
G3: Genes | Genomes | Genetics logoLink to G3: Genes | Genomes | Genetics
. 2021 Sep 29;11(12):jkab344. doi: 10.1093/g3journal/jkab344

Beta vulgaris ssp. vulgaris chromosome 8 shows significant association with geosmin concentration in table beet

Solveig J Hanson 1,, Julie C Dawson 2, Irwin L Goldman 2
Editor: T Jamann
PMCID: PMC8664477  PMID: 34586384

Abstract

Geosmin, a degraded sesquiterpene molecule with earthy and musty odor, imbues table beet with its characteristic aroma. Geosmin is heritable and endogenously produced in table beet; its earthy aroma is sought by some consumers but deters others. Geosmin biosynthesis is catalyzed by a bifunctional geosmin synthase enzyme in diverse bacteria and fungi, but a mechanism for geosmin biosynthesis in plants has not been reported. This work employed association analysis and selective genotyping of a segregating F2:3 mapping population to seek QTL associated with geosmin concentration in table beet. GBS reads were aligned to sugar beet reference genome EL10.2, and association analysis revealed two QTL for geosmin concentration on Beta vulgaris ssp. vulgaris chromosome 8. QTL at EL10.2 positions 28,017,624 and 38,488,687 each show effect size 8.7 μg·kg-1 geosmin and explain 8.5% and 6.4% of total variation in geosmin concentration, respectively. Resolution was low due to large recombination bin size and imperfect alignment between the reference genome and mapping population, but population size and selection proportion were sufficient to detect moderate to large effect QTL. This study, the first molecular genetic mapping experiment in table beet, succeeded in finding QTL for geosmin concentration in table beet, and it provides the basis for fine mapping or candidate gene investigation of functional loci for this distinctive sensory trait.

Keywords: Table beet, Beta vulgaris, geosmin, association analysis, selective genotyping, QTL mapping

Introduction

Table beet (B. vulgaris ssp. vulgaris) is a minor biennial vegetable crop with significant regional and local economic impact. In Wisconsin, the United States’ largest table beet producing state, most beets are cultivated conventionally for processing markets (USDA NASS 2020) which demand standard red beets (Goldman and Navazio 2003). Beets are also an important crop for U.S. organic vegetable growers (Lyon et al. 2015; Brouwer and Colley 2016; Hultengren et al. 2016), who tend to prioritize flavor when selecting vegetable varieties (Dawson and Healy 2018). To facilitate breeding for both standard and novel table beet flavor characteristics, it is critical to understand the genetic basis of physicochemical components associated with flavor in table beet. Sweet and earthy flavor attributes are salient in table beet (Bach et al. 2014) and have been associated with sucrose (e.g., Bach et al. 2014) and geosmin (Murray et al. 1975), respectively. Bitter flavor in table beet is associated with saponins (Mikołajczyk-Bator et al. 2016), flavonoids (Kujala et al. 2002), and phenols (Bavec et al. 2010). Raw beet can also produce astringent or pungent sensations (Bach et al. 2014), which may be associated with oxalate (Freidig and Goldman 2011), a compound known to produce abrasive sensations in other crops (Korth et al. 2006; Salinas et al. 2001).

The current work focuses on geosmin (trans-1,10-dimethyl-trans-9-decalol), the volatile 12-carbon terpenoid molecule that imbues moist soil with its distinctive aroma (Gerber 1967) and has been identified as the characteristic flavor of table beet (Tyler et al. 1979). Geosmin is present in a diverse group of terrestrial, aquatic, and plant-symbiotic microoroganisms (Churro et al. 2020), nonvascular plants like liverworts (Spiteller et al. 2002), and B. vulgaris ssp. vulgaris (Freidig and Goldman 2014), which includes crop types sugar beet, table beet, fodder beet, leaf beet, and Swiss chard (Letschert 1993). In table beet roots, geosmin is concentrated in and directly under the epidermis (Tyler et al. 1978; Lu et al. 2003a). Humans are exceedingly sensitive to geosmin, although perception thresholds for geosmin vary among individuals (Tyler et al. 1979), and preference for geosmin depends on consumption context. Geosmin is considered a contaminant in drinking water, wine, beer, and other foods (e.g., Buttery et al. 1976; Frisvad et al. 1997; Darriet et al. 2000;), and anecdotal evidence exists both of nonpreference (Lu et al. 2003b) and preference (Goldman and Navazio 2008) for beet among consumers due to its earthy flavor.

While a mechanism for geosmin biosynthesis in B. vulgaris ssp. vulgaris has not been reported, geosmin biosynthesis has been confirmed in liverworts (Spiteller et al. 2002), fungi (Liato and Aïder 2017), actinobacteria, cyanobacteria, and proteobacteria (Churro et al. 2020). The biosynthesis of geosmin in Streptomyces coelicolor bacteria is a cyclization-fragmentation reaction catalyzed by a single bifunctional geosmin synthase enzyme in the presence of magnesium (Mg2+) ions. Sesquiterpene (C15) precursor molecule farnesyl diphosphate (FPP)—itself synthesized from ubiquitous C5 units (Singh and Sharma 2015)—is cyclized into an 85:15 mixture of germacradienol and germacrene D by the N-terminal domain of geosmin synthase. The enzyme’s C-terminal domain then catalyzes a fragmentation reaction which converts C15 germacradienol into C12 geosmin (Jiang et al. 2006) and C3 acetone (Harris et al. 2015).

The gene coding for geosmin synthase has been reported in Streptomyces coelicolor (Cane and Watt 2003) and model cyanobacterium Nostoc punctiforme (Giglio et al. 2008). The crystal structure of the S. coelicolor enzyme’s N-terminal domain has been determined, as has a homology model of the C-terminal domain (Harris et al. 2015). A complete geosmin synthesis gene cluster, composed of the geosmin synthase gene and two transcription regulator nucleotide binding genes, was discovered in the cyanobacteria Microcoleus asticus, and complete or partial geosmin synthase gene sequences were detected in actinobacteria, proteobacteria, and cyanobacteria occupying diverse terrestrial, aquatic, and symbiotic ecological niches (Churro et al. 2020). Dehydrogeosmin, a closely related sesquiterpene, is biosynthesized in the flowers of several Cactaceae species (Feng et al. 1993; Schlumpberger et al. 2004) using the same principal pathway as for geosmin biosynthesis in Streptomyces spp. (Spiteller et al. 2002). It is unknown whether the enzyme catalysts for these pathways arose independently or from a common ancestor.

A geosmin synthase gene has not been detected in B. vulgaris, but ample evidence exists that geosmin is endogenously produced in table beet and that its concentration is under primarily genetic control. Because geosmin is known to be synthesized by ubiquitous soil-dwelling microbes, it was assumed historically that geosmin was present in beet due to plant-microbe affiliation. However, table beet cultivars grown in autoclaved soil (Freidig and Goldman 2014) and aseptic tissue culture (Maher and Goldman 2018) were found to contain geosmin. Two experiments investigating genetic versus environmental control of geosmin in table beet cultivars—one comparing greenhouse and field treatments (Freidig and Goldman 2014) and one comparing differing field environments (Hanson and Goldman 2019)—found significant effects of genotype on geosmin concentration and mostly noncrossover genotype × environment interaction. Moreover, bidirectional recurrent selection for geosmin concentration in table beet was successful and produced realized heritability estimates of 0.70 and 0.23 for downward and upward selection, respectively (Maher and Goldman 2017). Hanson and Goldman (2019) showed broad sense heritability of 0.90 for geosmin concentration in a field-based genotype x environment experiment , adding to a substantial body of evidence establishing the primacy of genetic control over geosmin concentration in table beet. Geosmin concentration displays continuous variation consistent with multigenic control, and the variance of geosmin concentration is known to increase with concentration itself (Freidig and Goldman 2014; Maher and Goldman 2017; Hanson and Goldman 2019). However, genomic investigation is needed to elucidate the biosynthetic origin of geosmin in table beet.

Recent analyses of genetic structure within B. vulgaris ssp. vulgaris demonstrate that the genetic divergence of table beet from sugar, fodder, and leaf beet crop types is minor compared with their genetic similarity (Andrello et al. 2017; Galewski and Mcgrath 2020). No table beet genome has been published to date, but because sugar beet shows very close genetic relationship to table beet and is confirmed to contain geosmin (Tyler et al. 1978; Freidig and Goldman 2014), the genomic tools developed for sugar beet breeding can be leveraged for investigation of the genetic basis of geosmin concentration in table beet. Two sugar beet reference genomes have been published to date: RefBeet (Dohm et al. 2014) from short Illumina-derived DNA sequences and EL10 (Funk et al. 2018; McGrath et al. 2020) from PacBio long DNA reads, optical mapping, and Hi-C reads. EL10 versions EL10.1 and EL10.2 comprise nine main scaffolds, along with 31 and 9 additional small scaffolds, respectively. RefBeet and EL10.1 chromosomes are highly congruous and numbered according to conventional nomenclature (Butterfass 1964; Schondelmaier and Jung 1997). RefBeet chromosomes are oriented according to the cytogenetic map of Paesold et al. (2012), while chromosomal orientation is inverted for seven of nine EL10.1 chromosomes (McGrath et al. 2020). The EL10.2 genome assembly resolved inversions on EL10.1 chromosomes 7 and 9 and is the most contiguous sugar beet genome to date (McGrath et al. 2020).

The sugar beet genome is highly repetitive, characterized by concentrated regions of recombination, and subject to diverse forms of variation. Estimates of repetitive sequence in the sugar beet genome range from 42% (Dohm et al. 2014) to over 60% (Flavell 1974), with diverse repeat patterns among chromosomes (Paesold et al. 2012) and chromosomal recombination concentrated in certain genomic regions (e.g., Barzen et al. 1992; McGrath et al. 2007; Dohm et al. 2014). Presence-absence variation—including indels, complex substitutions, and structural variation involving partial or entire genes—is a meaningful source of phenotypic variation and lineage divergence in sugar beet (Galewski and Eujayl 2021). Sugar beet genome size was estimated by flow cytometry to be 758 Mb (Arumuganathan and Earle 1991), but recent assemblies have fallen substantially short of that projection, ranging in size from 520 to 573 Mb (Galewski and Eujayl 2021). RefBeet and EL10.2 reference genomes have nearly identical size at 567 Mb (McGrath et al. 2020).

A preliminary search of the RefBeet genome (Dohm et al. 2014) for the S. coelicolor geosmin synthase protein sequence returned two hypothetical proteins with predicted terpenoid synthase function (Huang et al. 2017) located on B. vulgaris ssp. vulgaris chromosome 8 (Maher 2017). A probe using the Solidago canadensis germacrene-D synthase protein sequence yielded 56 potential genes, many of which had predicted sesquiterpene synthase function (Maher 2017). However, terpenoids are the largest and most diverse group of phytochemicals, making terpene synthase genes abundant and ubiquitous within plant genomes (Singh and Sharma 2015). In addition, geosmin synthase genes show evidence of rearrangements and deletions between bacterial phyla (Churro et al. 2020) which are much more closely related than bacteria and vascular plants, so sequence homology between geosmin synthase genes in Streptomyces and B. vulgaris cannot be assumed. While it is plausible that the B. vulgaris ssp. vulgaris genome harbors a geosmin synthase gene, additional evidence is required to establish its location and functionality.

Genetic mapping has been used for decades to detect association between genetic markers and genomic loci that influence phenotypes. Linkage mapping was first used in sugar beet to detect qualitative traits like hypocotyl color (Barzen et al. 1992; Pillen et al. 1992), fertility restoration (Pillen et al.1992, 1993), monogermy and Rhizomania resistance (Barzen et al. 1992), and bolting (El-Mezawy et al. 2002). Efforts to detect quantitative traits like sugar yield and quality met with limited success (Weber et al. 1999; Weber et al. 2000) due to significant environmental variation, low marker density, and the lack of consistent genetic polymorphism across populations. A few sugar beet linkage mapping projects succeeded in identifying QTL potentially useful for marker-assisted selection (e.g., Rajabi and Borchardt 2015), but limitations remained to this approach, particularly low resolution due to large linkage blocks in biparental segregating populations, and limitation to two alleles per locus (Flint-Garcia et al. 2003). Association analysis is often used with diverse panels which have rapid decay of linkage disequilibrium. Such an analysis using elite sugar beet breeding lines and wild beet accessions showed that linkage disequilibrium varied by both population history and by chromosome (Adetunji et al. 2014). Association analysis requires control of population structure to identify true marker-trait associations, and in sugar beet, population structure has been sufficiently controlled by including the kinship matrix as a random effect in the marker-trait association model (Würschum et al. 2011; Mangin et al. 2015).

Selective genotyping, the practice of genotyping only individuals with extreme phenotypes, can be used for QTL detection (Navabi et al. 2009). It was first used in bulk segregant analysis (Stuber et al. 1980; Zou et al. 2016), and Lander and Botstein (1989) used this technique in linkage mapping experiments to control costs while retaining a high percentage of linkage information. Bidirectional selective genotyping—selecting equal proportions from each tail of the phenotypic distribution—maximizes both power to detect QTL (Navabi et al. 2009) and efficiency compared with complete population genotyping (Rabier 2014). For a given proportion of individuals selected for genotyping, power increases with QTL effect size, total population size, and proximity of QTL to markers (Navabi et al. 2009). Optimum selection proportion may be calculated as a function of the relative costs of genotyping and phenotyping for a given experiment (Darvasi and Soller 1992; Gallais et al. 2007) or using evidence from simulation studies (Rabier 2014; Sun et al. 2010).

The current work seeks to locate genomic loci associated with table beet geosmin concentration to connect the physical beet genome with existing statistical and observational evidence that this trait is under primarily genetic control. Identification of QTL for geosmin concentration in table beet could also yield insight into the mechanism of geosmin biosynthesis in beet and facilitate development of molecular markers for table beet flavor breeding. This is the first molecular marker-based genetic mapping effort in table beet to our knowledge, and it is unique among B. vulgaris ssp. vulgaris mapping projects for its focus on a sensory trait rather than one important for commodity food manufacturing.

Methods

Germplasm

A population of F3 individuals segregating for geosmin concentration was developed from a biparental cross of two individuals from open pollinated table beet cultivars. Seed parent cultivar Touchstone Gold’ and pollen parent cultivar ‘Chioggia are known to have relatively low and high mean geosmin concentration, respectively (Freidig and Goldman 2014). A single F1 plant was self-pollinated to produce a segregating F2 family, and 194 F2 plants were self-pollinated to create F2:3 families. Populations of parental cultivars Touchstone Gold’ and Chioggia’ were maintained through mass pollination and used as check cultivars.

Field and greenhouse methods

Seed for the F2:3 mapping population was produced in three consecutive annual cycles, each composed of summer field-based root production and winter greenhouse seed production. Parental, F1, and F2 roots were harvested in late August 2014, 2015, and 2016, respectively. Parents were co-isolated to achieve the initial biparental cross in February 2015, and single plants were bagged to achieve F1 and F2 self-pollinations in February 2016 and 2017, respectively. Seed was hand harvested from each plant; of 194 F2:3 families self-pollinated in 2017, 90 produced sufficient seed for inclusion in the mapping population. The mapping population of 90 F2:3 families was grown in two randomized replications in both 2017 and 2019. Four plots each of parental cultivars Touchstone Gold’ and Chioggia’ were placed randomly in each replication, for 98 total entries per replicate. Plots measuring 1.2 m long and spaced 46 cm apart were hand-seeded in 2017 and transplanted in 2019.

Three individuals free from surface damage, with root diameter 4 to 6.5 cm, and with tops intact were harvested from each plot 11.5 weeks after direct seeding in 2017 and 8.5 weeks after transplanting in 2019. Leaf and root tissue were sampled from 2 individuals per F2:3 family plot and one root per check plot within 2 days of harvest. For each leaf sample, five 1-cm disks were punched from a single newly emerged mature leaf and frozen at −80°C in either fresh or lyophilized form. Roots were dry-brushed to remove soil, and a core borer with 1 cm internal diameter was used to extract 6 to 10 cylindrical cores, each with 1 to 3cm length and an epidermal disk of approximately 1 cm diameter. Cores were frozen immediately at −80°C and transferred to a −20°C freezer before laboratory analysis. 84 of 90 F2:3 families yielded sufficient roots for sampling.

GC-MS phenotyping

Geosmin concentration was calculated via HSPME and GC–MS using the method described by Hanson and Goldman (2019) and Maher and Goldman (2017, 2018), adapted from Lu et al. (2003b). Briefly, beet homogenate was prepared for each root tissue sample using six root cores with complete epidermal disks. Cores were trimmed to achieve a 30 g sample mass, combined with Milli-Q water (Millipore, Bedford, MA, USA) at a 1:1 ratio, and blended for 2 min in an industrial blender. Homogenate was held at −20°C until immediately before GC-MS analysis, when it was thawed slightly in a 32°C water bath. Partially frozen homogenate was combined with NaCl in a screw-top glass vial at a 5 g : 1 g ratio, and the vial was sealed immediately with a polytetrafluoroethylene (PTFE)-silicone septum-lined cap. A water blank and three geosmin standards of 5, 10, and 21.6 μg·kg-1 ± (–)geosmin (Sigma-Aldrich, St. Louis, MO) in 5 g Milli-Q water were included with each GC–MS cycle for calibration curve generation. Internal standard –(–)menthone (Sigma-Aldrich) was injected at the rate of 2.82 μg·kg-1 root tissue in all table beet samples and geosmin standards. Relative recovery was calculated by comparing geosmin concentration in unmodified table beet homogenate with that in identical homogenate spiked with 5, 10, 15, and 21.6 μg·kg-1 geosmin, according to the method of Lu et al. (2003b). Relative recovery rates for 2017 and 2019 were 40.37% and 46.76%, respectively.

Relative Recovery= μg Geosmin Total- μg Geosmin Unmodified μg Geosmin Spiked ×100%

Phenotypic data analysis

Phenotypic data for the full mapping population, selectively genotyped population, and check plots were analyzed using R software (R Core Team 2021). Mixed model ANOVA and Tukey-corrected pairwise comparisons with experiment-wide error rate α=0.05 were conducted with R packages “lme4” (Bates et al. 2015), “lmerTest” (Kuznetsova et al. 2017), and “emmeans” (Lenth 2020). The statistical model used was:

Geoijkl= Yeari+ Familyj+(Year*Family)ij+ Blockk(Year)i+(Block*Year*Family)ijk+εijkl

where Geoijkl is individual root geosmin concentration; Yeari is the fixed effect of year i, i = {2017, 2019}; Familyj is the fixed effect of family j, j = {1…78}; Blockk is the fixed effect of block k, which was nested within year, k = {1,2}. The random Block*Year*Family interaction term expressed plot-to-plot variation and was used as the denominator for significance tests of fixed effects.

Welch T-tests were used for comparison of mean geosmin concentration between years, between check plots and F2:3 populations, and between high and low geosmin tails of the selected population. Log-transformed data conformed better to ANOVA assumptions and were used for all statistical tests. Data were visualized using R package “ggplot2” (Wickham 2016). Of the 84 phenotyped families, 6 yielded fewer than two roots in 2017 and were excluded from the full mapping population.

Selective genotyping

Selective genotyping was performed on a subpopulation of F3 individuals representing 16% of the full mapping population, divided equally between tails of the geosmin concentration distribution. Because 2017 and 2019 populations were of different sizes, and because a significant Year effect was observed with respect to geosmin concentration, 2017 and 2019 populations were considered separately for the purpose of individual selection. F2:3 families were ranked by mean geosmin concentration within year, and a subset of families was identified that showed geosmin concentration in the extreme quartile of the distribution in both years. Next, F3 individuals were ranked by geosmin concentration within the year of production. Individuals were considered for selection beginning with the most extreme individual and working toward the mean; individuals were selected only if they belonged to one of the families with extreme mean geosmin concentration in both 2017 and 2019.

The original selected population (n = 92) was composed of 40 individuals from 2017 and 52 individuals from 2019, divided equally between high and low geosmin tails within year. Four genotyped individuals were excluded from further analysis because they belonged to families with too few 2017 roots for valid ANOVA, and one genotyped individual yielded insufficient DNA for analysis. The analyzed population (n = 87) consisted of a low geosmin tail (n = 43) of 20 and 23 individuals from 2017 and 2019, respectively, and a high geosmin tail (n = 44) of 19 and 25 individuals from 2017 and 2019, respectively.

A single 1-cm disk of leaf tissue was submitted to the University of Wisconsin Biotechnology Center (Madison, WI) for DNA extraction and genotyping by sequencing (Elshire et al. 2011). DNA libraries were prepared by digestion with NsiI-BfaI and ligation of GBS barcodes and adapters. Paired-end reads (2 × 150 bp) were generated on one shared lane of an Illumina NovaSeq 6000 sequencer.

Genomic data analysis

Genomic data quality control, sequence alignment, and SNP calling were performed by the University of Wisconsin Bioinformatics Resource Center (Madison, WI, USA). Skewer software (Jiang et al. 2014) was used to remove adapters, primers, and low-quality bases, to trim reads until achievement of Phred quality 20, and to discard excessively short reads. Demultiplexed 64 bp forward reads were aligned to B. vulgaris ssp. vulgaris reference genome EL10.2 (McGrath et al. 2020) using the Tassel v2 GBS Pipeline (Glaubitz et al. 2014) and alignment software Bowtie 2 (Langmead et al. 2012). Variants were called using the Tassel v2 Discovery and Production SNP Caller system. 185,457 unfiltered SNP variants were detected.

VCF files were filtered using VCFtools (Danecek et al. 2011) to eliminate indel mutations and retain only biallelic SNP’s with minor allele frequency >0.10, read depth 5, and no missing individuals. Preliminary analyses that included EL10.2 small scaffolds 10–18 revealed no SNP’s close to the significance threshold, so SNP’s on these contigs were omitted for clarity, resulting in a set of 36,928 markers. SNP markers were filtered in R by genotype frequency to eliminate sites with <5% minor homozygotes and >50% heterozygotes. Marker data were analyzed for linkage disequilibrium within chromosome (Rogers and Huff 2009), and a single marker was selected to represent groups of markers with LD >0.99. Bins containing markers from multiple chromosomes were examined individually, and ambiguously placed markers were eliminated, resulting in a final set of 8651 markers. EL10.2 main scaffolds were renumbered to bring them into accordance with Butterfass chromosome naming convention. Haplotypes were not phased due to unavailability of parental tissue for sequencing, so the reference and alternate alleles were designated 0 and 1, respectively, based on comparison with reference genome EL10.2. Deviations from the reference genome are useful for detecting genomic loci associated with geosmin but not for assigning such deviations to parental genotypes.

While biparental populations are not expected to exhibit population structure, selective genotyping could introduce it. Thus, population structure was analyzed and visualized using principal components analysis (PCA) with base R software and package “ggplot,” respectively. Single marker analysis was performed using R package “GWASpoly” v.2.07 (Rosyara et al. 2016) with settings for diploid organisms, additive gene action, and significance threshold set using 1000 permutations (Churchill and Doerge 1994) to control the family-wise error rate at α=0.05. A separate relationship matrix (K) was calculated for each chromosome using the “leave-one-chromosome-out” method (Yang et al. 2014) to account for the random effect of genotype (Würschum et al. 2011; Mangin et al. 2015). The most significant marker in each 1 Mb window was reported, and a QTL model was fit to this marker subset in “GWASpoly” using backward regression. F3 individual geosmin concentration was used as the phenotypic variable, but because F3 individuals are not replicable, the F2:3 family mean in both years was taken into account during selection of individuals for genotyping. Because the permutation-derived threshold is independent of phenotypic data distribution, untransformed geosmin concentration data could be used for analysis, simplifying interpretation of results.

Results

Mapping population

The distribution of untransformed geosmin concentration in the full mapping population was right-skewed for both individual roots (Figure 1) and F2:3 family means (data not shown). Full populations from 2017 and 2019 did not differ significantly in mean geosmin concentration when compared using individual roots (Welch T = −1.45 on 517.6 DF, P  0.15) or family means (Welch T = −0.99 on 152.9 DF, P  0.32). Mixed model ANOVA performed on individual geosmin concentration values showed a significant Year effect and extremely significant Family effect (Table 1). Pairwise comparisons among F2:3 families were used to confirm the extreme-geosmin nature of families prioritized for selective genotyping (Supplementary Table S1). Each of the 78 F2:3 families in the mapping population was represented by between 4 and 8 individuals, with mean 6.95 individuals per family.

Figure 1.

Figure 1

Distribution of geosmin concentration in (A) a mapping population of F3 individual table beets grown in 2017 (green) and 2019 (purple) and (B) high (green) and low (purple) F3 individuals selected for genotyping. Mean geosmin concentration for low and high geosmin check plots are shown in gold and red font, respectively.

Table 1.

ANOVA for log transformed geosmin concentration of a segregating table beet population composed of 542 individual roots from 78 F2:3 table beet families, grown in 2017 and 2019

Source of Variation SS MS NumDF DenDF F value Pr(>F)
Year 1.25 1.25 1 109.8 6.29 0.01*
Family 120.39 1.56 77 104.4 7.88 <2e-16***
Year: Block 0.51 0.26 2 110.0 1.29 0.28
Year: Family 19.42 0.25 77 104.4 1.27 0.13
*, ***

Significant at P < 0.05 and 0.001, respectively

Plots of high geosmin check Chioggia’ were numerically but not significantly higher in mean geosmin concentration than those of low geosmin check ‘Touchstone Gold’ at 9.49 and 8.02 μg·kg-1, respectively, averaged over years. Mean geosmin concentration of the full population of F3 individuals, averaged over years, was not significantly different from that of high geosmin check ‘Chioggia’ (Welch T = 0.95 on 12.8 DF, P  0.36) nor low geosmin check Touchstone Gold’ (Welch T = −0.29 on 17.3 DF, P  0.78). It should be noted that while check populations are of the same cultivar as parental individuals, actual geosmin concentration of parental individuals is unknown.

High and low geosmin tails (n = 44 and n = 43, respectively) were selected to include F3 individuals with extreme-quartile geosmin concentration in the year of their production, as well as extreme-quartile F2:3 family mean geosmin concentration in both 2017 and 2019 (Supplementary Table S2). All selected individuals were from F2:3 families with two-year mean geosmin concentration in the most extreme pairwise comparison group (Supplementary Table S1). 24 F2:3 families were represented in the selected population, with a range of 1 to 6 individuals per family (Supplementary Table S3). High and low geosmin tails showed extremely significant differences in mean geosmin concentration, at 22.60 and 2.80 μg·kg-1 geosmin concentration, respectively (Welch T = 31.95 on 75.3 DF, P  2.2 × 10−16) (Figure 1). The high geosmin tail of the selected population was higher in geosmin concentration than high geosmin check ‘Chioggia’ (Welch T = 7.99 on 18.7 DF, P  1.93 × 10−07), and likewise, the low geosmin tail of the selected population was significantly lower in geosmin concentration than low geosmin check Touchstone Gold’ (Welch T = −8.89 on 17.3 DF, P  7.18 × 10−08).

Population structure

PCA was performed using genotypic data from the selected individuals at the 8,651 biallelic SNP markers used for association analysis. The percent of total genotypic variation explained by each dimension declined smoothly from a maximum of 18.4% variation, with no clear inflection point. PCA plots showed no obvious clustering by low or high geosmin tail. Some clustering by F2:3 family was evident, but family clusters were highly overlapping and distributed over many dimensions (Supplementary Figure S1). Moreover, the full mapping population appeared as a cloud of points along plots of all combinations of dimensions 1–4 (Supplementary Figure S2).

Families with particularly extreme geosmin concentration were represented by multiple individuals in the genotyped population, while others were included only once, so population structure could have been higher than in a population composed of one individual per F2:3 family. To investigate this possibility, a balanced subpopulation was created using only one individual from each of the 24 F2:3 families in the selected population. The first dimension explained 24.8% of genotypic variation—slightly more than in the full genotyped population—and percent variation declined more steeply between first and second dimensions, demonstrating marginally more population structure than the full genotyped population, rather than less (Supplementary Figure S3). Thus, population structure in the genotyped population was quite minimal and required no further correction than that provided by the relationship matrix (K).

Association analysis

Association analysis revealed a large portion of B. vulgaris ssp. vulgaris chromosome 8 to be significantly associated with geosmin concentration. Markers exceeding a −log10(p) = 4.0 significance threshold derived from 1000 permutations (α  =  0.05) were detected in 22 one-Mb windows, spanning all but the last 10 Mb of the 62 Mb chromosome (Figure 2). For windows with significant markers, the most significant marker was reported, along with its effect size (Table 2). QQ plots confirm association of many markers on chromosome 8 with geosmin concentration but show no evidence of association on other chromosomes (Supplementary Figure S4). Markers were tested for inclusion in a QTL model using backward regression, and two QTL—at EL10.2 positions 28,017,624 and 38,488,687—were found to explain 8.5% and 6.4% of total variation in geosmin concentration, respectively. Taken together, all significant markers explain 19.1% of variation in geosmin concentration.

Figure 2.

Figure 2

Manhattan plot for association analysis performed with respect to geosmin concentration on 8,651 SNP markers from a table beet population selected for extreme geosmin concentration (n = 87 F3 individuals) using −log10(p) = 4.0 significance threshold derived from 1000 permutations (α  =  0.05).

Table 2.

Markers with maximum significance for each 1 Mb window in which any significant markers were detected Association analysis was performed on a table beet population selected for extreme geosmin concentration (n = 87 F3 individuals) using 8651 SNP markers and −log10(p) = 4.0 significance threshold derived from 1000 permutations (α  =  0.05). Markers were tested for inclusion in a QTL model using backward regression, and percent variation in geosmin concentration explained (R2) and P values are shown.

Marker Chr EL10.2 Position Score Effect R 2 (%) P
8_1205464 8 1205464 10.37 6.8 0.0 0.93
8_4759765 8 4759765 11.03 7.3 0.0 1
8_5944761 8 5944761 10.93 7.3 0.0 1
8_8471698 8 8471698 11.21 7.3 0.4 0.57
8_10406017 8 10406017 12.23 8.0 0.0 1
8_13243064 8 13243064 12.44 8.2 0.0 1
8_15304025 8 15304025 13.36 8.2 0.0 1
8_18397052 8 18397052 13.31 8.7 0.1 0.78
8_20033066 8 20033066 12.75 8.4 0.0 1
8_22165325 8 22165325 13.48 8.8 0.0 1
8_23775128 8 23775128 12.75 8.4 0.0 1
8_25275567 8 25275567 12.59 8.2 2.3 0.15
8_28017624 8 28017624 14.64 8.7 8.5 5.45e-03**
8_35673306 8 35673306 12.99 8.4 0.0 0.91
8_38488687 8 38488687 13.52 8.7 6.4 1.64e-02*
8_40730112 8 40730112 13.42 8.5 0.0 1.00
8_42812663 8 42812663 11.6 7.6 0.0 0.88
8_45449861 8 45449861 12.78 8.4 0.5 0.49
8_47543727 8 47543727 10.87 7.0 0.0 0.97
8_50251664 8 50251664 5.76 −4.8 0.3 0.61
8_51746401 8 51746401 6.36 −5.6 0.5 0.52
8_53413075 8 53413075 7.43 −5.5 0.1 0.81
*, **

Significant at P < 0.05 and 0.01, respectively

For the 22 markers representing one-Mb windows in which significant markers were detected, mean effect size was 7.7 μg·kg-1 geosmin concentration, ranging in absolute value from 4.8 to 8.7 μg·kg-1 geosmin concentration (Table 2). Effect size was maximized at the two QTL identified for model inclusion, along with a third at EL10.2 position 18,397,052 that added only marginal explanatory value.

The validity of this analysis was supported by separate association analyses on the F3 individuals comprising the low (n = 43) and high (n = 44) geosmin tails of the selected population using the same set of 8651 markers. These analyses function as negative controls, as no significant marker-trait association should be detected when individuals from only one tail of the geosmin concentration distribution are analyzed. Population structure of the entire genotyped population and each single-tail subpopulation was similar, confirming that any population structure shown in the entire genotyped population was unrelated to geosmin concentration. (Supplementary Figure S3). For both high and low geosmin tail populations, no markers exceeded the α  =  0.05 permutation-derived significance threshold (Supplementary Figure S5). Individual-chromosome QQ plots associated with these analyses show mostly depressed Observed −log10(p) values at the upper end of the Expected −log10(p) value distribution, rather than amplified values that would indicate significant marker-trait association as in the main analysis (Supplementary Figure S6). Thus, support accrues for the result of the main association analysis: that two QTL on chromosome 8 are truly associated with geosmin concentration in table beet.

Discussion

This analysis, the first molecular marker-based genetic mapping in table beet, offers strong evidence of QTL for geosmin concentration on B. vulgaris ssp. vulgaris chromosome 8. By establishing a genomic location associated with geosmin—the compound that imbues table beet with its distinctive earthy aroma—this study links evidence of endogenous production of geosmin in table beet (Maher and Goldman 2018) with evidence of genetic control of this trait (Freidig and Goldman 2014; Maher and Goldman 2017; Hanson and Goldman 2019). As such, it provides a starting point for fine mapping or direct gene identification strategies that could identify a functional gene responsible for geosmin biosynthesis in table beet.

While this analysis detected significant markers in 22 1 Mb windows spanning much of chromosome 8, only two QTL explained most of the observed variation in geosmin concentration. That is, significant markers excluded from the QTL model may be physically linked to the major QTL due to the large recombination blocks inherent in this F2:3 mapping population. The chromosomal positions of the major QTL should be considered in context of the repetitive nature of the B. vulgaris ssp. vulgaris genome and the imperfect alignment of the reference genome with reads from the mapping population. The beet genome shows substantial presence-absence variation (Galewski and Eujayl 2021) and is highly repetitive (Flavell 1974; Dohm et al. 2014), with the nature and size of inserted, deleted, and repeated regions varying among individuals and genotypes. Moreover, table beet is the most genetically distinct crop type within B. vulgaris ssp. vulgaris, harboring more lineage-specific variation than other B. vulgaris ssp. vulgaris crop types (Galewski and McGrath 2020). While the EL10.2 genome was constructed from long reads and used several technologies to increase the probability of correct sequencing in repetitive regions, it is entirely plausible for repetitive regions in this table beet mapping population to be translocated, inverted, multiplied, or deleted with respect to the EL10.2 sugar beet reference genome. Due to imperfect alignment, large recombination bin size, and decreased effectiveness of alignment algorithms in highly repetitive regions (Treangen and Salzberg 2011), these QTL positions should be interpreted cautiously.

While this mapping population does not lend itself to precise QTL positioning, it is clear that QTL for geosmin concentration are located on B. vulgaris ssp. vulgaris chromosome 8. Interestingly, chromosome 8 was also found to harbor two adjacent hypothetical proteins with predicted terpenoid synthase function (BVRB8g184050 and BVRB8g184060) showing more than 20% identity with the S. coelicolor geosmin synthase protein sequence. In the EL10.1 genome, the nucleotide sequence of BVRB8g184050 encompasses two genes (EL10Ac8g20305 and EL10Ac8g20306), both of which encode alpha-farnesene synthase 1 (AFS1) proteins (Phytozome 2021). The sequence of BVRB8g184060 co-locates with much of EL10Ac8g20304 and is adjacent to EL10Ac8g20302, both of which code for S-linalool synthases (Phytozome 2021). All four EL10.1 genes are functionally annotated as terpenoid cyclases, and three of four genes (EL10Ac8g20302, EL10Ac8g20304, and EL10Ac8g20305) include sequences annotated as terpene synthase metal-binding domains. Biosynthesis of geosmin in species of actinobacteria, cyanobacteria, and fungi is driven by a bifunctional Mg2+-binding sesquiterpene synthase enzyme (Jiang et al. 2006; Giglio et al. 2008; Churro et al. 2020), so the functional homology of these putative B. vulgaris ssp. vulgaris terpenoid synthases is worth noting. However, terpene synthase enzymes are ubiquitous within plant genomes (Singh and Sharma 2015), and it cannot be assumed that these genes code for functional geosmin synthases.

Taken together, these putative terpenoid synthase genes make up a region approximately 123 kb long, located about 5 Mb from the end of EL10.1 chromosome 8, consistent with the phenomenon of clustered plant biosynthetic genes (Huang et al. 2017). Both QTL for geosmin concentration are located near the center of EL10.2 chromosome 8, at approximate positions of 28 and 38 Mb. Estimation of the distance between QTL and this cluster of terpenoid synthase genes is complicated by the fact the orientation of EL10.2 chromosomes is not established—either with respect to EL10.1 or the cytogenetic map—and the size of chromosome 8 differs between EL10.1 and EL10.2 (McGrath et al. 2020). Thus, while QTL for geosmin concentration do not obviously co-locate with terpenoid synthase genes showing functional similarity to S. coelicolor geosmin synthase, it is notable that both elements are located on chromosome 8.

While this study detected two major QTL for geosmin concentration on B. vulgaris ssp. vulgaris chromosome 8, the distribution and variance of geosmin concentration in the full mapping population was consistent with that of a quantitative trait. Geosmin concentration exhibited continuous distribution ranging from 0.77 to 41.60 μg·kg-1geosmin, along with right skew, demonstrating the established phenomenon that variance of geosmin concentration tends to increase with concentration itself (Freidig and Goldman 2014; Maher and Goldman 2017; Hanson and Goldman 2019). The full population was created by crossing two parental individuals with moderate and unknown geosmin concentration, and the resulting transgressive segregation—in which a segregating population shows more extreme phenotypes than those of either parent—could owe to a combination of additive, dominance, and epistatic effects. If QTL for geosmin concentration does code for geosmin synthase enzymes, the observed continuous variation in geosmin concentration could owe to the presence of multiple loci harboring geosmin synthase genes, per an additive effects model (Lynch and Walsh 1998). In addition, multiple geosmin synthase alleles could exist, the functionality of which could differ if mutations to functional domains altered their affinity for Mg2+ cofactors (Harris et al. 2015), sesquiterpene substrates, or both. If a B. vulgaris ssp. vulgaris geosmin synthase gene complex included transcription factor binding genes like that of M. asticus (Churro et al. 2020), mutations in these genes could alter geosmin synthase activity. Due to the ubiquity of C5 and C15 compounds within plant tissues (Singh and Sharma 2015), geosmin biosynthesis seems unlikely to be affected by substrate availability.

It is also possible that some QTL for geosmin concentration could have gone undetected in this analysis. In selective genotyping experiments, power to detect QTL is influenced by the overall population size, the proportion of individuals selected for genotyping, and the effect size of QTL associated with the trait (Navabi et al. 2009; Sun et al. 2010). In the present experiment, a 16% selection proportion from a population of 542 individuals allowed detection of QTL explaining 8.5% and 6.4% of variation in geosmin concentration. These effect sizes are consistent with those from simulation studies using similar selected proportions and population sizes (Navabi et al. 2009; Sun et al. 2010). However, larger overall population size and possibly larger selected proportion would have been needed to detect smaller-effect QTL. This analysis did detect some small-effect QTL—such as the marker at EL10.2 position 25,275,567 that explained 2.3% of variation in geosmin concentration—but it may have lacked the power for such QTL to be included in a regression model. In addition, since this population’s parents were from open pollinated cultivars rather than inbred lines, and since ‘Touchstone Gold’ and ‘Chioggia’ are not extremely different in geosmin concentration, it is likely that this mapping population did not segregate at all loci related to geosmin concentration in table beet.

This study detected two QTL with estimated effect size 8.7 μg·kg-1 geosmin concentration, but numerical estimates of effect sizes should be viewed as approximate. It is well known that the statistical methods used for association analysis can cause overestimation of QTL effects, especially in relatively small population sizes like that used in this experiment (Beavis 1998; Xu 2003). In addition, Darvasi and Soller (1992) caution that QTL effects are overestimated when a selective genotyping approach is used. Lee (2005) shows that unidirectional selective genotyping for marker-based analysis can reduce accuracy of QTL detection and produce downward bias in QTL effects estimation, especially when selection is made within full-sib families. While Manichaikul et al. (2007) found that QTL effect sizes are not inflated under bidirectional selection, Navabi et al. (2009) demonstrate slight under- and overestimation of QTL effect size using unidirectional and bidirectional selection. Given the inconsistency of simulation results, the fact that this experiment selected individuals both among and within F2:3 families, and the fact that individual phenotypes were not replicated, estimated QTL effect sizes should be interpreted with caution. However, to the degree that effect sizes of 8.7 μg·kg-1 geosmin are accurate, they would be commensurate with the approximate annual change in population mean geosmin concentration under recurrent selection for that trait; preliminary taste tests are inconclusive with respect to whether this magnitude of difference in geosmin concentration is perceptible to consumers (Maher and Goldman 2017). Because this experiment used unphased sequence data, it cannot be known whether alternate alleles originated from the ‘Touchstone Gold’ or Chioggia’ parent.

To validate the association of chromosome 8 QTL with geosmin concentration in table beet, and to more precisely locate QTL, fine mapping could be carried out with either a linkage or association approach. A linkage approach could increase power to detect QTL by creating a segregating F2 population from a cross of two parents with widely divergent geosmin concentration, which are now available due to the recurrent selection work of Maher and Goldman (2017) and participatory cultivar development work of Hanson (2020). Parental roots could be genotyped to facilitate haplotype phasing, and F2 individuals could be genotyped and paired with mean F2:3 family phenotypes to reduce phenotypic error variance. Such a population would facilitate composite interval mapping, which should yield a much smaller QTL region than did this single marker analysis. An association approach incorporating accessions with diverse recombination histories and widely variable geosmin concentration could also be used for fine mapping of loci controlling geosmin concentration in table beet.

Direct gene identification could also be pursued via either allele frequency analysis or a candidate gene strategy. Allele frequency analysis using bulked sequence data is particularly effective in B. vulgaris ssp. vulgaris due to its genetic heterogeneity and the fact that phenotypes are both changed and measured on a population mean basis (Ries et al. 2016; Tränkner et al. 2016; Galewski and Mcgrath 2020). This trait-based selective genotyping strategy could be applied to any population yielding two pools with extreme geosmin concentration, including those created for linkage mapping or GWAS. Alternatively, candidate geosmin synthase genes—perhaps including those identified by Maher (2017) and additional chromosome 8 sequences encoding predicted terpene synthases—could be validated against sequence data from extreme-geosmin individuals. Finally, RNA-seq could be used to identify candidate genes with differential expression levels between high and low geosmin genotypes, or between epidermal and parenchymal tissues of the same root.

This study was the first molecular marker-based genetic mapping experiment in table beet, and it succeeded in detecting two QTL for geosmin concentration in table beet. This finding adds to the body of evidence documenting the genetic basis of geosmin concentration in table beet, and it provides a starting point for more thorough investigation of chromosome 8 with fine mapping or direct gene identification strategies. If successful, such strategies might locate one or more genes encoding B. vulgaris ssp. vulgaris geosmin synthase, allowing development of molecular markers for this trait. While more research is warranted into consumer preference for geosmin concentration—and more broadly, into consumer flavor perception and hedonic liking for table beet—manipulation of geosmin concentration in table beet offers one way to adjust the physicochemical composition of table beet and therefore, plausibly, its perceived flavor profile, consumer acceptance, and market potential.

Data availability

Code and raw data are available via figshare at https://doi.org/10.25387/g3.16649746 (Accessed: 2021 October 1). Supplementary File S1 contains R code used to analyze phenotypic data found in Supplementary Files S2 and S3. Supplementary File S4 contains R code used for association analysis and PCA. Association analysis uses phenotypic data found in Supplementary File S5, and PCA uses categorical data in Supplementary File S6. Both analyses use the filtered marker data set in Supplementary File S7. Unfiltered genomic data in Variant Call Format are found in Supplementary File S8. Supplementary Files S7 and S8 include original EL10.2 scaffold names that were later renamed according to Butterfass nomenclature. Aligned population sequence data is available in the NCBI SRA database at https://www.ncbi.nlm.nih.gov/sra/?term=PRJNA764531 (Accessed: 2021 October 1).

Acknowledgments

The authors’ thanks go to: Nicholas Keuler of the University of Wisconsin—Madison Department of Statistics for assistance with phenotypic data analysis; Dean Sanders of the University of Wisconsin—Madison Bioinformatics Resource Center for assistance with genome alignment and variant calling; and the University of Wisconsin—Madison Biotechnology for genome sequencing services.

Funding

This study was funded by United States Department of Agriculture Hatch Grant number WIS02032.

Conflicts of interest

The authors declare that there is no conflict of interest.

Literature cited

  1. Adetunji I, Willems G, Tschoep H, Bürkholz A, Barnes S, et al. 2014. Genetic diversity and linkage disequilibrium analysis in elite sugar beet breeding lines and wild beet accessions. Theor Appl Genet. 127:559–571. doi:10.1007/s00122-013-2239-x. [DOI] [PubMed] [Google Scholar]
  2. Andrello M, Henry K, Devaux P, Verdelet D, Desprez B, et al. 2017. Insights into the genetic relationships among plants of Beta section Beta using SNP markers. Theor Appl Genet. 130:1857–1866. doi:10.1007/s00122-017-2929-x. [DOI] [PubMed] [Google Scholar]
  3. Arumuganathan K, Earle ED.. 1991. Nuclear DNA content of some important plant species. Plant Mol Biol Report. 9:208–218. doi:10.1007/BF02672069. [Google Scholar]
  4. Bach V, Mikkelsen L, Kidmose U, Edelenbos M.. 2014. Culinary preparation of beetroot (Beta vulgaris L.): the impact on sensory quality and appropriateness. J Sci Food Agric. 95:1852–1859. doi:10.1002/jsfa.6886. [DOI] [PubMed] [Google Scholar]
  5. Barzen E, Mechelke W, Ritter E, Seitzer JF, Salamini F.. 1992. RFLP markers for sugar beet breeding: chromosomal linkage maps and location of major genes for rhizomania resistance, monogermy and hypocotyl colour. Plant J. 2:601–611. doi:10.1111/j.1365-313X.1992.00601.x. [Google Scholar]
  6. Bavec M, Turinek M, Grobelnik-Mlakar S., Slatnar A., Bavec F.. 2010. Influence of industrial and alternative farming systems on contents of sugars, organic acids, total phenolic content, and the antioxidant activity of red beet (Beta vulgaris L. ssp. vulgaris Rote Kugel). J Agric Food Chem. 58:11825–11831. [DOI] [PubMed] [Google Scholar]
  7. Beavis WD. 1998. QTL analyses: power, precision, and accuracy. In: Paterson AH, editor. Molecular Dissection of Complex Traits. New York, NY: CRC Press. p. 145–162. [Google Scholar]
  8. Brouwer B, Colley M.. 2016. Pacific Northwest Organic Plant Breeding Assessment of Needs, 2016. Port Townsend, WA: Organic Seed Alliance.
  9. Butterfass T. 1964. Die Chloroplastenzahlen in verschiedenartigen Zellen trisomer Zuckerrüben (Beta vulgaris). Z Bot. 52:46–77. [DOI] [PubMed] [Google Scholar]
  10. Buttery RG, Guadagni DG, Ling LC.. 1976. Geosmin, a musty off-flavor of dry beans. J Agric Food Chem. 24:419–420. doi:10.1021/jf60204a050. [Google Scholar]
  11. Cane DE, Watt RM.. 2003. Expression and mechanistic analysis of a germacradienol synthase from Streptomyces coelicolor implicated in geosmin biosynthesis. Proc Natl Acad Sci USA. 100:1547–1551. doi:10.1073/pnas.0337625100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Churchill GA, Doerge RW.. 1994. Empirical threshold values for quantitative trait mapping. Genetics. 138:963–971. doi:10.1007/s11703-007-0022-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Churro C, Semedo-Aguiar AP, Silva AD, Pereira-Leal JB, Leite RB.. 2020. A novel cyanobacterial geosmin producer, revising GeoA distribution and dispersion patterns in Bacteria. Sci Rep. 10:1–19. doi:10.1038/s41598-020-64774-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, et al. ; 1000 Genomes Project Analysis Group. 2011. The variant call format and VCF tools. Bioinformatics. 27:2156–2158. doi:10.1093/bioinformatics/btr330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Darriet P, Pons M, Lamy S, Dubourdieu D.. 2000. Identification and Quantification of Geosmin, an Earthy Odorant Contaminating Wines. J Agric Food Chem. 48:4835–4838. [DOI] [PubMed] [Google Scholar]
  16. Darvasi A, Soller M.. 1992. Selective genotyping for determination of linkage between a marker locus and a quantitative trait locus. Theor Appl Genet. 85:353–359. doi:10.1007/BF00222881. [DOI] [PubMed] [Google Scholar]
  17. Dawson JC, Healy GK.. 2018. Flavor Evaluation for Plant Breeders In: Goldman IL, editor. Plant Breeding Reviews, Vol 41. 1st ed.New York, NY: John Wiley & Sons, Inc. p. 215–262. [Google Scholar]
  18. Dohm JC, Minoche AE, Holtgräwe D, Capella-Gutiérrez S, Zakrzewski F, et al. 2014. The genome of the recently domesticated crop plant sugar beet (Beta vulgaris). Nature. 505:546–549. doi:10.1038/nature12817. [DOI] [PubMed] [Google Scholar]
  19. Bates D, Mächler M, Bolker B, Walker S.. 2015. Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software. 67:1–48. doi:10.18637/jss.v067.i01. [Google Scholar]
  20. El-Mezawy A, Dreyer F, Jacobs G, Jung C.. 2002. High-resolution mapping of the bolting gene B of sugar beet. Theor Appl Genet. 105:100–105. doi:10.1007/s00122-001-0859-z. [DOI] [PubMed] [Google Scholar]
  21. Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, et al. 2011. A Robust, Simple Genotyping-by-Sequencing (GBS) approach for high diversity species. PLoS One. 6:e19379.doi:10.1371/journal.pone.0019379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Feng Z, Huber U, Boland W.. 1993. Biosynthesis of the irregular C12-terpenoid dehydrogeosmin in flower heads of Rebutia marsoneri Werd. (Cactaceae). Helv Chim Acta. 76:2547–2552. doi:10.1002/hlca.19930760714. [Google Scholar]
  23. Flavell RB, Bennett MD, Smith JB, Smith DB.. 1974. Genome size and the proportion of repeated nucleotide sequence DNA in plants. Biochem. Genet. 12:257–269. [DOI] [PubMed] [Google Scholar]
  24. Flint-Garcia SA, Thornsberry JM, Buckler ES.. 2003. Structure of linkage disequilibrium in plants. Annu Rev Plant Biol. 54:357–374. [DOI] [PubMed] [Google Scholar]
  25. Freidig AK, Goldman IL.. 2011. Variation in oxalic acid content among commercial table beet cultivars and related crops. J Am Soc Hortic Sci. 136:54–60. [Google Scholar]
  26. Freidig AK, Goldman IL.. 2014. Geosmin production associated with Beta vulgaris ssp. vulgaris is cultivar specific. J Agric Food Chem. 62:2031–2036. [DOI] [PubMed] [Google Scholar]
  27. Frisvad JC, Samson RA, Rassing BR, Van Der Horst MI, Van Rijn FTJ, et al. 1997. Penicillium discolor, a new species from cheese, nuts and vegetables. Antonie van Leeuwenhoek. Int J Gen Mol Microbiol. 72:119–126. doi:10.1023/A:1000244502608. [DOI] [PubMed] [Google Scholar]
  28. Funk A, Galewski P, McGrath JM.. 2018. Nucleotide-binding resistance gene signatures in sugar beet, insights from a new reference genome. Plant J. 95:659–671. doi:10.1111/tpj.13977. [DOI] [PubMed] [Google Scholar]
  29. Galewski PJ, Eujayl I.. 2021. A roadmap to durable BCTV resistance using long-read genome assembly of genetic Stock KDH13. Plant Mol Biol Report. doi:10.1007/s11105-021-01307-5. [Google Scholar]
  30. Galewski PJ, McGrathJM.. 2020. Genetic diversity among cultivated beets (Beta vulgaris) assessed via population-based whole genome sequences. BMC Genomics. 21:1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Gallais A, Moreau L, Charcosset A.. 2007. Detection of marker-QTL associations by studying change in marker frequencies with selection. Theor Appl Genet. 114:669–681. doi:10.1007/s00122-006-0467-z. [DOI] [PubMed] [Google Scholar]
  32. Gerber NN. 1967. Geosmin, an earthy- smelling substance isolated from actinomycetes. Biotechnol Bioeng. 9:321–327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Giglio S, Jiang J, Saint CP, Cane DE, Monis PT.. 2008. Isolation and characterization of the gene associated with geosmin production in Cyanobacteria RID B-8539-2011. Environ Sci Technol. 42:8027–8032. doi:10.1021/es801465w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Glaubitz JC, Casstevens TM, Lu F, Harriman J, Elshire RJ, et al. 2014. TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS One. 9:e90346. doi: 10.1371/journal.pone.0090346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Goldman IL, Navazio JP.. 2003. History and breeding of table beet in the United States. In: Janick J, editor. Plant Breeding Reviews, Vol. 22. New York, NY: John Wiley and Sons. p. 357–388. [Google Scholar]
  36. Goldman IL, Navazio JP.. 2008. Table Beet. In: Prohens J, Nuez F, editors, Handbook of Plant Breeding, Vol 1. New York, NY: Springer. doi:10.1007/978-0-387-30443-4_7. [Google Scholar]
  37. Hanson SJ. 2020. Geosmin Concentration in Table Beet: Genetic Characterization and Participatory Cultivar Development. The University of Wisconsin-Madison. 256 pages.
  38. Hanson SJ, Goldman IL.. 2019. Genotype is primarily responsible for variance in table beet geosmin concentration, but complex genotype environment interactions influence variance in total dissolved solids. J Am Soc Hortic Sci. 144:429–438. doi:10.21273/JASHS04758-19. [Google Scholar]
  39. Harris GG, Lombardi PM, Pemberton TA, Matsui T, Weiss TM, et al. 2015. Structural studies of geosmin synthase, a bifunctional sesquiterpene synthase with αα domain architecture that catalyzes a unique cyclization-fragmentation reaction sequence. Biochemistry. 54:7142–7155. doi:10.1021/acs.biochem.5b01143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Huang AC, Kautsar SA, Hong YA, Medema MH, Bond AD, et al. 2017. Unearthing a sesterterpene biosynthetic repertoire in the Brassicaceae through genome mining reveals convergent evolution. Proc Natl Acad Sci USA. 114:E6005–E6014. doi:10.1073/pnas.1705567114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Hultengren RL, Glos M, Mazourek M.. 2016. Breeding, Research, and Education Needs Assessment for Organic Vegetable Growers in The Northeast. Ithaca, NY: Cornell University.
  42. Jiang H, Lei R, Ding S, Zhu S.. 2014. Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads. BMC Bioinformatics. 15:182. doi:10.1186/1471-2105-15-182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Jiang J, He X, Cane DE.. 2006. Geosmin biosynthesis. Streptomyces coelicolor germacradienol/germacrene D synthase converts farnesyl diphosphate to geosmin. J Am Chem. Soc. 128:8128–8129. doi:10.1021/ja062669x. [DOI] [PubMed] [Google Scholar]
  44. Korth KL, Doege SJ, Park SH, Goggin FL, Wang Q, et al. 2006. Medicago truncatula mutants demonstrate the role of plant calcium oxalate crystals as an effective defense against chewing insects. Plant Physiol. 141:188–195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Kujala TS, Vienola MS, Klika KD, Loponen JM, Pihlaja K.. 2002. Betalain and phenolic compositions of four beetroot (Beta vulgaris) cultivars. Eur Food Res Technol. 214:505–510. [Google Scholar]
  46. Kuznetsova A, Brockhoff PB, Christensen RHB.. 2017. lmerTest Package: tests in linear mixed effects models. J Stat Soft. 82:1–26. doi:10.18637/jss.v082.i13. [Google Scholar]
  47. Lander ES, Botstein D.. 1989. Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics. 121:185–199. doi:10.1093/genetics/121.1.185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Langmead B, Salzberg S., , , 2012. Fast gapped-read alignment with Bowtie 2. Nature Methods. 9:357–359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Lee C. 2005. Selection bias in quantitative trait loci mapping. J Hered. 96:363–367. doi:10.1093/jhered/esi062. [DOI] [PubMed] [Google Scholar]
  50. Lenth R. 2020. emmeans: Estimated Marginal Means, aka Least-Squares Means. R package version 1.4.8. https://CRAN.R-project.org/package=emmeans (Accessed: 2021 October 1).
  51. Letschert JPW 1993. Beta section Beta: biogeographical patterns of variation, and taxonomy. Wagening Agric Univ Pap. 93:1–155. [Google Scholar]
  52. Liato V, Aïder M.. 2017. Geosmin as a source of the earthy-musty smell in fruits, vegetables and water: Origins, impact on foods and water, and review of the removing techniques. Chemosphere. 181:9–18. doi:10.1016/j.chemosphere.2017.04.039. [DOI] [PubMed] [Google Scholar]
  53. Lu G, Edwards CG, Fellman JK, Mattinson DS, Navazio J.. 2003a. Biosynthetic origin of geosmin in red beets (Beta vulgaris L.). J Agric Food Chem. 51:1026–1029. doi:10.1021/jf020905r. [DOI] [PubMed] [Google Scholar]
  54. Lu G, Fellman J, Edwards C, Mattinson S, Navazio J.. 2003b. Quantitative determination of geosmin in red beets (Beta vulgaris L.) using headspace solid-phase microextraction. J Agric Food Chem. 51:1021–1025. [DOI] [PubMed] [Google Scholar]
  55. Lynch M, Walsh B.. 1998. Genetics and Analysis of Quantitative Traits. Sunderland, MA: Sinauer Associates, Inc. [Google Scholar]
  56. Lyon A, Silva E, Zystro J, Bell M.. 2015. Seed and plant breeding for Wisconsin’s organic vegetable sector: understanding farmers’ needs. Agroecol Sustain Food Syst. 39:601–624. doi:10.1080/21683565.2015.1017786. [Google Scholar]
  57. Maher L. 2017. Bidirectional recurrent selection and endogenous production of geosmin in table beet. [dissertation]. Madison, WI: University of Wisconsin.
  58. Maher L, Goldman IL.. 2017. Bidirectional recurrent half-sib family selection for geosmin concentration in table beet. Crop Sci. 57:2564–2574. doi:10.2135/cropsci2017.02.0111. [Google Scholar]
  59. Maher L, Goldman IL.. 2018. Endogenous production of geosmin in table beet. HortScience. 53:67–72. doi:10.21273/HORTSCI12488-17. [Google Scholar]
  60. Mangin B, Sandron F, Henry K, Devaux B, Willems G, et al. 2015. Breeding patterns and cultivated beets origins by genetic diversity and linkage disequilibrium analyses. Theor Appl Genet. 128:2255–2271. doi:10.1007/s00122-015-2582-1. [DOI] [PubMed] [Google Scholar]
  61. Manichaikul A, Palmer AA, Sen Ś, Broman KW.. 2007. Significance thresholds for quantitative trait locus mapping under selective genotyping. Genetics. 177:1963–1966. doi:10.1534/genetics.107.080093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. McGrath JM, Funk A, Galewski P, Ou S, Townsend B, et al. 2020. A contiguous de novo genome assembly of sugar beet EL10 (Beta vulgaris L.). BioRxiv. 10.1101/2020.09.15.298315 (Accessed: 2021 October 1). [DOI] [PMC free article] [PubMed]
  63. McGrath JM, Trebbi D, Fenwick A, Panella L, Schultz B, et al. 2007. An open source first generation molecular genetic map from a sugarbeet x table beet cross and its extension to physical mapping. Plant Genome J. 1:S27. [Google Scholar]
  64. Mikołajczyk-Bator K, Błaszczyk A, Czyżniejewski M, Kachlicki P.. 2016. Characterisation and identification of triterpene saponins in the roots of red beets (Beta vulgaris L.) using two HPLC–MS systems. Food Chem. 192:979–990. doi:10.1016/j.foodchem.2015.07.111. [DOI] [PubMed] [Google Scholar]
  65. Murray KE, Bannister PA, Buttery RG.. 1975. Geosmin: an important volatile constituent of beetroot (Beta vulgaris). Chem Ind. 26:973. [Google Scholar]
  66. Navabi A, Mather DE, Bernier J, Spaner DM, Atlin GN.. 2009. QTL detection with bidirectional and unidirectional selective genotyping: marker-based and trait-based analyses. Theor Appl Genet. 118:347–358. doi:10.1007/s00122-008-0904-2. [DOI] [PubMed] [Google Scholar]
  67. Paesold S, Borchardt D, Schmidt T, Dechyeva D.. 2012. A sugar beet (Beta vulgaris L.) reference FISH karyotype for chromosome and chromosome-arm identification, integration of genetic linkage groups and analysis of major repeat family distribution. Plant J. 72:600–611. doi:10.1111/j.1365-313X.2012.05102.x. [DOI] [PubMed] [Google Scholar]
  68. Phytozome 13: Beta vulgaris EL10_1.0. c2016-2021 2021. The Regents of the University of California: United States Department of Energy. https://phytozome-next.jgi.doe.gov/info/Bvulgaris_EL10_1_0 (Accessed: 2021 October 1).
  69. Pillen K, Sleinrücken G, Herrmann RG, Jung C.. 1993. An extended linkage map of sugar beet (Beta vulgaris L.) including nine putative lethal genes and the restorer Gene X. Plant Breed. 111:265–272. doi:10.1111/j.1439-0523.1993.tb00641.x. [Google Scholar]
  70. Pillen K, Steinrücken G, Wricke G, Herrmann RG, Jung C.. 1992. A linkage map of sugar beet (Beta vulgaris L.). Theor Appl Genet. 84:129–135. [DOI] [PubMed] [Google Scholar]
  71. R Core Team. 2021. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.
  72. Rabier CE. 2014. On statistical inference for selective genotyping. J Stat Plan Inference. 147:24–52. doi:10.1016/j.jspi.2013.11.010. [Google Scholar]
  73. Rajabi A, Borchardt D.. 2015. QTL mapping for root yield and leaf traits in sugar beet (Beta vulgaris L.) under drought stress condition. Iran J Crop Sci. 17:Pe46–Pe62. [Google Scholar]
  74. Ries D, Holtgräwe D, Viehöver P, Weisshaar B.. 2016. Rapid gene identification in sugar beet using deep sequencing of DNA from phenotypic pools selected from breeding panels. BMC Genomics. 17:1–13. doi:10.1186/s12864-016-2566-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Rogers AR, Huff C.. 2009. Linkage disequilibrium between loci with unknown phase. Genetics. 182:839–844. doi:10.1534/genetics.108.093153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Rosyara UR, De Jong WS, Douches DS, Endelman JB.. 2016. Software for genome-wide association studies in autopolyploids and its application to potato. Plant Genome. 9. doi:10.3835/plantgenome2015.08.0073. [DOI] [PubMed] [Google Scholar]
  77. Salinas ML, Ogura T, Soffchi L.. 2001. Irritant contact dermatitis caused by needle-like calcium oxalate crystals, raphides, in Agave tequilana among workers in tequila distilleries and agave plantations. Contact Dermatitis. 44:94–96. [DOI] [PubMed] [Google Scholar]
  78. Schlumpberger BO, Jux A, Kunert M, Boland W, Wittmann D.. 2004. Musty-earthy scent in cactus flowers: characteristics of floral scent production in dehydrogeosmin-producing cacti. Int J Plant Sci. 165:1007–1015. doi:10.1086/423878. [Google Scholar]
  79. Schondelmaier J, Jung C.. 1997. Chromosomal assignment of the nine linkage groups of sugar beet (Beta vulgaris L.) using primary trisomics. Theor Appl Genet. 95:590–596. [Google Scholar]
  80. Singh B, Sharma RA.. 2015. Plant terpenes: defense responses, phylogenetic analysis, regulation and clinical applications. 3 Biotech. 5:129–151. doi:10.1007/s13205-014-0220-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Spiteller D, Jux A, Piel J, Boland W.. 2002. Feeding of 5,5-H-2(2) -1-desoxy-D-xylulose and 4,4,6,6,6-H-2(5) -mevalolactone to a geosmin-producing Streptomyces sp and Fossombronia pusilla. Phytochemistry. 61:827–834. [DOI] [PubMed] [Google Scholar]
  82. Stuber CW, Moll RH, Goodman MM, Schaffer H, Weir BS.. 1980. Allozyme frequency changes associated with selection for increased grain yield in maize. Genetics. 95:225–236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Sun Y, Wang J, Crouch JH, Xu Y.. 2010. Efficiency of selective genotyping for genetic analysis of complex traits and potential applications in crop improvement. Mol. Breed. 26:493–511. doi:10.1007/s11032-010-9390-8. [Google Scholar]
  84. Tränkner C, Lemnian IM, Emrani N, Pfeiffer N, Tiwari SP, et al. 2016. A detailed analysis of the BR1 locus suggests a new mechanism for bolting after winter in sugar beet (Beta vulgaris L). Front Plant Sci. 7:1662. doi:10.3389/fpls.2016.0 1662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Treangen TJ, Salzberg SL.. 2011. Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat Rev Genet. 13:36–46. doi:10.1038/nrg3117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Tyler LD, Acree TE, Becker RF, Nelson RR, Butts RM.. 1978. Effect of maturity, cultivar, field history, and the operations of peeling and coring on the geosmin content. Am Chem Soc. 26:1466–1469. doi:10.1021/jf60220a049. [Google Scholar]
  87. Tyler LD, Acree TE, Smith NL.. 1979. Sensory evaluation of geosmin in juice made from cooked beets. J Food Sci. 44:79–81. doi:10.1111/j.1365-2621.1979.tb10009.x. [Google Scholar]
  88. USDA NASS. 2020. Quick Stats: United States Department of Agriculture National Agricultural Statistics Servic. https://quickstats.nass.usda.gov/ (Accessed: 2021 October 1).
  89. Weber WE, Borchardt DC, Koch G.. 1999. Combined linkage maps and QTLs in sugar beet (Beta vulgaris L.) from different populations. Plant Breed. 118:193–204. doi:10.1111/j.1439-0523.1999.tb01513.x. [Google Scholar]
  90. Weber WE, Borchardt DC, Koch G.. 2000. Marker analysis for quantitative traits in sugar beet. Plant Breed. 119:97–106. doi:10.1046/j.1439-0523.2000.00482.x. [Google Scholar]
  91. Wickham H. 2016. ggplot2: Elegant Graphics for Data Analysis. New York, NY: Springer-Verlag. [Google Scholar]
  92. Würschum T, Maurer HP, Kraft T, Janssen G, Nilsson C, et al. 2011. Genome-wide association mapping of agronomic traits in sugar beet. Theor Appl Genet. 123:1121–1131. doi:10.1007/s00122-011-1653-1. [DOI] [PubMed] [Google Scholar]
  93. Xu S. 2003. Theoretical basis of the beavis effect. Genetics. 165:2259–2268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Yang J, Zaitlen N, Goddard M, Visscher P, Price A.. 2014. Advantages and pitfalls in the application of mixed-model association methods. Nat Genet. 46:100–106. doi:10.1038/ng.2876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Zou C, Wang P, Xu Y.. 2016. Bulked sample analysis in genetics, genomics and crop improvement. Plant Biotechnol J. 14:1941–1955. doi:10.1111/pbi.12559. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Code and raw data are available via figshare at https://doi.org/10.25387/g3.16649746 (Accessed: 2021 October 1). Supplementary File S1 contains R code used to analyze phenotypic data found in Supplementary Files S2 and S3. Supplementary File S4 contains R code used for association analysis and PCA. Association analysis uses phenotypic data found in Supplementary File S5, and PCA uses categorical data in Supplementary File S6. Both analyses use the filtered marker data set in Supplementary File S7. Unfiltered genomic data in Variant Call Format are found in Supplementary File S8. Supplementary Files S7 and S8 include original EL10.2 scaffold names that were later renamed according to Butterfass nomenclature. Aligned population sequence data is available in the NCBI SRA database at https://www.ncbi.nlm.nih.gov/sra/?term=PRJNA764531 (Accessed: 2021 October 1).


Articles from G3: Genes|Genomes|Genetics are provided here courtesy of Oxford University Press

RESOURCES